<<

remote sensing

Article Passive MIMO Radar Detection with Unknown Colored Gaussian

Yongjun Liu 1,*, Guisheng Liao 1, Haichuan Li 1, Shengqi Zhu 1, Yachao Li 1 and Yingzeng Yin 2

1 National Laboratory of Radar Signal Processing, Xidian University, Xi’an 710071, China; [email protected] (G.L.); [email protected] (H.L.); [email protected] (S.Z.); [email protected] (Y.L.) 2 National Key Laboratory of Antennas and Microwave Technology, Xidian University, Xi’an 710071, China; [email protected] * Correspondence: [email protected]

Abstract: The target detection of the passive multiple-input multiple-output (MIMO) radar that is comprised of multiple illuminators of opportunity and multiple receivers is investigated in this paper. In the passive MIMO radar, the transmitted signals of illuminators of opportunity are totally unknown, and the received signals are contaminated by the colored Gaussian noise with an unknown covariance matrix. The generalized likelihood ratio test (GLRT) is explored for the passive MIMO radar when the channel coefficients are also unknown, and the closed-form GLRT is derived. Compared with the GLRT with unknown transmitted signals and channel coefficients but a known covariance matrix, the proposed method is applicable for a more practical case whenthe covariance matrix of colored noise is unknown, although it has higher computational complexity. Moreover, the proposed GLRT can achieve similar performance as the GLRT with the known covariance matrix

 when the number of training samples is large enough. Finally, the effectiveness of the proposed  GLRT is verified by several numerical examples.

Citation: Liu, Y.; Liao, G.; Li, H.; Zhu, S.; Li, Y.; Yin, Y. Passive MIMO Keywords: radar detection; passive radar; colored Gaussian noise; generalized likelihood ratio test; Radar Detection with Unknown multiple-input multiple-output Colored Gaussian Noise. Remote Sens. 2021, 13, 2708. https://doi.org/ 10.3390/rs13142708 1. Introduction Academic Editor: Andrzej Stateczny In the field of radar, passive radar has been an important research area [1–28]. In passive radar, the transmitters are usually the noncooperative illuminators of opportu- Received: 18 May 2021 nity. These illuminators can be digital audio broadcast stations [26], digital television Accepted: 6 July 2021 stations [27], or commercial cellular phone base stations [28]. Compared with active Published: 9 July 2021 radar [29–39], passive radar requires less infrastructure, and is also covert and low-cost, since passive radar does not require to be equipped with its own transmitters. Due to this, Publisher’s Note: MDPI stays neutral passive radar has been attracted great attention. with regard to jurisdictional claims in Compared with active radar that knows the transmitted signal, passive radar does published maps and institutional affil- not know the transmitted signal, which complicates target detection for passive radar. iations. For passive radar detection, according to the property of the transmitted signal of the illuminator of opportunity, the unknown transmitted signal is usually modeled as either a stochastic model or a deterministic model. For the stochastic model, the transmitted signal is usually modeled as a stochastic process, while it is deterministic but unknown Copyright: © 2021 by the authors. for the deterministic model. For the deterministic model, the traditional method that Licensee MDPI, Basel, Switzerland. utilizes the correlation between the direct-path (transmitter to receiver) and target-path This article is an open access article (transmitter to target to receiver) signals is proposed in [1]. However, the performance of distributed under the terms and the method in [1] will be significantly degraded when the signal-to-noise ratio (SNR) of the conditions of the Creative Commons received direct-path signal is low. To improve the detection performance of passive radar, Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ the generalized likelihood ratio test (GLRT) [40] is proposed in [2], in which the unknown 4.0/). parameters are replaced with the corresponding maximum likelihood estimates (MLEs).

Remote Sens. 2021, 13, 2708. https://doi.org/10.3390/rs13142708 https://www.mdpi.com/journal/remotesensing Remote Sens. 2021, 13, 2708 2 of 20

Moreover, for a bistatic passive polarimetric radar network, the GLRT is explored in [4]. For the stochastic model, the GLRT for a multistatic passive radar is investigated in [5], where the unknown transmitted signal of the illuminator of opportunity is treated as a stochastic process. In general, the transmit power of the employed noncooperative illuminators of op- portunity in passive radar is low, which limits the performance of target detection. Never- theless, the detection performance of passive radar can be improved by utilizing multiple illumators and receivers [7], named passive multiple-input multiple-output (MIMO) radar in [23]. In [8], a linear fusion method is proposed for the passive MIMO radar. In this method, for each receiver and transmitter pair of the passive MIMO radar, the cross- correlation between the received target-path and direct-path signals is computed first, and then the weighted combination of those cross correlations forms the final test. The weights are optimized according to a revised deflection coefficient criterion. For passive MIMO radar comprised of multiple illuminators, the GLRT is investigated in [9]. To obtain the GLRT requires obtaining the unknown parameters estimations. In [41], based on the expectation-maximization principle, the authors propose an iterative estimator to estimate the target delay and Doppler for the bistatic radar, where the unknown signal transmit- ted from the noncooperative illuminators of opportunity is molded as an autoregressive process. Furthermore, in [42] the authors provide a survey of parameter estimation and target detection with limited data for the phased-array radar, distributed MIMO radar, and passive radar when the parametric auto-regressive model is used for target detection. To derive a closed-form detector for the passive MIMO radar, the Rao test is derived in [10]. To reduce the computational complexity, the authors devise a detector for passive/active MIMO radar in [11]. In [12], the signal transmitted by the noncooperative illuminator of opportunity is assumed to be a discrete-time, zero-mean, second-order cyclostationary signal. The most locally powerful, invariant-test-inspireddetector is proposed for the MIMO passive bistatic radar system, which consists of a transmitter, a reference array, and a surveillance array. In [13], the signal sequence of the sources of opportunity is treated as Gaussian random vectors with zero mean and unknown covariance matrices, and the in each surveillance channel and reference channel are assumed to be either indepen- dent with identical variances or arbitrarily correlated. Under those assumptions, the GLRT is investigated. In [16], both the Bayesian test and GLRT are derived for passive MIMO radar when the noise variance is either known or unknown. In [17], the signal format of the communication signal is exploited to improve the detection performance of the passive MIMO radar when the variance of is known/unknown. Moreover, in [18], the GLRT for the passive MIMO radar is derived when the signal format of the communication signal is known but the covariance matrix of the colored noise is unknown. In [19], it is assumed that the direct-path signals are not available in the passive MIMO radar, and the preamble information of the transmitted signals of noncooperative illuminators of opportunity is known. Moreover, in [19], the noise is assumed to be Gaussian white noise, and the GLRTs are, respectively, investigated when the white noise variance is known or unknown. In practice, the signal format and preamble information of transmitted signals may not be known, and the direct-path signals are usually available. In this case, the GLRT for the passive MIMO radar network as depicted in Figure1 is explored in [ 23]. In [23], the target-path and direct-path signals are totally unknown, and the covariance matrix of noise is known or the noise is already whitened. Whitening the noise is non-trivial and would require some sort of training samples. However, in practice, these whitened data are unlikely to be present. Remote Sens. 2021, 13, 2708 3 of 20

Receiver 1

Direct-path 1 Target-path 1 Direct-path 2 Target-path 2

Transmitter 1

Transmitter 2

Receiver 2

Figure 1. The notional passive MIMO radar consists of two multichannel receiver and two transmitters.

In this paper, we consider a more practical case where the covariance matrix of colored noise in a passive MIMO radar network is unknown, i.e., the colored noise is un-whitened, and the GLRT will be developed. The developed GLRT can simultaneously estimate the statistics of the colored noise while performing the passive radar target detection. This is a very practical problem: passive radar detection in un-whitened colored noise, for which GLRT-based algorithms have not yet been published to date. These algorithms are highly nontrivial and do not follow easily from the work in [23]. Moreover, the developed GLRT approaches the previous GLRT performance with the known covariance matrix when the number of training samples used to estimate the covariance matrix is increased. The major technical contributions of this paper are summarized as follows. (1) We proposed a target detection method for the passive MIMO radar when the transmitted signal, channel coefficients, and covariance matrix of colored Gaussian noise are unknown. (2) A closed-form GLRT is derived in this paper. The remainder of this paper is organized as follows. In Section2, the signal model and assumption are developed. In Section3, the GLRT for the passive MIMO radar with unknown colored noise is derived. Numerical results are presented in Section4. Finally, conclusions are drawn in Section5.

2. Signal Model and Assumption In this section, the assumptions of a passive MIMO radar network will be made and the signal model will be constructed. In this paper, we assume that there are Nr receivers and Nt transmitters in the passive MIMO radar network. In each receiver, there is an antenna array which can separate the direct-path and target-path signals by spatial filtering [23]. Suppose that the signals transmitted by different noncooperative illuminators of opportunity or transmitters occupy different frequency bands (In a cellular system, this is usually true for the signals transmit- ted by neighboring base stations in order to minimize the mutual interference.) [23]. Hence, the received signals due to the transmission from different transmitter can be separated by frequency domain. Moreover, suppose that in the received signals, the clutter has been suppressed by clutter cancellation techniques [20,43], which is the same assumption made in [23]. For each hypothesized range-Doppler cell under test, passive MIMO radar requires to determine whether there is a target or not. For each range-Doppler cell under test, the Doppler shift and range are known, which can be used to compensate the Doppler shift and time delay of the received signal. Following the compensation in [23], after delay-Doppler compensation, the target- path signal (surveillance channel signal) vector of N time samples at the jth receiver due Remote Sens. 2021, 13, 2708 4 of 20

s h s s to the transmission from the ith transmitter is represented as yi,j = yi,j(0), yi,j(1),..., T s i yi,j(N − 1) , and the direct-path signal (reference channel signal) vector of N time samples at the jth receiver due to the transmission from the ith transmitter is represented as T r h r r r i yi,j = yi,j(0), yi,j(1),..., yi,j(N − 1) , for i = 0, 1, ... , Nt − 1 and j = 0, 1, ... , Nr − 1, as per the model in [23], where (·)T denotes the transpose operator. Similar to [29], we suppose that the training samples, i.e., secondary data, are available, and they have the same probability distribution as the colored Gaussian noise in the reference and surveillance channels (the training samples can be obtained by monitoring the frequency bands, which do not have a communication signal in them). Moreover, suppose that for each receiver and transmitter pair we can obtain K training samples, and the kth training samples obtained in the jth receiver and in the frequency band occupied by T n h n n n i the ith transmitter is yi,j,k = yi,j,k(0), yi,j,k(1),..., yi,j,k(N − 1) , for i = 0, 1, ... , Nt − 1, j = 0, 1, . . . , Nr − 1, and k = 0, 1, . . . , K − 1. For the target detection in passive MIMO radar, under hypothesis H1, i.e., target present, the compensated signals and secondary data can be represented as

s s s H1 : yi,j = µi,jsi + ni,j r r r yi,j = µi,jsi + ni,j n n yi,j,k = ni,j,k. (1)

Under hypothesis H0, i.e., target absent, the compensated signals and secondary data can be described as

s s H0 : yi,j = ni,j r r r yi,j = µi,jsi + ni,j n n yi,j,k = ni,j,k. (2)

r s In (1) and (2), µi,j and µi,j respectively denote the complex reference and surveillance chan- T nel coefficients from the ith transmitter to the jth receiver, si = [si(0), si(1),..., si(N − 1)] is an N × 1 vector collecting the unknown transmitted signal samples of the ith transmitter, T s h s s s i r h r r the N × 1 vectors ni,j = ni,j(0), ni,j(1),..., ni,j(N − 1) and ni,j = ni,j(0), ni,j(1),..., T r i ni,j(N − 1) denote the surveillance and reference channel noise at the jth receiver fo- cused on the transmission from the ith transmitter, respectively, and the N × 1 vector T n h n sn n i ni,j,k = ni,j,k(0), ni,j,k(1),..., ni,j,k(N − 1) denotes the kth training samples obtained at the jth receiver focused on the transmission from the ith transmitter. s r n Suppose that the colored noise vectors ni,j, ni,j, and ni,j,k, for i = 0, 1, ... , Nt − 1, j = 0, 1, ... , Nr − 1, k = 0, 1, ... , K − 1 are all independent identically distributed, and each × of them is the Gaussian random vector with zero mean and covariance matrix Σ ∈ CN N, where Σ is a positive definite matrix, which is assumed to be unknown in this paper.

3. GLRT for Passive MIMO Radar In this section, the probability distribution of the received signal is formulated and the GLRT for passive MIMO radar is investigated. Remote Sens. 2021, 13, 2708 5 of 20

3.1. Probability Distribution of Received Signal

Following the assumption in Section II, under hypothesis H1, the joint conditional probability density function (PDF) of the received target-path signal, direct-path signal, and secondary data in (1) is ( " Nt−1 Nr−1 K−1 H 1 r s n r s 1  n  −1 n p (y , y , y |µ , µ , s, Σ) = exp − yi j k Σ yi j k Nc0 | |c0 ∑ ∑ ∑ , , , , π Σ i=0 j=0 k=0 H H   s s  −1 s s   r r  −1 r r  + yi,j − µi,jsi Σ yi,j − µi,jsi + yi,j − µi,jsi Σ yi,j − µi,jsi , (3)

H where c0 = Nt Nr(K + 2), |·| represents the determinant of a matrix, (·) denotes Hermi-  T T T  T tian conjugate operator, yr = yr  , yr  ,..., yr represents the received signal 0 1 Nt−1 h iT vector that collects all the reference channel samples with yr = yr , yr ,..., yr , i i,0 i,1 i,Nr−1  T T T  T ys = ys  , ys  ,..., ys denotes the received signal vector that collects all 0 1 Nt−1 h iT h T T the surveillance channel samples with ys = ys , ys ,..., ys , yn = yn , yn , i i,0 i,1 i,Nr−1 0 1  TT ..., yn represents the received signal vector that collects all the training sam- Nt−1  T  T  TT h iT ples with yn = yn , yn ,..., yn and yn = yn , yn ,..., yn , i i,0 i,1 i,Nr−1 i,j i,j,0 i,j,1 i,j,K−1  T T T  T µr = µr  , µr  ,..., µr denotes the vector that collects all the reference chan- 0 1 Nt−1  T h iT T T  T nel coefficients with µr = µr , µr ,..., µr , µs = µs  , µs  ,..., µs i i,0 i,1 i,Nr−1 0 1 Nt−1 s represents the vector that collects all the surveillance channel coefficients with µi = h iT h iT µs , µs ,..., µs , and s = (s )T, (s )T,..., (s )T collects the s for all i. i,0 i,1 i,Nr−1 0 1 Nt−1 i Under hypothesis H0, the joint conditional PDF of the received target-path and direct- path signals, and secondary data in (2) is ( " Nt−1 Nr−1 K−1 H 0 r s n r 1  n  −1 n p (y , y , y |µ , s, Σ) = exp − yi j k Σ yi j k Nc0 | |c0 ∑ ∑ ∑ , , , , π Σ i=0 j=0 k=0 H H   r r  −1 r r   s  −1 s + yi,j − µi,jsi Σ yi,j − µi,jsi + yi,j Σ yi,j . (4)

3.2. GLRT Derivation According to the Neyman–Pearson Lemma [44,45], we can obtain that the likelihood ratio test has the largest detection probability when the probability of false alarm (Pfa) is fixed. Nevertheless, for the problem considered in this paper, the likelihood ratio test can not be obtained because the PDF of the received signals depends on the unknown channel coefficients, transmitted signals, and covariance matrix of colored noise. To solve this problem, a typical method is to use the maximum likelihood estimates of unknown parameters as the true values. This method is called GLRT. The GLRT for our problem will be investigated in this subsection. For the passive MIMO radar, the GLRT can be written as

1 r s n r s H maxµr,µs,s,Σ p (y , y , y |µ , µ , s, Σ) 1 ≷ γ, (5) r 0( r s n| r ) maxµ ,s,Σ p y , y , y µ , s, Σ H0 Remote Sens. 2021, 13, 2708 6 of 20

where γ is the detection threshold which is usually determined according to a desired Pfa in practice. The GLRT in (5) can also be written as

H 1 r s n r s 0 r s n r 1 0 max l (y , y , y |µ , µ , s, Σ) − max l (y , y , y |µ , s, Σ) ≷ γ , (6) µr µs s Σ µr s Σ , , , , , H0

where γ0 = ln γ, l0(yr, ys, yn| µr, s, Σ) = ln p0(yr, ys, yn|µr, s, Σ) and l1(yr, ys, yn|µr, µs, s, Σ) 1 r s n r s = ln p (y , y , y |µ , µ , s, Σ) denote the log-likelihood functions under H0 and H1, re- spectively. From (3), the log-likelihood function under H1 is " Nt−1 Nr−1 K−1 H 1 r s n r s  n  −1 n l (y , y , y |µ , µ , s, Σ) = c1 − c0 ln|Σ|− ∑ ∑ ∑ yi,j,k Σ yi,j,k i=0 j=0 k=0 H H   s s  −1 s s   r r  −1 r r  + yi,j − µi,jsi Σ yi,j − µi,jsi + yi,j − µi,jsi Σ yi,j − µi,jsi , (7)

where c1 = −NNt Nr(K + 2) ln π. Similarly, from (4), the log-likelihood function under H0 is " Nt−1 Nr−1 K−1 H 0 r s n r  n  −1 n l (y , y , y |µ , s, Σ) =c1 − c0 ln|Σ|− ∑ ∑ ∑ yi,j,k Σ yi,j,k i=0 j=0 k=0 H H   r r  −1 r r   s  −1 s + yi,j − µi,jsi Σ yi,j − µi,jsi + yi,j Σ yi,j . (8)

In order to obtain the optimal solution to the problem in (6), the optimal solution to the optimization problem

max l1(yr, ys, yn|µr, µs, s, Σ) (9) µr,µs,s,Σ

is required to be obtained. To obtain the optimal solution to the optimization problem in (9), let the derivative of (7) with respect to the covariance matrix Σ be the zero matrix, and then we can achieve

Nt−1 Nr−1  H   H ˆ 1 r r r r s s s s Σ = ∑ ∑ yi,j − µi,jsi yi,j − µi,jsi + yi,j − µi,jsi yi,j − µi,jsi c0 i=0 j=0 # K−1 H n  n  + ∑ yi,j,k yi,j,k , (10) k=0

ˆ r s where Σ satisfies (9) for any si, µij, µij, i = 0, 1, . . . , Nt − 1, j = 0, 1, . . . , Nr − 1. According to Theorem 3.1.4 in [46], Σˆ will be a positive definite matrix with probability 1 if and only if

N 6 Nt Nr(K + 2). (11) Remote Sens. 2021, 13, 2708 7 of 20

Suppose that the inequation in (11) is satisfied. Replacing Σ in (7) with Σˆ in (10) and utilizing the formula xHAx = trAxxH,(7) can be rewritten as

Nt−1 Nr−1 K−1 H 1 r s n r s ˆ  ˆ n ˆ −1 n l y , y , y |µ , µ , s, Σ = c1 − c0 ln|Σ|− ∑ ∑ ∑ yi,j,k Σ yi,j,k i=0 j=0 k=0  H    H   r r ˆ −1 r r s s ˆ −1 s s + yi,j − µi,jsi Σ yi,j − µi,jsi + yi,j − µi,jsi Σ yi,j − µi,jsi

Nt−1 Nr−1 K−1  H ˆ ˆ −1 n n =c1 − c0 ln|Σ|−tr Σ ∑ ∑ ∑ yi,j,k yi,j,k i=0 j=0 k=0 H H  r r  r r   s s  s s  + yi,j − µi,jsi yi,j − µi,jsi + yi,j − µi,jsi yi,j − µi,jsi (12)

=c1 − c0 ln|Σˆ |−tr(Nt Nr(K + 2)IN) (13)

=c1 + c2 − c0 ln|Σˆ |, (14)

where in going from (12) to (13) we use (10), c2 = −NNt Nr(K + 2), and IN represents an N × N identity matrix. Let Ysr = [Ys, Yr] denote the N × 2Nr Nt matrix that collects all the received surveillance h i h i and reference channel signals, where Y = Ys , Ys ,..., Ys , Y = Yr , Yr ,..., Yr , s 0 1 Nt−1 r 0 1 Nt−1 h i h i Ys = ys , ys ,..., ys , and Yr = yr , yr ,..., yr , for i = 0, 1, ... , N − 1. i i,0 i,1 i,Nr−1 i i,0 i,1 i,Nr−1 t

Let S = [ s0, s1,..., sNt−1 ] denote an N × Nt matrix that collects all the unknown trans- mit signals. Let Usr = [Us, Ur] represent an Nt × 2Nt Nr matrix which collects all the surveil- n o lance and reference channel coefficients, where U = blkdiag (µs )T, (µs )T,..., (µs )T , s 0 1 Nt−1 n o U = blkdiag (µr )T, (µr )T,..., (µr )T , and blkdiag{A , A ,..., A } denote the r 0 1 Nt−1 0 1 m block diagonal matrix with the main diagonal elements being the matrix blocks. Then (10) can also be rewritten as

1 h H i Σˆ = (Ysr − SUsr)(Ysr − SUsr) + Rn , (15) c0 where Nt−1 Nr−1 K−1 H n  n  Rn = ∑ ∑ ∑ yi,j,k yi,j,k . (16) i=0 j=0 k=0 Using (15), the optimization problem in (9) can be rewritten as

max l1yr, ys, yn|µr, µs, s, Σˆ  = max l1yr, ys, yn|S, U , Σˆ  = max c + c − c ln|Σˆ |. (17) r s sr 1 2 0 µ ,µ ,s S,Usr S,Usr

We can see that the log-likelihood function in (17) can be viewed as either a function r s of µ , µ , s, and Σˆ , or a function of S, Usr, and Σˆ . Ignoring the constant terms in (17), the optimization problem in (17) can be rewritten as

min ln|Σˆ |. (18) S,Usr

Note that 1 ˆ = ( − )( − )H + Σ N Ysr SUsr Ysr SUsr Rn . (19) c0 Using Theorem 3.1.4 in [46], we can achieve that the matrix

H Q =Rn + (Ysr − SUsr)(Ysr − SUsr) = c0Σˆ (20) Remote Sens. 2021, 13, 2708 8 of 20

which is a positive definite matrix with probability one, if

N ≤ Nt Nr(K + 2). (21)

If the inequation in (21) satisfies, then Σˆ in (19) is a positive definite matrix with probability one. In order to minimize ln|Σˆ | with respect to S for any Usr as in (18), let the derivative of ln|Σˆ | with respect to S∗ be the zero matrix, i.e.,

∂ ln|Σˆ |  ∂Q    = tr Q−1 = −Q−1Y UH + Q−1SU UH = Q−1 SU UH − Y UH = 0, (22) ∂S∗ ∂S∗ sr sr sr sr sr sr sr sr

where (·)∗ represents the conjugate operator. Since the matrix Q−1 is a nonsingular matrix, from (22) we can obtain the S that satisfies (18) for any Usr as

 † ˆ H H S = YsrUsr UsrUsr , (23)

where (A)† denotes the Moore Penrose inverse of A. Using (23), the optimization problem in (18) can be rewritten as

˜  ˜ H min Rn + Ysr − YsrUsr Ysr − YsrUsr , (24) Usr

˜ H H† where Usr = Usr UsrUsr Usr. Let F be the object function in (24). We can achieve

= + − ˜  − ˜ H = + − ˜  − ˜ H H F Rn Ysr YsrUsr Ysr YsrUsr Rn Ysr I2Nr Nt Usr I2Nr Nt Usr Ysr ⊥ ⊥H H = Rn + YsrP H P H Y , (25) Usr Usr sr

where

† ⊥ H H P H =I2N N − U˜ sr = I2N N − U UsrU Usr (26) Usr r t r t sr sr

H is the orthogonal projection matrix of Usr [47]. ⊥ ⊥H ⊥ ⊥ ⊥ Note that P H = P H , and P H P H = P H . Hence, (25) can be simplified as Usr Usr Usr Usr Usr

⊥ ⊥H H ⊥ H F = Rn + YsrP H P H Y = Rn + YsrP H Y . (27) Usr Usr sr Usr sr

⊥ ⊥ Since P H is orthogonal projection matrix, the eigenvalues of P H are either 1 or 0. Usr Usr ⊥ Using Theorem 21.5.7 in [47], we can obtain that P H is orthogonally diagonalizable, i.e., Usr

  H  ⊥ H Im 0m×(2Nr Nt−m) Vm H P H =VDV = [Vm, V2N N −m] H = VmVm, (28) Usr r t 0 0 V (2Nr Nt−m)×m 2Nr Nt−m 2Nr Nt−m

where D = diag{Im, 02Nr Nt−m} with 0n denotes an n × n zero matrix, V is an orthog- H H onal matrix, i.e., VV = V V = I2Nr Nt , Vm collects the first m column vectors of V, V collects the last 2N N − m column vectors of V, VHV = I , VH V = 2Nr Nt−m r t m m m 2Nr Nt−m m  ⊥  0(2N N −m)×m with 0n×m denotes an n × m zero matrix, and m = ρ P H with ρ(·) repre- r t Usr sents the rank of a matrix. Plugging (28) into (27) yields

H H F = Rn + YsrVmVmYsr . (29) Remote Sens. 2021, 13, 2708 9 of 20

According to Theorem 3.1.4 in [46], if

Nt NrK ≥ N, (30)

the matrix Rn will be positively definite with probability one. We assume that (30) is sat- isfied. By using |An×nBn×n| = |An×n||Bn×n| and |Im + Am×nBn×m| = |In + Bn×mAm×n|, (29) can be rewritten as

−1 H H H H −1 F =|Rn| IN + Rn YsrVmVmYsr = |Rn| Ir + VmYsrRn YsrVm . (31)

H Since VmVm = Im,(31) can be simplified as

=| | H + H H −1 F Rn VmI2Nr Nt Vm VmYsrRn YsrVm   =| | H + H −1 Rn Vm I2Nr Nt YsrRn Ysr Vm

H =|Rn| VmΨVm , (32)

where

H −1 Ψ = I2Nr Nt + YsrRn Ysr (33)

is a 2Nr Nt × 2Nr Nt positive definite matrix. From (32), the optimization problem in (24) can be rewritten as the following opti- mization problem

H min F = |Rn| VmΨVm Vm (34) H subject to Vm Vm = Im. To solve the optimization problem in (34), we introduce Theorem 1 (the proof is given in AppendixA).

Theorem 1. Let A be an n × n positive Hermitian matrix, and Xm = [x1, x2,..., xm] be an n × m matrix with m ≤ n. The solution to

H arg min |Xm AXm| Xm (35) H subject to Xm Xm = Im

is Xˆ m = [xˆ1, xˆ2,..., xˆ m], which collects the m eigenvectors corresponding to the m smallest eigenvalues of A, i.e., xˆ i ∈ {v1, v2,..., vm} and xˆ i 6= xˆ j, for i 6= j, where v1, v2, ... , vm are the m eigenvectors corresponding to the m smallest eigenvalues of A. Moreover, the optimum value of the objective function in (35) is the product of m smallest eigenvalues of A.

ˆ According to Theorem 1, the optimal solution to (34) is Vm = [ψ1, ψ2,..., ψm] and the ˆ m optimal value is F(m) = |Rn| ∏i=1 λi, where λi, for i = 1, 2, ... , 2Nr Nt, is the eigenvalue of Ψ, ψ is the corresponding eigenvector, and 1 ≤ λ1 ≤ λ2 ≤ · · · ≤ λ2N N . Note that the i   r t ˆ( ) = ⊥ optimal value F m depends on the rank m ρ PUH . Obviously, the smaller the m, the smaller the optimal value Fˆ(m) will be. H H† H Since P H = U˜ = U U U U is the projection matrix of U and the rank of Usr sr sr sr sr sr sr H H   ⊥ Usr satisfies ρ Usr ≤ Nt, the rank of P H is ρ P H ≤ Nt. Since P H = I2Nr Nt − P H , the Usr Usr Usr Usr ⊥ rank of P H satisfies Usr

 ⊥    m =ρ P H = 2Nr Nt − ρ P H ≥ 2Nr Nt − Nt. (36) Usr Usr Remote Sens. 2021, 13, 2708 10 of 20

Using (36) and Theorem 1, we can obtain that the optimal solution to the optimization problem in (34) is h i Vˆ =Ψ = ψ , ψ ,..., ψ (37) 2Nr Nt−Nt 2Nr Nt−Nt 1 2 2Nr Nt−Nt

and the corresponding optimal value is

2Nr Nt−Nt ˆ F(2Nr Nt − Nt) = |Rn| ∏ λi. (38) i=1

The optimal solution Uˆ sr to the optimization problem in (24) is (the proof is shown in AppendixB)

ˆ = H H Usr Γ ΩΨNt , (39) h i where Ψ = ψ , ψ ,..., ψ , Ω is any N × N invertible diago- Nt 2Nr Nt−Nt+1 2Nr Nt−Nt+2 2Nr Nt t t H H nal matrix, and Γ is any Nt × Nt orthogonal matrix, i.e., Γ Γ = ΓΓ = INt . Using (9), (17), and (38), we can obtain

max l1(yr, ys, yn|µr, µs, s, Σ) = max l1yr, ys, yn|µr, µs, s, Σˆ  µr,µs,s,Σ µr,µs,s 1 r s n  = max l y , y , y |S, Usr, Σˆ = max c1 + c2 − c0 ln|Σˆ | S,Usr S,Usr ! Fˆ(2N N − N ) 2Nr Nt−Nt = + − r t t = + + − | | c1 c2 c0 ln N c1 c2 c3 c0 ln Rn ∏ λi , (40) c0 i=1

where c3 = NNt Nr(K + 2) ln[Nt Nr(K + 2)]. Similar to the previous processing, under the hypothesis H0, the following optimiza- tion problem is required to be solved

max l0(yr, ys, yn|µr, s, Σ). (41) µr,s,Σ

Using the previous estimation method, we can achieve that # Nt−1 Nr−1  H  H K−1  H ˆ 1 r r r r s s n n Σ = ∑ ∑ yi,j − µi,jsi yi,j − µi,jsi +yi,j yi,j + ∑ yi,j,k yi,j,k c0 i=0 j=0 k=0

1 h H i = (Yr − SUr)(Yr − SUr) + Rsn , (42) c0 r is the optimal solution to the optimization problem in (41) for any si, µij, i = 0, 1, ... , Nt − 1, j = 0, 1, . . . , Nr − 1, where

Nt−1 Nr−1 H Nt−1 Nr−1 K−1 H s  s  n  n  Rsn = ∑ ∑ yi,j yi,j + ∑ ∑ ∑ yi,j,k yi,j,k . (43) i=0 j=0 i=0 j=0 k=0

Substituting (42) into (8) with Σ = Σˆ produces

0 r s n r  l y , y , y |µ , s, Σˆ =c1 + c2 − c0 ln|Σˆ |. (44)

From (44), we can achieve that the optimization problem in (41) can be reformulated as

max l0yr, ys, yn|µr, s, Σˆ  = max l0yr, ys, yn|S, U , Σˆ  = max c + c − c ln|Σˆ |. (45) r r 1 2 0 µ ,s S,Ur S,Ur Remote Sens. 2021, 13, 2708 11 of 20

In (45), we can see that the log-likelihood function l0(·) can be viewed as either a r function of µ , s, and Σˆ , or a function of S, Ur, and Σˆ . Ignoring the constant terms in (45), the optimization problem in (45) can be rewritten as

min ln|Σˆ |. (46) S,Ur

Using Theorem 3.1.4 in [46], we can achieve that the matrix Σˆ is a positive definite matrix with probability one, if

N ≤ Nt Nr(K + 2). (47)

Assume that (47) is true. In order to minimize ln|Σˆ | with respect to S for any Ur as in (46), let the derivative of ln|Σˆ | with respect to S be the zero matrix, and we can achieve that  † ˆ H H S = YrUr UrUr . (48)

Using (48), the optimization problem in (46) can be rewritten as

˜  ˜ H min Yr − YrUr Yr − YrUr + Rsn , (49) Ur

˜ H H† where Ur = Ur UrUr Ur. Similar to previous processing, we can achieve that the optimal solution Uˆ r to the problem in (49) is

ˆ = 0H 0 0H Ur Γ Ω ΨNt , (50) 0 0 where Ω is any Nt × Nt invertible diagonal matrix, Γ is any Nt × Nt orthogonal matrix, h i and Ψ0 = ψ0 , ψ0 ,..., ψ0 , in which ψ0, for i = 1, 2, ··· , N N , is Nt Nr Nt−Nt+1 Nr Nt−Nt+2 Nr Nt i r t 0 the eigenvector corresponding to the eigenvalue λi of

0 H −1 Ψ = INr Nt + Yr Rsn Yr (51)

that is an N N × N N positive definite matrix, and 1 ≤ λ0 ≤ λ0 ≤ · · · ≤ λ0 . r t r t 1 2 Nr Nt The optimal value of (49) is

Nr Nt−Nt ˆ0 0 F (Nr Nt − Nt) = |Rsn| ∏ λi. (52) i=1

Using (41), (45), and (52), we can obtain

max l0(yr, ys, yn|µr, s, Σ) = max l0yr, ys, yn|µr, s, Σˆ  = max l0yr, ys, yn|S, U , Σˆ  r r r µ ,s,Σ µ ,s S,Ur 0 Fˆ (Nr Nt − Nt) = max c1 + c2 − c0 ln|Σˆ |= c1 + c2 − c0 ln S,U N r c0 Nr Nt−Nt ! 0 =c1 + c2 + c3 − c0 ln |Rsn| ∏ λi . (53) i=1

Substituting (40) and (53) into (6) yields

Nr N −N 0 t t H1 |Rsn| ∏i=1 λi 0 c0 ln 2N N −N ≷ γ , (54) r t t H |Rn| ∏i=1 λi 0 Remote Sens. 2021, 13, 2708 12 of 20

i.e., the GLRT is equivalent to

Nr N −N 0 t t H1 |Rsn| ∏i=1 λi 2N N −N ≷ γ˜, (55) r t t H |Rn| ∏i=1 λi 0

 0 where γ˜ = exp γ /c0 .

3.3. Performance Analysis and Discussion The computational complexity of the proposed GLRT in (55) is approximately 3 2 2 2 3 3  O 3N + 3N Nr Nt + 5NNr Nt + NNr Nt + 9Nr Nt + 2Nr NtK . The computational com- 2 3 3 plexity of the GLRT in [23] is approximately O NNr Nt + 9Nr Nt . Obviously, the com- putational complexity of the proposed method is higher than the GLRT in [23], since the proposed method does not know the covariance matrix of colored noise, which complicates the target detection and increases the computational complexity. In practice, the proposed GLRT will determine whether there is a target or not for each range and Doppler cell. Hence, for multiple targets that are in different range or Doppler cells, the proposed GLRT is applicable. That is to say, the proposed GLRT work for multi-target detection.

4. Simulation In this section, several simulations results are provided to show the performance of the proposed GLRT. In the following simulations, when the channel coefficients, transmitted signal, and covariance matrix of colored noise are unknown, the proposed GLRT is de- noted by ‘UK CovM’. When the channel coefficients and transmitted signals are unknown whereas the covariance matrix of colored noise is known, the GLRT in [23] is denoted by ‘K CovM’. In the simulations, the results are obtained by performing 105 Monte Carlo experi- ments. The number of transmitted signal samples, receivers, and transmitters is N = 10, Nr = 2, and Nt = 2, respectively. Furthermore, the transmitted signal of the ith transmitter N×1 is si = exp{jθi}, where θi ∈ R is a random phase vector with each independent compo- nent uniformly distributed on [0, 2π] (same as [23]). Suppose that all the independent noise r s n vectors in {ni,j, ni,j, and ni,j,k, for i = 0, 1, ... , Nt − 1, j = 0, 1, ... , Nr − 1, k = 0, 1, ... , K − 1} obey the same complex Gaussian distribution with zero mean and covariance matrix Σt, where the mth row and nth column element of Σt is m/n, for m ≤ n. The SNR of the received target-path signal is defined as

1 Nt−1 Nr−1 = k s k2 TNR 2 ∑ ∑ µijsi , (56) NNr Ntσ1 i=0 j=0

2 where σ1 represents the colored noise power in the surveillance channel. The SNR of the received direct-path signal is defined as

1 Nt−1 Nr−1 = k r k2 DNR 2 ∑ ∑ µijsi , (57) NNr Ntσ2 i=0 j=0

2 where σ2 represents the colored noise power in the reference channel. s r In the simulation, µij and µij are randomly drawn from the Gaussian distribution with zero mean and unit variance. Then, they are scaled to achieve the desired TNR and DNR according to (56) and (57), respectively.

4.1. Variation of Pd with Pfa

The dependence of the probability of detection (Pd) on Pfa is illustrated in Figure2. In Figure2a, the TNR is −15 dB and the DNR is −30 dB. We choose the number of training samples to be K = 3, K = 9, and K = 15, respectively. In Figure2b, the TNRs are set Remote Sens. 2021, 13, 2708 13 of 20

as −18 dB, −15 dB, and −12 dB, respectively. The DNR is −30 dB and K = 15 training samples are used. Figure2a shows that the performance of the proposed GLRT is close to that of the GLRT in [23] when more than nine training samples are employed for the illustrated cases. This is due to that the estimate of unknown covariance matrix of colored noise becomes more and more accurate with the increase of training samples. As depicted in Figure2b, the performance of the proposed GLRT and that of the GLRT in [23] are improved when the TNR is increased, since more accurate estimates of unknown parameters can be obtained with higher TNR.

1 1 UK CovM, K=3 UK CovM, TNR=-18dB UK CovM, K=9 K CovM, TNR=-18dB 0.8 UK CovM, K=15 0.8 UK CovM, TNR=-15dB K CovM K CovM, TNR=-15dB UK CovM, TNR=-12dB 0.6 0.6 K CovM, TNR=-12dB d d P P 0.4 0.4

0.2 0.2

0 0 -3 -2 -1 0 10 10 10 10 10-3 10-2 10-1 100 P P fa fa (a) (b)

Figure 2. Pdvs. Pfa.(a) TNR = −15 dB. (b) K = 15.

Under different DNRs, the dependence of Pd on Pfa is illustrated in Figure3. In Figure3, the TNR is chosen to be −15 dB, and there are K = 15 training samples. The DNRs are −30 dB, −10 dB, and 10 dB, respectively. When the DNR is improved, both the performance of the proposed GLRT and that of the GLRT in [23] are improved, since more accurate estimates of unknown parameters are obtained with high DNR. Compared with the proposed GLRT, the GLRT in [23] can achieve greater performance gain when the DNR is increased, since the proposed GLRT needs to estimate more unknown parameters.

1 UK CovM, DNR=-30dB K CovM, DNR=-30dB 0.8 UK CovM, DNR=-10dB K CovM, DNR=-10dB UK CovM, DNR=10dB 0.6 K CovM, DNR=10dB d P 0.4

0.2

0 10-3 10-2 10-1 100 P fa

Figure 3. Pd vs. Pfa (TNR = −15 dB, K = 15).

4.2. Variation of Pd with TNR

The dependence of the Pd on TNR is illustrated in Figure4. In Figure4a, we consider that there are K = 6, K = 9, K = 15, and K = 21 training samples, respectively. The Pfa is chosen to be 10−3, and the DNR is set as −30 dB. In Figure4b, the DNR is −30 dB, −3 −2 and we consider that there are K = 18 training samples. The Pfas are 10 , 10 , and 10−1, respectively. Remote Sens. 2021, 13, 2708 14 of 20

As shown in Figure4, the P d of both the GLRT in [23] and the proposed GLRT is increased with the increase of TNR, because the estimates of the unknown parameters become more accurate when TNR is increased. Moreover, as the number of training samples increases, the proposed GLRT gradually approaches the GLRT in [23]. Increasing the Pfa, the detection threshold will be reduced, and hence the Pd will be improved.

1 1

0.8 0.8

0.6 0.6 UK CovM, P =10-3 fa d d P P K CovM, P =10-3 fa 0.4 0.4 UK CovM, P =10-2 UK CovM, K=6 fa K CovM, P =10-2 UK CovM, K=9 fa 0.2 0.2 UK CovM, K=15 UK CovM, P =10-1 fa UK CovM, K=21 K CovM, P =10-1 K CovM fa 0 0 -20 -15 -10 -5 0 5 10 -20 -15 -10 -5 0 5 10 TNR(dB) TNR(dB) (a) (b)

−3 Figure 4. The variation of Pd with TNR. (a) DNR = −30 dB, Pfa = 10 .(b) DNR = −30 dB, K = 18.

In Figure5a, the variation of the P d with TNR is illustrated with different DNRs. In Figure5a, the DNRs are considered to be −30 dB, −10 dB, and 10 dB, respectively, and −3 there are K = 21 training samples. The Pfa is set as 10 . In Figure5b, the variation of the P d with TNR for different numbers of receivers and transmitters is shown. In Figure5b, there are K = 20 training samples, the DNR is −3 −30 dB, and the Pfa is chosen to be 10 . There are (Nt = 1, Nr = 2), (Nt = 2, Nr = 2), (Nt = 2, Nr = 3), and (Nt = 3, Nr = 3) transmitters and receivers, respectively. In Figure5a, as expected, with an increase in DNR, the performance of both the GLRT in [23] and the proposed GLRT is improved, because the estimate accuracy of unknown parameters is improved when the DNR is improved. In Figure5b, as expected, the P d is improved when either the number of transmitters or the number of receivers is improved. Obviously, with more transmitters and receivers, we can obtain more observations that will contribute to obtaining more accurate estimates of unknown parameters. Hence, we can achieve higher Pd with more transmitters and receivers.

1 1

0.8 0.8

N =1,N =2 t r N =2,N =2 0.6 0.6 t r

d d N =2,N =3 t r P P N =3,N =3 0.4 UK CovM,DNR=-30dB 0.4 t r K CovM,DNR=-30dB UK CovM,DNR=-10dB 0.2 K CovM,DNR=-10dB 0.2 UK CovM,DNR=10dB K CovM,DNR=10dB 0 0 -20 -15 -10 -5 0 5 10 -20 -15 -10 -5 0 5 10 TNR(dB) TNR(dB) (a) (b)

−3 −3 Figure 5. The variation of Pd with TNR. (a)Pfa = 10 , K = 21.(b)Pfa = 10 , K = 20, DNR = −30 dB. Remote Sens. 2021, 13, 2708 15 of 20

4.3. Pd Loss The performance of the GLRT in [23] is better than that of the proposed GLRT, since the covariance matrix is unknown for the proposed GLRT. To measure the performance degradation, the following Pd loss is utilized, which is defined as

= − LPd PdK PdUK , (58)

where PdUK denotes the Pd of the proposed GLRT, and PdK denotes the Pd of the GLRT in [23]. −3 In Figure6, the P d loss of the proposed method is shown. In Figure6a, the P fa is 10 , the DNR is −30 dB, and the TNRs are, respectively, set as −14 dB, −12 dB, −10 dB, and −8 dB. In Figure6b, the DNR is considered to be −30 dB, the TNR is chosen to be −12 dB, −3 −2 −1 and the Pfas are 10 , 10 , and 10 , respectively. As expected, the Pd loss decreases as the number of training samples increases. This is due to that we can achieve more accurate estimates of the unknown covariance matrix of colored noise with the increase of the number of training samples. As shown in Figure6 , when there are more than 25 training samples, the Pd loss will be close to 0. For low TNR, the Pd loss is smaller as shown in Figure6a. For low TNR, both the GLRT in [ 23] and the proposed GLRT have small Pd, which will result in a small Pd loss. Moreover, the proposed GLRT needs to estimate more unknown parameters than the GLRT in [23]. Using the same training samples, the GLRT in [23] will achieve more performance improvement when the TNR is increased. Hence, the Pd loss will be increased when the TNR is increased. In Figure6b, for large P fa, the Pd loss is larger. Under large Pfa, both the Pd of the GLRT in [23] and that of the proposed GLRT are large. For large Pfa, to achieve the same Pd loss, it needs more training samples for the proposed GLRT.

0.6 0.4 TNR=-14dB P =10-3 0.35 fa 0.5 TNR=-12dB TNR=-10dB P =10-2 0.3 fa TNR=-8dB P =10-1 0.4 fa 0.25

loss 0.2

loss 0.3 d d P P 0.15 0.2 0.1

0.1 0.05

0 0 0 5 10 15 20 25 30 0 5 10 15 20 25 30 Number of training samples K Number of training samples K (a) (b)

−3 Figure 6. The Pd loss of the proposed method. (a) DNR = −30 dB, Pfa = 10 .(b) DNR = −30 dB, TNR = −12 dB.

In Figure7, the dependence of the P d loss on the number of training samples is −3 illustrated. In Figure7, the TNR is −12 dB, the Pfa is 10 , and the DNRs are −30 dB, −10 dB, and 10 dB, respectively. As the number of training samples increases, the Pd loss is decreased. Moreover, the proposed GLRT will need more training samples to obtain the same Pd loss when the DNR is improved. Remote Sens. 2021, 13, 2708 16 of 20

0.6 DNR=-30dB 0.5 DNR=-10dB DNR=10dB

0.4

loss 0.3 d P

0.2

0.1

0 0 5 10 15 20 25 30 Number of training samples K

−3 Figure 7. The Pd loss of the proposed method (TNR = −12 dB, Pfa = 10 ). 5. Conclusions In this paper, target detection of a passive MIMO radar is studied when the transmitted signals, complex channel coefficients, and covariance matrix of colored Gaussian noise are unknown. A GLRT for passive MIMO radar is derived by utilizing the training samples. Simulation results show that the proposed GLRT can almost achieve the same performance of a GLRT with a known covariance matrix if there are sufficient training samples. In this paper, we assume that the noise and clutter are independent and identically distributed, while they might be heterogeneous in practice. In our future work, target detection for passive MIMO radar in the heterogeneous environment will be investigated. Moreover, the intelligent methods has become a hot topic for target detection nowadays, such as machine- learning-based approaches. In our future work, we will also investigate the intelligent method for target detection in the passive MIMO radar network.

Author Contributions: Conceptualization, Y.L. (Yongjun Liu) and G.L.; methodology, Y.L. (Yongjun Liu); software, Y.L. (Yongjun Liu) and H.L.; validation, Y.L. (Yongjun Liu), G.L., and S.Z.; formal analysis, Y.L. (Yongjun Liu) and G.L.; investigation, Y.L. (Yongjun Liu) and S.Z.; resources, Y.L. (Yongjun Liu), Y.Y., and Y.L. (Yachao Li); data curation, Y.L. (Yongjun Liu) and H.L.; writing—original draft preparation, Y.L. (Yongjun Liu) and H.L.; writing—review and editing, Y.L. (Yongjun Liu); visualization, Y.L. (Yongjun Liu) and S.Z.; supervision, Y.L. (Yachao Li), G.L., and Y.Y.; project administration, Y.L. (Yongjun Liu), G.L., and Y.L. (Yachao Li); funding acquisition, Y.L. (Yongjun Liu), G.L., and Y.L. (Yachao Li). All authors have read and agreed to the published version of the manuscript. Funding: This research was funded by the National Natural Science Foundation of China under Grant No. 62001352, by the National Key R & D Program of China under Grant No. 2018YFB2202500, by the Foundation for Innovative Research Groups of the National Natural Science Foundation of China under Grant No. 61621005, and by the National Natural Science Foundation of China under Grant No. 61671352. Institutional Review Board Statement: Not applicable. Informed Consent Statement: Not applicable. Acknowledgments: The authors would like to thank the anonymous reviewers for their valuable and useful comments and suggestions that helped improve the paper. Conflicts of Interest: The authors declare no conflict of interest. Remote Sens. 2021, 13, 2708 17 of 20

Appendix A. Proof of Theorem 1 Since A is a positive Hermitian matrix, according to Theorem 21.5.7 in [47], A is H orthogonally diagonalizable, i.e., A = VDV , where V = [v1, v2,..., vn] is an orthogonal matrix, D = diag{d1, d2,..., dn} is a diagonal matrix, and 0 < d1 ≤ d2 ≤,..., ≤ dn. For any n × m matrix Xm that satisfies the constraint in (35), we can obtain that the H m × m matrix G(Xm) = Xm AXm is also a positive Hermitian matrix, since A is a positive Hermitian matrix. Thus, according to Theorem 21.5.7 in [47], G(Xm) is orthogonally diag- H onalizable, i.e., G(Xm) = WΦW , where W = [w1, w2,..., wm] is an m × m orthogonal matrix, Φ = diag{φ1, φ2,..., φm} is a diagonal matrix, and 0 < φ1 ≤ φ2 ≤,..., ≤ φm. H Let G(Xm) = |Xm AXm| = |G(Xm)| represent the objective function in (35). Since W is an orthogonal matrix, we can obtain

H H H H G(Xm) = |Xm AXm| = |W Xm AXmW| = |Zm AZm|, (A1)

where Zm = XmW with the ith column of Zm being zi = Xmwi, for i = 1, 2, . . . , m. H Since G(Xm) = WΦW , we can obtain

m G(Xm) = |Φ| = ∏ φi (A2) i=1 H = |Zm AZm| = G(Zm), (A3)

where from (A2) to (A3) we use (A1). m m Suppose that ∏i=1 φi < ∏i=1 di, i.e., suppose that there is a solution Zm that satisfies m m G(Zm) = ∏i=1 φi < ∏i=1 di. Since φi and di, for i = 1, 2, ... , m, are positive and non- decreasing with the increase of i, there must be some i0 that satisfies

φi0 < di0 . (A4)

H 0 Note that Zm Zm = Im. Hence, the first i columns of Zm, denoted as Zi0 , form the basis 0 0 for an i -dimensional subspace Si of n-dimensional complex space Cn. Any n × 1 vector x 0 in the subspace Si can be represented as

i0 x = ∑ αizi, (A5) i=1

0 0 i0 where αi, for i = 1, 2, . . . , i , are the coefficients of x in the i -dimensional subspace S . Hence, we can obtain

H i0 2 x Ax ∑i=1 |αi| φi = = 0 max H max 0 φi . (A6) i0 x x α ,i=1,2,...,i0 i 2 x∈S ,x6=0 i ∑i=1 |αi| Using Theorem 4.2.6 in [48], we can obtain that

xHAx = 0 ≤ 0 min max H di φi , (A7) dim(S)=i0 x∈S,x6=0 x x

where dim(S) represents the dimension of the subspace S. Obviously, the obtained solution in (A7) contradicts the assumption in (A4). This completes the proof. Theorem 1 can also be proved by majorization theory [49,50]. Remote Sens. 2021, 13, 2708 18 of 20

Appendix B. Maximum Likelihood Estimate Uˆ h i The optimal solution to (34) is Vˆ = Ψ = ψ , ψ ,..., ψ . 2Nr Nt−Nt 2Nr Nt−Nt 1 2 2Nr Nt−Nt Hence, we can obtain

 † ⊥ ˆ H ˆ ˆ H ˆ H P H = I2N N − U UsrU Usr = I2N N − P ˆ H = Ψ2N N −N Ψ − . (A8) Uˆ sr r t sr sr r t Usr r t t 2Nr Nt Nt

Furthermore, we can obtain

 † ˆ H ˆ ˆ H ˆ ⊥ H P ˆ H =U UsrU Usr = I2N N − P H = I2N N − Ψ2N N −N Ψ − . (A9) Usr sr sr r t Uˆ sr r t r t t 2Nr Nt Nt h Note that Ψ = [Ψ , Ψ ] and ΨΨH = I , where Ψ = ψ , 2Nr Nt−Nt Nt 2Nr Nt Nt 2Nr Nt−Nt+1 i ψ ,..., ψ is a 2N N × N full-column rank matrix. Hence, we can ob- 2Nr Nt−Nt+2 2Nr Nt r t t tain that

H H P H =ΨΨ − Ψ − Ψ Uˆ sr 2Nr Nt Nt 2Nr Nt−Nt  " H # I − 0 Ψ =Ψ Ψ  2Nr Nt Nt (2Nr Nt−Nt)×Nt 2Nr Nt−Nt 2Nr Nt−Nt Nt 0 I ΨH Nt×(2Nr Nt−Nt) Nt Nt  " H # I − 0 Ψ − Ψ Ψ  2Nr Nt Nt (2Nr Nt−Nt)×Nt 2Nr Nt−Nt 2Nr Nt−Nt Nt 0 0 ΨH Nt×(2Nr Nt−Nt) Nt Nt  " H #   02Nr Nt−Nt 0(2Nr Nt−Nt)×Nt Ψ2N N −N H = Ψ − Ψ r t t = Ψ Ψ . (A10) 2Nr Nt Nt Nt 0 I ΨH Nt Nt Nt×(2Nr Nt−Nt) Nt Nt

ˆ H Now we show that Usr = ΨNt ΩΓ, where Ω is any Nt × Nt invertible diagonal matrix, H H and Γ is any Nt × Nt orthogonal matrix, i.e., Γ Γ = ΓΓ = INt . We can verify that

† † H H  H H  H H P H = Uˆ Uˆ Uˆ Uˆ = Ψ ΩΓ Γ ΩΨ Ψ ΩΓ Γ ΩΨ . (A11) Uˆ sr sr sr sr sr Nt Nt Nt Nt

 † Note that ΨH Ψ = I and ΓHΩΨH Ψ ΩΓ = ΓHΩ−2Γ. Hence, (A11) can be Nt Nt Nt Nt Nt simplified as

H −2 H H H P H = Ψ ΩΓΓ Ω ΓΓ ΩΨ = Ψ Ψ , (A12) Uˆ sr Nt Nt Nt Nt which is the same result in (A10). Hence, the maximum likelihood estimate Uˆ sr is

ˆ = H H Usr Γ ΩΨNt . (A13)

References 1. Liu, J.; Li, H.; Himed, B. On the performance of the cross-correlation detector for passive radar applications. Signal Process. 2015, 113, 32–37. [CrossRef] 2. Cui, G.; Liu, J.; Li, H.; Himed, B. Signal detection with noisy reference for passive sensing. Signal Process. 2015, 108, 389–399. [CrossRef] 3. Subedi, S.; Zhang, Y.D.; Amin, M.G.; Himed, B. Group sparsity based multi-target tracking in passive multi-static radar systems using Doppler-only measurements. IEEE Trans. Signal Process. 2016, 64, 3619–3634. [CrossRef] 4. Prateek, G.; Hurtado, M.; Nehorai, A. Target detection using weather radars and electromagnetic vector sensors. Signal Process. 2017, 137, 387–397. [CrossRef] 5. Zhang, X.; Li, H.; Himed, B. Multistatic detection for passive radar with direct-path interference. IEEE Trans. Aerosp. Electron. Syst. 2017, 53, 915–925. [CrossRef] 6. Zhang, Y.D.; Amin, M.G.; Himed, B. Structure-aware sparse reconstruction and applications to passive multistatic radar. IEEE Aerosp. Electron. Syst. Mag. 2017, 32, 68–78. [CrossRef] Remote Sens. 2021, 13, 2708 19 of 20

7. He, Q.; Blum, R.S. The significant gains from optimally processed multiple signals of opportunity and multiple receive stations in passive radar. IEEE Signal Process. Lett. 2014, 21, 180–184. [CrossRef] 8. Zhao, H.Y.; Liu, J.; Zhang, Z.J.; Liu, H.; Zhou, S. Linear fusion for target detection in passive multistatic radar. Signal Process. 2017, 130, 175–182. [CrossRef] 9. Chalise, B.K.; Himed, B. GLRT detector in single frequency multi-static passive radar systems. Signal Process. 2018, 142, 504–512. [CrossRef] 10. Fazlollahpoor, M.; Derakhtian, M.; Khorshidi, S. Rao Detector for Passive MIMO Radar With Direct-Path Interference. IEEE Trans. Aerosp. Electron. Syst. 2020, 56, 2999–3009. [CrossRef] 11. Li, Y.; He, Q.; Blum, R.S. Limited-Complexity Receiver Design for Passive/Active MIMO Radar Detection. IEEE Trans. Signal Process. 2019, 67, 3258–3271. [CrossRef] 12. Horstmann, S.; Ramírez, D.; Schreier, P.J. Two-Channel Passive Detection of Cyclostationary Signals. IEEE Trans. Signal Process. 2020, 68, 2340–2355. [CrossRef] 13. Santamaria, I.; Scharf, L.L.; Via, J.; Wang, H.; Wang, Y. Passive detection of correlated subspace signals in two MIMO channels. IEEE Trans. Signal Process. 2017, 65, 5266–5280. [CrossRef] 14. Santamaria, I.; Via, J.; Scharf, L.L.; Wang, Y. A GLRT approach for detecting correlated signals in white noise in two MIMO channels. In Proceedings of the 2017 25th European Signal Processing Conference (EUSIPCO), Kos, Greece, 28 August–2 September 2017; IEEE: Hoboken, NJ, USA, 2017; pp. 1395–1399. 15. Santamaria, I.; Scharf, L.L.; Cochran, D.; Vía, J. Passive detection of rank-one signals with a multiantenna reference channel. In Proceedings of the 2016 24th European Signal Processing Conference (EUSIPCO), Budapest, Hungary, 29 August–2 September 2016; IEEE: Hoboken, NJ, USA, 2016; pp. 140–144. 16. Howard, S.D.; Sirianunpiboon, S. Passive radar detection using multiple transmitters. In Proceedings of the 2013 Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, 3–6 November 2013; IEEE: Hoboken, NJ, USA, 2013; pp. 945–948. 17. Karthik, A.K.; Blum, R.S. Improved Detection Performance for Passive Radars Exploiting Known Communication Signal Form. IEEE Signal Process. Lett. 2018, 25, 1625–1629. [CrossRef] 18. Liu, Y.; Blum, R.S.; Liao, G.; Zhu, S. Passive MIMO radar detection exploiting known format of the communication signal observed in colored noise with unknown covariance matrix. Signal Process. 2020, 174, 107611. [CrossRef] 19. Liu, Y.; Liao, G.; Xu, J.; Yang, Z.; Yin, Y. Improving detection performance of passive MIMO radar by exploiting the preamble information of communications signal. IEEE Syst. J. 2020, 1–12. [CrossRef] 20. Colone, F.; O’hagan, D.; Lombardo, P.; Baker, C. A multistage processing algorithm for disturbance removal and target detection in passive bistatic radar. IEEE Trans. Aerosp. Electron. Syst. 2009, 45, 698–722. [CrossRef] 21. Zaimbashi, A.; Derakhtian, M.; Sheikhi, A. GLRT-based CFAR detection in passive bistatic radar. IEEE Trans. Aerosp. Electron. Syst. 2013, 49, 134–159. [CrossRef] 22. Masjedi, M.; Moddares-Hashemi, M.; Sadri, S. Theoretical approach for target detection and interference cancellation in passive radars. IET Radar Sonar Navigat. 2013, 7, 205–216. [CrossRef] 23. Hack, D.E.; Patton, L.K.; Himed, B.; Saville, M.A. Detection in passive MIMO radar networks. IEEE Trans. Signal Process. 2014, 62, 2999–3012. [CrossRef] 24. Hack, D.E.; Patton, L.K.; Himed, B.; Saville, M.A. Centralized passive MIMO radar detection without direct-path reference signals. IEEE Trans. Signal Process. 2014, 62, 3013–3023. 25. Wang, Y.; Scharf, L.L.; Santamaría, I.; Wang, H. Canonical correlations for target detection in a passive radar network. In Proceedings of the 2016 50th Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, 6–9 November 2016; IEEE: Hoboken, NJ, USA, 2016; pp. 1159–1163. 26. Daun, M.; Nickel, U.; Koch, W. Tracking in multistatic passive radar systems using DAB/DVB-T illumination. Signal Process. 2012, 92, 1365–1386. [CrossRef] 27. Wang, H.; Wang, J.; Zhong, L. Mismatched filter for analogue TV-based passive bistatic radar. IET Radar Sonar Navigat. 2011, 5, 573–581. [CrossRef] 28. Sun, H.; Tan, D.K.; Lu, Y.; Lesturgie, M. Applications of passive surveillance radar system using cell phone base station illuminators. IEEE Aerosp. Electron. Syst. Mag. 2010, 25, 10–18. [CrossRef] 29. Kelly, E.J. An adaptive detection algorithm. IEEE Trans. Aerosp. Electron. Syst. 1986, 2, 115–127. [CrossRef] 30. Xiao, Y.; Cui, G.; Yi, W.; Kong, L.; Yang, J. Adaptive detection and estimation for an unknown occurring interval signal in correlated Gaussian noise. Signal Process. 2015, 108, 440–450. [CrossRef] 31. He, Q.; Lehmann, N.H.; Blum, R.S.; Haimovich, A.M. MIMO radar moving target detection in homogeneous clutter. IEEE Trans. Aerosp. Electron. Syst. 2010, 46, 1290–1301. [CrossRef] 32. Li, H.; Wang, Z.; Liu, J.; Himed, B. Moving target detection in distributed MIMO radar on moving platforms. IEEE J. Sel. Top. Signal Process. 2015, 9, 1524–1535. [CrossRef] 33. Liu, W.; Wang, Y.; Liu, J.; Xie, W.; Chen, H.; Gu, W. Adaptive detection without training data in colocated MIMO radar. IEEE Trans. Aerosp. Electron. Syst. 2015, 51, 2469–2479. [CrossRef] 34. Liu, J.; Zhou, S.; Liu, W.; Zheng, J.; Liu, H.; Li, J. Tunable Adaptive Detection in Colocated MIMO Radar. IEEE Trans. Signal Process. 2017, 66, 1080–1092. [CrossRef] Remote Sens. 2021, 13, 2708 20 of 20

35. Wang, Z.; Li, H.; Himed, B. A sparsity based GLRT for moving target detection in distributed MIMO radar on moving platforms. In Proceedings of the 2015 49th Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, 8–11 November 2015; IEEE: Hoboken, NJ, USA, 2015; pp. 90–94. 36. Dogandzic, A.; Nehorai, A. Generalized multivariate analysis of variance-A unified framework for Signal Processing in correlated noise. IEEE Signal Process. Mag. 2003, 20, 39–54. [CrossRef] 37. Robey, F.C.; Fuhrmann, D.R.; Kelly, E.J.; Nitzberg, R. A CFAR adaptive matched filter detector. IEEE Trans. Aerosp. Electron. Syst. 1992, 28, 208–216. [CrossRef] 38. Monticciolo, P.; Kelly, E.; Porakis, J. A noncoherent adaptive detection technique. IEEE Trans. Aerosp. Electron. Syst. 1992, 28, 115–124. [CrossRef] 39. Tang, B.; Liu, J.; Huang, Z.; Wang, G.; Fan, F. Adaptive Target Detection in Gaussian Clutter Edges. IEEE Trans. Aerosp. Electron. Syst. 2020, 56, 1662–1673. [CrossRef] 40. Steven, M.K. Fundamentals of Statistical Signal Processing; PTR Prentice-Hall: Englewood Cliffs, NJ, USA, 1993. 41. Wang, F.; Li, H.; Zhang, X.; Himed, B. Signal Parameter Estimation for Passive Bistatic Radar With Waveform Correlation Exploitation. IEEE Trans. Aerosp. Electron. Syst. 2018, 54, 1135–1150. [CrossRef] 42. Wang, F.; Wang, P.; Zhang, X.; Li, H.; Himed, B. An Overview of Parametric Modeling and Methods for Radar Target Detection With Limited Data. IEEE Access 2021, 9, 60459–60469. [CrossRef] 43. Palmer, J.E.; Searle, S.J. Evaluation of adaptive filter algorithms for clutter cancellation in passive bistatic radar. In Proceedings of the 2012 IEEE Radar Conference, Atlanta, GA, USA, 7–11 May 2012; IEEE: Hoboken, NJ, USA, 2012; pp. 0493–0498. 44. Scharf, L.L.; Demeure, C. Statistical Signal Processing: Detection, Estimation, and Time Series Analysis; Addison-Wesley: Reading, MA, USA, 1991; Volume 63. 45. Kay, S.M. Fundamentals of Statistical Signal Processing. Detection Theory, Volume II; Prentice Hall: Upper Saddle River, NJ, USA, 1998. 46. Muirhead, R.J. Aspects of Multivariate Statistical Theory; John Wiley & Sons: Hoboken, NJ, USA, 1982; Volume 197. 47. Harville, D.A. Matrix Algebra from a Statistician’s Perspective; Springer: Berlin/Heidelberg, Germany, 1997; Volume 1. 48. Horn, R.A.; Johnson, C.R. Matrix Analysis; Cambridge University Press: Cambridge, UK, 2013. 49. Tang, B.; Tang, J.; Peng, Y. MIMO radar waveform design in colored noise based on . IEEE Trans. Signal Process. 2010, 58, 4684–4697. [CrossRef] 50. Tang, B.; Li, J. Spectrally Constrained MIMO Radar Waveform Design Based on Mutual Information. IEEE Trans. Signal Process. 2019, 67, 115–127. [CrossRef]