SIGNAL APPROXIMATION USING THE BILINEAR TRANSFORM

Archana Venkataraman, Alan V. Oppenheim

MIT Digital Group 77 Massachusetts Avenue, Cambridge, MA 02139 [email protected], [email protected]

ABSTRACT The analysis presented in this paper has application in contexts This paper explores the approximation properties of a unique basis where only a fixed number of DT values can be used to represent a expansion, which realizes a bilinear frequency warping between a CT signal. For example, in a binary detection problem, one might continuous-time signal and its discrete-time representation. We in- want to limit the number of digital multiplies used to compute the vestigate the role that certain parameters and signal characteristics inner product of two CT signals. Numerical simulations of this sce- have on these approximations, and we extend the analysis to a win- nario suggest that the bilinear expansion achieves a better detection dowed representation, which increases the overall time resolution. performance than Nyquist sampling for certain signal classes. Approximations derived from the bilinear representation and from Nyquist sampling are compared in the context of a binary detection 2. THE BILINEAR REPRESENTATION problem. Simulation results indicate that, for many types of signals, the bilinear approximations achieve a better detection performance. As derived in [1], the network shown in Fig. 1 realizes a one-to-one Index Terms— Signal Representations, Approximation Meth- frequency warping between the Laplace and Z-transform domains ods, Bilinear Transformations, Signal Detection according to the bilinear transform. Specifically, the Laplace trans- form F (s) of the signal f(t) and the Z-transform F (z) of the se- f[n] 1. INTRODUCTION quence are related through the change of variables a + s Although Nyquist sampling is commonly used in practice, there re- z = (1) main several drawbacks to this representation. For example, the a − s continuous-time (CT) signal must be appropriately bandlimited in where a is the real valued parameter indicated in the cascades of order to avoid frequency distortions. Additionally, if the Fig. 1. The relationship in (1) maps the entire range of CT frequen- number of time samples used in a particular computation is con- jω strained, the Nyquist approximation may do a poor job of represent- cies ( -axis) onto the range of unique DT frequencies (unit circle). ing the original signal. For these reasons, it is useful to consider Fig. 1 can also be viewed as a basis expansion of the CT signal f(t) alternative signal representations. in which the Laplace transforms of the basis functions are √ „ « In the 1960’s a basis expansion was proposed [1, 2], imple- n−1 menting a nonlinear frequency warping between a CT signal and 2a a − s Λn(s)= ,n≥ 1 (2) its discrete-time (DT) representation according to the bilinear trans- a + s a + s form. Since there is a one-to-one relationship between the two fre- quency domains, this bilinear expansion theoretically avoids both As given in [1], the corresponding time-domain expressions are the bandlimited requirement and the frequency aliasing distortions √ n−1 −at associated with Nyquist sampling. Furthermore, the DT expansion λn(t)= 2a(−1) e Ln−1(2at)u(t),n≥ 1 (3) coefficients can be obtained using a cascade of first-order analog sys- tems. Modern-day integrated circuit technology has made it practi- where Ln(·) represents a zero-order Laguerre polynomial. It is shown {λ (t)}∞ cal to compute these coefficients through conventional circuit design in [1] that n n=1 is an orthonormal set of functions which span techniques. Consequently, the bilinear expansion can be considered the space of causal, finite-energy signals. Fig. 2 depicts the bilinear as an alternative to Nyquist representations in various applications. basis functions as the index n and the parameter a vary. In this paper, we explore the approximation performance of the By applying properties of Laguerre polynomials [3, 4] and by bilinear expansion presented in [1] by drawing from properties of defining ξ =2n − 3, the bilinear basis functions can be bounded the corresponding basis functions. We consider how various signal according to the expression characteristics, such as a rational and the energy 8 distribution over time, affect the approximations. Additionally, we > 1, 0 ≤ t ≤ 1 > 4aξ examineamodified version of the bilinear representation, in which <> (4aξt)−1/4, 1 ≤ t ≤ ξ 4aξ 2a “ h i” 1 the CT signal is segmented using a short-duration window. This |λn(t)|≤C − > 2ξ (2ξ)1/3 + |2ξ − 2at| 4 , ξ ≤ t ≤ 3ξ segmentation procedure helps to improve the overall approximation > 2 2 :> a a quality by exploiting properties of the representation. e−βt,t≥ 3ξ 2a This work was supported in part by the MIT Provost Presidential Fellow- (4) ship, the NDSEG Fellowship, The Texas Instruments Leadership University where β is a positive constant and a>0. Eq. (4) will be useful Program, BAE Systems PO 112991, and Lincoln Laboratory PO 3077828. when analyzing the approximation properties of this representation.

1-4244-1484-9/08/$25.00 ©2008 IEEE 3729 ICASSP 2008

Authorized licensed use limited to: MIT Libraries. Downloaded on May 17,2010 at 19:14:02 UTC from IEEE Xplore. Restrictions apply. √ f(−t) 2a a−s a−s a−s a−s a+s a+s a+s a+s a+s 1.5 n = 1 t =0 t =0 t =0 t =0 t =0 n = 3 1 n = 5 f[1] f[2] f[3] f[4] f[5] n = 7

(a) Analysis Network 0.5

√ δ(t) 2a a−s a−s a−s a+s a+s a+s a+s 0

f[1] f[2] f[3] f[4] −0.5

0 5 10 15 20 time, t Σ (a) λn(t) for different index values; a =1

f(t) 1.5 a = 1 a = 2 (b) Synthesis Network 1 a = 3

Fig. 1. First order cascade derived from [1] and [2]. The analysis 0.5 network converts the CT signal f(t) into its DT representation f[n]. 0 The synthesis network reconstructs a CT signal from its expansion. −0.5

3. BILINEAR APPROXIMATION −1 0 5 10 15 In general, representing a CT signal through a basis expansion re- time, t λ (t) a n =5 quires an infinite number of (non-zero) expansion coefficients. We (b) n for different values of ; address this issue by retaining an appropriate subset IM of bilinear expansion terms to form an M-term signal approximation. The ap- Fig. 2. Dependence of the bilinear basis functions on the index n proximation error [M] is given by and the parameter a. 0 1 Z 2 ∞ X finite duration, successive delays of τ (ω ) will eventually shift the @ A g o [M]= f(t) − f[n]λn(t) dt (5) bulk of its energy beyond the sampling time t =0in Fig. 1. Once 0 n∈IM this occurs, the remaining expansion coefficients become very small. Therefore, in order to minimize the number of significant DT coeffi- We focus on a nonlinear approximation technique, in which the cients for a narrow-band signal, we should maximize τg(ωo) in (6). set IM is chosen based on the characteristics of f(t). Specifically, This is accomplished by setting a = ωo. because the bilinear basis functions are orthonormal, we retain the [M] Although many CT signals of interest are not narrow-band, the largest-magnitude DT coefficients to minimize . We qualita- above analysis suggests the following ‘maximin’ strategy to initial- tively compare approximation performances through the decay of ize the value of a: For a signal with most of its information content expansion coefficients when sorted by absolute value. As proposed effectively band-limited to |ω|≤ω , choose a = ω . for wavelet approximations in [5, 6], a faster decay corresponds to a M M Because the group delay in Equation (6) is monotonically de- smaller M-term approximation error. creasing in ω, this strategy guarantees a group delay greater than or equal to τg(ωM ) for all frequencies in the band [−ωM ,ωM ].Un- 3.1. Effect of the Parameter a like a Nyquist anti-aliasing filter, this algorithm does not eliminate As seen in Fig. 2(b), the behavior of the basis functions is affected by high-frequency information. Rather, it tries to capture as much sig- the parameter a. Predictably, this has a direct impact on the approxi- nal energy as possible in the earlier DT coefficients mation performance. As an example, we analyze the bilinear approx- imations of the windowed sinusoid f(t) ∝ sin(10t) for 0 ≤ t<1, 3.2. Signal Characteristics which Impact Approximation and then generalize from this information. The signal f(t) has been normalized for unit energy. It is straightforward to verify that signals which have rational Laplace Fig. 3(a)-(d) show the bilinear coefficients f[n] for a =1, 10, 100 transforms with all poles located at s = −a can be represented ex- and 1000, respectively. As seen, the fastest coefficient decay occurs actly using a finite number of bilinear coefficients. In the time do- when a =10, which is the carrier frequency of f(t). An intuitive main, this corresponds to polynomials in t weighted by the expo- −at basis for this result follows by considering the group delay of the nential decay e . The approximation performance worsens as the all-pass filters in the bilinear first-order cascades. pole location(s) move farther away from s = −a. This is confirmed » „ «– in simulation by examining the sorted coefficient decay of signals d a − jω 2a f (t) ∝ t3e−kt and f (t) ∝ e−at sin(pt) for different values of k τ (ω)= ∠ = (6) k p g dω a + jω a2 + ω2 and p, respectively.

For a signal whose energy is tightly concentrated around a fre- For signals which do not posses rational Laplace transforms, one quency of ωo, the effect of an all-pass filter can be roughly approx- important characteristic affecting the bilinear approximation perfor- imated as a time delay of τg(ωo). If the signal has approximately mance is the energy distribution over time. We observe this relation-

3730

Authorized licensed use limited to: MIT Libraries. Downloaded on May 17,2010 at 19:14:02 UTC from IEEE Xplore. Restrictions apply. 0.3 0.5 Approximation Type #DTTerms Duration, T 0.2 Lin NL1 NL2

0.1 Original 0.6840 0.4567 —– 0.2 sec 0.6832 0.4532 0.0323 0 0 50 0.25 sec 0.5088 0.4156 0.3044 −0.1 0.34 sec 0.6533 0.4260 0.0930 −0.2 0.5 sec 0.5056 0.4062 0.3819

−0.5 Original 0.4487 0.2166 —– 0 100 200 300 400 500 0 100 200 300 400 500 Coefficient Index, n Coefficient Index, n 0.2 sec 0.4487 0.2319 0.0078 (a) a =1 (b) a =10 100 0.25 sec 0.4274 0.3474 0.1822 0.34 sec 0.4234 0.1871 0.0102 0.5 sec 0.4243 0.3412 0.2812 0.2 0.06 Original 0.1588 0.0231 —– 0.04 0.1 0.2 sec 0.1089 0.0422 0.0031 0.02 200 0.25 sec 0.3559 0.2478 0.1014 0 0 0.34 sec 0.1032 0.0322 0.0028 −0.02 −0.1 0.5 sec 0.3568 0.2455 0.1663 −0.04

−0.2 −0.06 −0.08 [M] f(t) ∝ sinc(100(t − 0.5)) a = 100 0 100 200 300 400 500 0 100 200 300 400 500 Table 1. for .Avalueof Coefficient Index, n Coefficient Index, n is used for all bilinear expansions. (c) a = 100 (d) a = 1000 Mathematically, we treat the original CT signal as a sum of seg- Fig. 3. Bilinear Expansion Coefficients for f(t) ∝ sin(10t) ments fk(t) created by a finite-duration window w(t) as follows: X∞ X 0.5 f(t)= f(t)w(t − kT) w(t − kT)=1, ∀t k = 0 | {z } s.t. (7) k = 0.25 k=0 k ( − ) 0.4 k = 0.5 fk t kT k = 0.75 k = 1 0.3 The choice of window is heavily dependent on the application. For the binary detection problem in Section 5, we segment using a 0.2 non-overlapping rectangular window. Given our assumption of addi- tive white noise, this window choice simplifies the resulting analysis. 0.1 The rectangular window may also be advantageous for bilinear ap- proximation, since it yields the shortest segment duration for a given Sorted Bilinear Coefficient Value T 0 value of in (7). This may translate to fewer significant expansion 10 20 30 40 50 M Coefficient Index, n coefficients and a smaller -term approximation error. We illustrate the benefit of segmentation by approximating the signal f (t) ∝ sinc(10(t − k)) a =10 Fig. 4. Sorted DT coefficients of k ; j sinc(100(t − 0.5)), 0 ≤ t<1 f(t) ∝ 0, ship by considering the set of signals otherwise j sinc(10(t − k)), 0 ≤ t<1 The following terminology is used to denote the three approxi- f (t) ∝ k 0, o.w. mation methods employed in this work: Lin The first M coefficients are retained from each segment. Given where sinc(x)=sin(πx)/(πx) and fk(t) has been normalized for the cascades in Fig. 1(a), this linear approximation scheme unit energy. A windowed sinc pulse is chosen because of the large minimizes the system hardware requirements. energy concentration around its main lobe in time. NL1 The M largest coefficients are retained from each segment. The sorted bilinear coefficients are depicted in Fig. 4. Clearly, as the energy concentration moves farther from the time origin, the NL2 The largest coefficient is selected from each segment. For the approximation performance worsens. This empirical observation is remaining terms, the largest coefficients overall are selected. consistent with the basis function behavior as the index increases. Table 1 shows the approximation errors for different rectangular Specifically, from Fig. 2 and as suggested by the bound in (4), the λ (t) n window sizes and each of the three approximation techniques. There time dispersion of n increases with . Consequently, it would be are two main points to note from this data. expected that representing signals with primary energy concentration First, the segmented representation can achieve a much lower later in time requires basis functions with higher index values. error than the original bilinear representation when using the NL2 approximation technique, especially as the total number of DT coef- 4. THE WINDOWED REPRESENTATION ficients decreases. This is due to the increased time resolution. Second, the approximation performance for window durations As shown, the bilinear expansion is well-suited to signals with en- of T =0.25, 0.5 is notably poorer than for T =0.2, 0.34.Thisis ergy concentrated early in time. We exploit this property through a because the former values of T divide the main lobe in half, caus- windowed representation. Not only does windowing a signal provide ing one segment to possess a rapidly-increasing signal with a large greater time resolution, but it can better align signal energy with the amount of energy. Results from Section 3 suggest that a large num- time origin of individual segments to improve the approximation. ber of expansion terms will be required to represent this segment.

3731

Authorized licensed use limited to: MIT Libraries. Downloaded on May 17,2010 at 19:14:02 UTC from IEEE Xplore. Restrictions apply. 5. BINARY DETECTION PROBLEM 1 Ideal In this section we evaluate the bilinear approximation performance SLS 0.8 Lin in a classical binary detection problem. The goal is to determine NL2 s(t) whether a desired signal is present in noise based on the received 0.6 signal x(t). The channel noise η(t) is additive white Gaussian noise P (jω)=σ2 (AWGN) with power spectrum, η η. 0.4 The well-known matched filter solution involves comparing the integral of the desired and received signals with a threshold γ Probability of Detection 0.2 Z ∞ x(t)s(t)dt ≷ γ 0 (8) 0 0.2 0.4 0.6 0.8 1 0 Probability of False Alarm We consider a scenario in which the desired signal is very com- (a) s1(t), 5 DT Multiplies plex, meaning that we cannot design the analog matched filter h(t)= s(−t) 1 . Therefore, (8) must be implemented using DT representa- Ideal tions x[n] and s[n]. Furthermore, we restrict the number of DT mul- SLS tiplies used to approximate the above integral. For an orthogonal 0.8 Lin expansion, the detection performance when using M multiplies is NL2 inversely-related to the approximation error [M]. 0.6

The detection performances of the bilinear and Nyquist approxima- 0.4 tions are compared in a series of MATLAB simulations. We consider

the following desired signals (normalized to have unit energy): Probability of Detection 0.2

2 −150t s1(t) ∝ t e u(t) 0 j 0 0.2 0.4 0.6 0.8 1 sin(100t), 0 ≤ t<1 Probability of False Alarm s2(t) ∝ 0, otherwise (b) s2(t), 25 DT Multiplies, T =0.34 j sinc(100(t − 0.5)), 0 ≤ t<1 s3(t) ∝ 1 0, otherwise Ideal SLS 0.8 Lin s1(t) has a rational Laplace transform. s2(t) is narrow-band in fre- NL2 quency with a constant energy distribution over time. s3(t) has a wider frequency range with energy concentrated around the main 0.6 lobe. All signal are sampled approximately 100X faster than the 0.4 of the corresponding s(t). The reduced sets of Nyquist coefficients are obtained by select-

Probability of Detection 0.2 ing those time samples corresponding to the largest magnitudes in s(nTs). This is denoted ‘Select Largest Samples’ or SLS. 0 For the bilinear representation, the sequences are segmented with 0 0.2 0.4 0.6 0.8 1 a non-overlapping rectangular window, and the expansion coeffi- Probability of False Alarm s (t) T =0.34 cients are computed according to Fig.1(a). The trapezoidal rule for (c) 3 , 25 DT Multiplies, integration is used when numerically simulating the analog systems. To reduce the total number of DT coefficients, we employ the Lin Fig. 5. ROC Curves for the binary detection problem. and NL2 techniques, based on the desired signal s(t). binary detection scenarios. The NL2 performance consistently ex- Receiver Operating Characteristic (ROC) curves for each desired ceeded that of the Nyquist SLS method for a given number of DT signal are shown in Fig. 5. The data is based on Monte Carlo experi- 2 multiplies. Furthermore, in applications where CT signals are not ments with a = 100 and σ =1. The ‘Ideal’ curve is the theoretical η appropriately band-limited, using the bilinear representation may be ROC obtained when directly computing the integral in (8). favorable to eliminating signal content via an anti-aliasing filter. The bilinear performance conforms to the intuition gained in s (t) Sections 3 and 4. Specifically, the detection for 1 is close to ideal 6. REFERENCES with only 5 DT multiplies. Conversely, the bilinear approximations [1] Kenneth Steiglitz, The General Theory of Digital Filters with Applications to Spec- s2(t) s3(t) are not as good for and , which do not have energy con- tral Analysis, Ph.D. thesis, New York University, 1963. centrated early in time. As seen in Fig. 5, the performance is not [2] Alan V. Oppenheim and Donald H. Johnson, “Discrete representation of signals,” ideal, despite the increased number of multiplies. Proc. of the IEEE, 1972, vol. 60, pp. 681–691. In contrast, the Nyquist SLS method does well only when the [3] Sundaram Thangavelu, Lectures on Hermite and Laguerre Expansions, Princeton signal energy is localized in time and can be captured using a few University Press, 1993. samples, such as the pulse s3(t).However,SLS does not perform as [4] Allan M. Krall, Hilbert Space Boundary Value Problems and Orthogonal Polyno- mials, Birkhauser Verlag, 2002. well as the NL2 bilinear approximation for any of the desired signals. [5] S. G. Mallat, A Wavelet Tour of Signal Processing, Academic Press, 1999. Simulation results based on synthetic signals s1(t)-s3(t) indicate [6] Martin Vetterli, “Wavelets, approximation and compression,” IEEE Signal Pro- cessing Magazine, vol. 18, pp. 53–73, September 2001. that using the bilinear representations may be appropriate in certain

3732

Authorized licensed use limited to: MIT Libraries. Downloaded on May 17,2010 at 19:14:02 UTC from IEEE Xplore. Restrictions apply.