Reassignment and synchrosqueezing for general time-frequency filter banks, subsampling and processing
Nicki Holighausa,∗, Zdenek Prusaa, Peter L. Søndergaardb
aAcoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12–14, 1040 Vienna, Austria bOticon A/S, 2765 Smørum, Denmark
Abstract
In this contribution, we extend the reassignment method (RM) and synchrosqueezing transform (SST) to arbitrary time-frequency localized filters and, in the first case, arbitrary downsampling factors. A sufficient condition for the invertibility of the SST is provided. RM and SST are techniques for deconvolution of short- time Fourier and wavelet representations. In both methods, the partial phase derivatives of a complex-valued time-frequency representation are used to determine the instantaneous frequency and group delay associated to the individual representation coefficients. Subsequently, the coefficient energy is reassigned to the determined position. Combining RM and frame theory, we propose a processing scheme that benefits both from the improved localization of the reassigned representation and the frame properties of the underlying complex- valued representation. This scheme is particularly interesting for applications that require low redundancy not achievable by an invertible SST.
1. Introduction
Time-frequency representations such as the short-time Fourier transform (STFT) [1, 2] are ubiquitous in signal analysis. In particular their squared magnitude, the spectrogram is frequently used to determine the local frequency content of an analyzed signal. The spectrogram however provides a biased, smoothed representation highly dependent on the chosen STFT window function. The smoothing effect in the spectrogram is subject to Heisenberg’s uncertainty inequality and thus cannot be arbitrarily reduced. Therefore, various alternative time-frequency representations have been proposed. The quadratic time- frequency representations in Cohen’s class [3, 4] are given by the Wigner-Ville distribution (WVD) [5, 6] convolved with a smoothing kernel. While the spectrogram suffers from a large amount of smoothing, the WVD produces undesirable interference terms. Cohen’s class representations often sit between these two extrema and are designed with a certain trade-off between smoothing and interference attenuation in mind, e.g. the smoothed pseudo WVD [7].
∗Corresponding author Email addresses: [email protected] (Nicki Holighaus), [email protected] (Zdenek Prusa), [email protected] (Peter L. Søndergaard)
Preprint submitted to Signal Processing May 8, 2017 Instead of reducing the overall smoothing, representations are often designed to locally provide a good trade- off between time and frequency smoothing with respect to expected signal characteristics. Such representations include (non-uniform) filter banks (FB) [8, 9], in particular, wavelet filter banks [10, 11] (often referred to as time-scale representations), as well as nonstationary Gabor transforms [12, 13] or the representations presented in [14, 15]. Here, we recall the RM originally proposed by Kodera et al. [16] to obtain a sharper time-frequency picture. RM attempts the deconvolution of the spectrogram by means of the partial derivatives of the phase of the underlying STFT. The resulting reassigned spectrogram achieves a sharp representation similar to the WVD, but with little interference. Reassignment was made popular by Auger and Flandrin [17, 18], who generalized the method beyond the spectrogram to general Cohen’s class time-frequency representations and an equivalent class of time-scale representations. Most importantly, they discovered an efficient means of computation and the application of reassignment. In contrast to the previously mentioned methods, the reassigned spectrogram is not bilinear and even approximate reconstruction is a nontrivial task, although it has been attempted, see [19] for a method based on additive sound modeling or [20]. Therefore, although a valuable analysis tool, RM is deemed unsuitable for signal processing so far. The SST is a variant of RM dealing with the reconstruction problem, proposed by Daubechies et al. [21, 22, 23]. The main differences between RM and SST are that the latter acts on a complex valued time-frequency representation, hence preserving phase information, and only performs frequency reassignment. For recent work on the implementation and application of synchrosqueezing transforms, see e.g. [24, 25, 26, 27] and references therein, or [28] for an approach trying to fuse the invertibility of SST with the superior localization of RM. In this contribution, we extend RM and SST to filter banks. Furthermore, we demonstrate that the reassigned FB representations, with respect to a (over-)complete system, can be used as an interface for processing. Hence, the benefits of a sharpened representation can be combined with the invertibility of the underlying filter bank representation.
2. Preliminaries
Although the results presented in this contribution are applicable for discrete signals, we will present the theoretical material for continuous time signals f ∈ L2(R) to stay consistent with previous publications on RM and SST. We use the following unitary variant of the Fourier transform F : L2(R) 7→ L2(R), fˆ(ξ) := Ff(ξ) = R f(t)e−2πiξt dt. R
Localized functions: We will often require a function to be well-localized around a point x ∈ R in a rather informal manner. We consider a function well-localized around x if it decays sufficiently fast away from (a small area around) x. Although none of the methods presented below strictly require any localization, all
2 of them benefit from filters (or window functions) with unique global time and frequency maxima x and ω and better than linear decay around those maxima. Since practically every common time-frequency relevant filter has these properties, this is no severe restriction.
Short-time Fourier transforms, analysis filter banks and frames: For a signal f ∈ L2(R) and 2 2πiω(·−x) window function g ∈ L (R), let gx,ω = g(· − x)e . The STFT of f with respect to g is given by
q 2πiφ(x,ω) Vgf(x, ω) = hf, gx,ωi = Sgf(x, ω)e , (1)
2 2 where Sgf = |Vgf| is the spectrogram and φ : R → R is the phase of the STFT. The collection G(g) :=
{gx,ω}x,ω∈R is the (continuous) Gabor system with respect to g. As usual, we assume that g, gˆ are well-localized around 0. 2 + Let g = {gm ∈ L (R)}m∈Z, a = {am ∈ R }m∈Z a sequence of functions and time steps, respectively. The + system {gn,m : am ∈ R }n,m∈Z with gn,m = gm(· − nam) is a filter bank (FB). We call {gx,m}x∈R,m∈Z with gx,m = gm(· − x) a time-continuous filter bank.
For convenience, we assume that gm is continuous, well-localized around time 0 and gcm is well-localized around center frequency ωm with ωj < ωm if j < m and limm→−∞ ωm = −∞, limm→∞ ωm = ∞. The associated filter bank analysis is given by
cn,m := cf [n, m] := hf, gn,mi. (2)
Wavelet or other filter banks with an accumulation of center frequencies at frequency 0 are completely valid FBs for all the results presented here, but require a slightly less straightforward indexing scheme. 2 2 A filter bank forms a frame, if there are constants 0 < A ≤ B < ∞, such that Akfk2 ≤ kcf k2 ≤ 2 2 Bkfk2, for all f ∈ L (R). The frame property guarantees the stable invertibility of the coefficient mapping by means of a dual frame {g } , i.e. gn,m n,m∈Z
X 2 f = cf [n, m]ggn,m, for all f ∈ L (R). (3) n,m
3. Proposed method and statement of results
In this section, we show how to perform reassignment and synchrosqueezing for filter banks, show how the inverse reassignment map can be used in conjunction with invertible filter banks and provide a sufficient condition enabling inversion from synchrosqueezed filter bank coefficients. The derivation of the formulas and conditions can be found in the subsequent sections. + T Let {gn,m : am ∈ R }n,m∈Z be a FB with center frequencies ωm. We define gm(t) := tgm(t) and
F 2πiωmt −2πiωmt 0 gm(t) := e (e gm) (t), for all t ∈ R, m ∈ Z. With
T T F F cf [n, m] = hf, gm(· − nam)i and cf [n, m] = hf, gm(· − nam)i, (4)
3 we obtain the reassignment operators