Spectral Estimation Using Multitaper Whittle Methods with a Lasso Penalty Shuhan Tang, Peter F

Total Page:16

File Type:pdf, Size:1020Kb

Spectral Estimation Using Multitaper Whittle Methods with a Lasso Penalty Shuhan Tang, Peter F IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. , NO. , MONTH YEAR 1 Spectral Estimation Using Multitaper Whittle Methods with a Lasso Penalty Shuhan Tang, Peter F. Craigmile, and Yunzhang Zhu Last updated July 25, 2019 Abstract—Spectral estimation provides key insights into the can compromise estimation (see [1], ch.9, and references frequency domain characteristics of a time series. Naive non- therein). parametric estimates of the spectral density, such as the peri- A popular alternative approach is to consider a semipara- odogram, are inconsistent, and the more advanced lag window or multitaper estimators are often still too noisy. We propose metric model for the SDF, in which the log SDF is expressed an L1 penalized quasi-likelihood Whittle framework based on in terms of a truncated basis expansion, where the number multitaper spectral estimates which performs semiparametric of basis functions are allowed to increase with the sample spectral estimation for regularly sampled univariate stationary size. The statistical problem then becomes how to enforce time series. Our new approach circumvents the problematic sparsity by selecting the basis functions and estimating the Gaussianity assumption required by least square approaches and achieves sparsity for a wide variety of basis functions. We model parameters so that we adequately estimate the SDF, present an alternating direction method of multipliers (ADMM) but also have computational efficiency as the sample size algorithm to efficiently solve the optimization problem, and is increased. Gao [3] [4], Moulin [5] and Walden et al. [6] develop universal threshold and generalized information criterion enforce sparsity using a penalized least square (LS) approach (GIC) strategies for efficient tuning parameter selection that for estimating the log SDF with wavelet soft thresholding. outperform cross-validation methods. Theoretically, a fast conver- gence rate for the proposed spectral estimator is established. We In terms of computational complexity, wavelet thresholding demonstrate the utility of our methodology on simulated series methods are typically O(N), for a time series of N regularly and to the spectral analysis of electroencephalogram (EEG) data. sampled values. A number of approaches enforce smoothness of the SDF via Index Terms—Alternating direction method of multipliers an L2 penalty: Cogburn and Davis [7], Wahba and Wold [8] (ADMM) algorithm; basis expansion; multitaper spectral esti- and Wahba [9] use penalized LS, and Pawitan and O’Sullivan mates; wavelets. [10] uses a penalized Whittle method. To enforce sparsity, some of these L2 methods of smoothing splines also use I. INTRODUCTION model selection, often in combination with cross-validation, STIMATING the spectral density function (SDF) or to select the basis functions that are used to model the SDF. E spectrum of a series collected over time is an important Alternatively, one can implement methods such as [11] to tool in time series analysis and signal processing. It is used enforce sparsity on the basis expansion directly. (On a related in many fields such as astronomy, cognitive science, earth topic, smoothness of spectral estimation can also be tuned sciences, electrical engineering, and finance. Examining the with high-resolution approaches introduced in [12], [13] and SDF allows us to explore periodicities in the data (e.g., [1, ch. using extended frameworks based on so-called beta and tau 10]), provides an alternative way to analyze and estimate the divergence families, such as [14]–[18]; see [19] for a general covariance structure of stationary time series (e.g., [1, ch. 4]), review of such divergences.) and can also be used to understand the effect of preprocessing Our method is also motivated by the need to enforce sparsity a time series (e.g., [2]). while adequately estimating the SDF. In addition, we seek There are many nonparametric estimators of the SDF of computational efficiency as we increase the sample size. We a univariate stationary time series. These include the peri- develop a quasi-likelihood method for estimating SDFs using odogram, direct spectral estimators, lag window and overlap- a Whittle likelihood [20] based on MT spectral estimates. A ping segment averaging spectral estimators, and multitaper quasi-likelihood function [21] [22, ch. 9] has similar statistical (MT) spectral estimators. (See [1] for a complete review.) properties to that of the log likelihood, and can be used for While many of these estimators are developed to provide statistical inference, but does not have to match exactly to an adequate tradeoff between bias and variance, often these the log of the joint probability density function of the data. nonparametric estimates are still too noisy when a stable MT estimates [23] [1] provide a good compromise between estimate of the SDF is required. An alternative strategy is to bias and variance and can yield more efficient estimates of use a parametric approach, however model misspecification the SDF [6]. We demonstrate that the addition of a Whittle caused by considering a limited class of models for the SDF, likelihood method [20] improves estimation over traditional LS approaches. S. Tang, P. F. Craigmile, Y. Zhu are with the Department of Statis- We use a lasso penalty [24] to enforce sparsity, deriving two tics, The Ohio State University, Columbus, OH, 43210 USA (e-mail: [email protected]). strategies to optimally select the tuning parameter that is key to Manuscript received December 12, 2018; revised June 27, 2019. obtaining estimates of the SDF with low integrated root mean IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. , NO. , MONTH YEAR 2 squared error (IRMSE): “universal threshold” and generalized There are a variety of options for families of basis functions information criterion (GIC)-based methods. Neither method φ(f) that can be used for spectral estimation. For example, compromises on computational or statistical efficiency by polynomial and Fourier bases can be used to capture global requiring the use of cross-validation to select the tuning param- patterns [26], smoothing splines allow for local and smooth eter. Theoretically, we derive the rate of convergence for our patterns in the SDF [7], [8], and wavelet bases model local proposed spectral estimator under some technical conditions behaviour, while capturing second order effects such as peaks, on the model sparsity and the MT spectral estimator. troughs, and cusps [3]–[6]. When the SDF is spatially inho- We introduce a computationally efficient method to estimate mogeneous, spatially adaptive bases, such as wavelets, have the parameters in our model using the alternating direction theoretical optimality properties (see, e.g., [27]–[29]). We can method of multipliers (ADMM) algorithm. To reach an - also combine families of basis functions to form dictionaries. optimal solution with a time series of length N, our method In Section IV we propose our estimator of the coefficient is O(N−1) when using wavelet bases and O(N 3 + −1N 2) vector β, which is based on multitaper spectral estimates of for general bases. Although computationally more challenging, time series data that we define in the next section. (In Sec- our method can be applied to SDF estimation using any collec- tion VII we provide assumptions on the basis representation tion of basis functions and, as mentioned above, outperforms (3) so that we can asymptotically recover the true log SDF.) LS-based methods such as wavelet thresholding in terms of estimation quality. III. MULTITAPER SPECTRAL ESTIMATION The rest of the paper is organized as follows. Section II T Suppose we observe N observations, X = (X1;:::;XN ) , presents models for SDFs in terms of basis methods. In Sec- from the stationary process fXtg. Then a multitaper (MT) tion III, we introduce the multitaper spectral estimator that we or multiple taper spectral estimate [23], is an average of a use in our penalized Whittle estimation method in Section IV. number of tapered spectral estimates. Specifically, let fhk;t : The ADMM algorithm is described in Section V and Section k = 1; : : : ; K; t = 1;:::;Ng denote K orthonormal data VI outlines two approaches for tuning parameter selection. We P 2 P 0 tapers; i.e., hk;t = 1 and hk;thk0;t = 0 for k 6= k . derive the rate of convergence for the proposed L penalized t t 1 Then the standard MT spectral estimator of the SDF Sb(mt)(f), MT-Whittle likelihood estimator in Section VII. Our methods is the average of the K eigenspectra, are evaluated using Monte Carlo simulations in Section VIII K and we perform a spectral analysis of electroencephalogram 1 X (mt) Sb(mt)(f) = Sb (f); (4) (EEG) data in Section IX. We close with some remarks in K k Section X. Proofs and further details of the ADMM algorithm k=1 are provided in the Appendix. where the kth (k = 1;:::;K) eigenspectrum is defined by (mt) 2 Sbk (f) = jJk(f)j ; with II. BASIS MODELS FOR SDFS N X Let fXt : t 2 Zg be a univariate real-valued stationary Jk(f) = hk;tXt exp(−i2πft): process collected at sampling interval ∆ > 0. Without loss of t=1 generality assume ∆ = 1. Let γ(h) = cov(Xt;Xt+h), h 2 Z, Different tapers induce different statistical properties for the denote the (stationary) autocovariance function (ACVF) of MT estimator. Discrete prolate spheroidal sequences (DPSS) fXtg and assume that the ACVF is absolutely summable: and sine tapers are most commonly used [1]. DPSS tapers are P1 h=−∞ jγ(h)j < 1: Then the spectral density function designed to reduce the sidelobes in the spectral estimate. They (SDF) S(f) for a frequency jfj < 1=2 exists and is defined solve the time-frequency concentration problem in which we as the Fourier transform pair of the ACVF: find the time limited sequence which has most of its energy 1 concentrated in a specified frequency band [1, ch. 8]. We use X S(f) = γ(h)e−i2πfh; (1) the easily calculated sine tapers [30], h=−∞ 1=2 Z 1=2 2 (k + 1)πt i2πfh h = sin ; with γ(h) = e S(f)df: (2) k;t N + 1 N + 1 −1=2 k = 1; : : : ; K; t = 1; : : : ; N; The SDF is a non-negative, even, and real-valued function and decomposes the variance of the time series fXtg: from which are designed to reduce the smoothing bias, at a com- R 1=2 promise to sidelobe reduction.
Recommended publications
  • An Overview of Psd: Adaptive Sine Multitaper Power Spectral Density Estimation in R
    An overview of psd: Adaptive sine multitaper power spectral density estimation in R Andrew J. Barbour and Robert L. Parker June 28, 2020 Abstract This vignette provides an overview of some features included in the package psd, designed to compute estimates of power spectral density (PSD) for a univariate series in a sophisticated manner, with very little tuning effort. The sine multitapers are used, and the number of tapers varies with spectral shape, according to the optimal value proposed by Riedel and Sidorenko (1995). The adaptive procedure iteratively refines the optimal number of tapers at each frequency based on the spectrum from the previous iteration. Assuming the adaptive procedure converges, this produces power spectra with significantly lower spectral variance relative to results from less-sophisticated estimators. Sine tapers exhibit excellent leakage suppression characteristics, so bias effects are also reduced. Resolution and uncertainty vary with the number of tapers, which means we do not need to resort to either (1) windowing methods, which inherently degrade resolution at low-frequency (e.g. Welch’s method); or (2) smoothing kernels, which can badly distort important features without careful tuning (e.g. the Daniell kernel in stats::spectrum). In this regards psd is best suited for data having large dynamic range and some mix of narrow and wide-band structure, features typically found in geophysical datasets. Contents 1 Quick start: A minimal example. 2 2 Comparisons with other methods 5 2.1 stats::spectrum ............................................5 2.2 RSEIS::mtapspec ............................................7 2.3 multitaper::spec.mtm ........................................ 15 2.4 sapa::SDF ................................................ 15 2.5 bspec::bspec .............................................
    [Show full text]
  • Localized Spectral Analysis on the Sphere Mark Wieczorek, Frederik Simons
    Localized spectral analysis on the sphere Mark Wieczorek, Frederik Simons To cite this version: Mark Wieczorek, Frederik Simons. Localized spectral analysis on the sphere. Geophysical Jour- nal International, Oxford University Press (OUP), 2005, 162 (3), pp.655-675. 10.1111/j.1365- 246X.2005.02687.x. hal-02458533 HAL Id: hal-02458533 https://hal.archives-ouvertes.fr/hal-02458533 Submitted on 30 Jul 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Geophys. J. Int. (2005) 162, 655–675 doi: 10.1111/j.1365-246X.2005.02687.x Localized spectral analysis on the sphere ∗ Mark A. Wieczorek1 and Frederik J. Simons2, 1D´epartement de G´eophysique Spatiale et Plan´etaire, Institut de Physique du Globe de Paris, France. E-mail: [email protected] 2Geosciences Department, Guyot Hall, Princeton University, Princeton, NJ 08540, USA Accepted 2005 May 13. Received 2005 May 9; in original form 2004 October 12 SUMMARY Downloaded from https://academic.oup.com/gji/article-abstract/162/3/655/2097585 by guest on 30 July 2020 It is often advantageous to investigate the relationship between two geophysical data sets in the spectral domain by calculating admittance and coherence functions.
    [Show full text]
  • Comparing Open-Source Toolboxes for Processing and Analysis of Spike
    bioRxiv preprint doi: https://doi.org/10.1101/600486; this version posted April 12, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made Unakafova et al. available under aCC-BY 4.0 International licenseOpen-source. spike and LFP toolboxes 1 Comparing open-source toolboxes for 2 processing and analysis of spike and local field 3 potentials data 4 5 Valentina A. Unakafova1,* and Alexander Gail1,2,3,4 6 1Cognitive Neurosciences Laboratory, German Primate Center, Goettingen, Germany 7 2Leibniz ScienceCampus, Primate Cognition, Goettingen, Germany 8 3Georg-Elias-Mueller-Institute of Psychology, University of Goettingen, Goettingen, Germany 9 4Bernstein Center for Computational Neuroscience, Goettingen, Germany 10 *Correspondence: [email protected] 11 ABSTRACT. 12 Analysis of spike and local field potential (LFP) data is an essential part of neuroscientific research. Today 13 there exist many open-source toolboxes for spike and LFP data analysis implementing various functionality. 14 Here we aim to provide a practical guidance for neuroscientists in the choice of an open-source toolbox best 15 satisfying their needs. We overview major open-source toolboxes for spike and LFP data analysis as well as 16 toolboxes with tools for connectivity analysis, dimensionality reduction and generalized linear modeling. We 17 focus on comparing toolboxes functionality, statistical and visualization tools, documentation and support 18 quality. To give a better insight, we compare and illustrate functionality of the toolboxes on open-access 19 dataset or simulated data and make corresponding MATLAB scripts publicly available.
    [Show full text]
  • Multitaper Analysis of Fundamental Frequency Variations During Voiced Fricatives
    Shadle & Ramsay: Multitaper harmonic analysis of fricatives MULTITAPER ANALYSIS OF FUNDAMENTAL FREQUENCY VARIATIONS DURING VOICED FRICATIVES Christine Shadle & Gordon Ramsay School of Electronics & Computer Science, University of Southampton ABSTRACT: A method for tracking fundamental frequency variations in speech is proposed, based on multitaper anal- ysis. Using the multitaper technique, a statistical test is developed for detecting the presence of harmonic components at multiples of a fundamental frequency, embedded in coloured noise. It is shown that this can be applied to speech to estimate the fundamental frequency, when present, as well as the amplitude and phase of each harmonic. The method is validated on synthetic data, to determine accuracy and robustness, and evaluated on a small corpus of real speech data, comparing simultaneous acoustic and electroglottographic measurements to assess performance. Acoustic mea- surements are marginally less accurate than electroglottographic measurements, but often continue to provide useful fundamental frequency estimates in situations where electroglottography fails. INTRODUCTION Variations in the fundamental frequency of the voice (F0) are known to provide important perceptual cues in speech. Long- term fundamental frequency variations associated with different intonation patterns typically indicate higher-level linguistic or extra-linguistic information. However, short-term perturbations of the F0 contour also occur automatically when a con- striction is formed in the vocal tract, or when vocal fold vibration is affected by aerodynamic factors; these perturbations are known to provide acoustic cues for individual consonants (Ohde, 1984; Baken & Orlikoff, 1988). Previous studies have used both acoustic and electroglottographic measurements to examine short-term fundamental frequency variations and devoicing phenomena in fricatives (e.g.
    [Show full text]
  • State-Space Multitaper Spectrogram Algorithms: Theory and Applications
    State-Space Multitaper Spectrogram Algorithms: Theory and Applications by Michael K. Behr S.B., Massachusetts Institute of Technology (2015) Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements for the degree of Master of Engineering in Electrical Engineering and Computer Science at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY June 2016 © Massachusetts Institute of Technology 2016. All rights reserved. Author...................................................................... Department of Electrical Engineering and Computer Science May 20, 2016 Certified by. Dr. Emery N. Brown Professor Thesis Supervisor Accepted by................................................................. Dr. Christopher J. Terman Chairman, Masters of Engineering Thesis Committee State-Space Multitaper Spectrogram Algorithms: Theory and Applications by Michael K. Behr Submitted to the Department of Electrical Engineering and Computer Science on May 20, 2016, in partial fulfillment of the requirements for the degree of Master of Engineering in Electrical Engineering and Computer Science Abstract I present the state-space multitaper approach for analyzing non-stationary time series. Non- stationary time series are commonly divided into small time windows for analysis, but ex- isting methods lose predictive power by analyzing each window independently, even though nearby windows have similar spectral properties. The state-space multitaper algorithm com- bines two approaches for spectral analysis: the state-space approach models the relations between nearby windows, and the multitaper approach balances a bias-variance tradeoff in- herent in Fourier analysis of finite interval data. I illustrate an application of the algorithm to real-time anesthesia monitoring, which could prevent traumatic cases of intraoperative awareness. I discuss issues including a real-time implementation and modeling the system’s noise parameters.
    [Show full text]
  • Some Advances in the Multitaper Method of Spectrum Estimation
    Some Advances in the Multitaper Method of Spectrum Estimation by Kyle Quentin Lepage A thesis submitted to the Department of Mathematics and Statistics in conformity with the requirements for the degree of Doctor of Philosophy Queen’s University Kingston, Ontario, Canada February 2009 Copyright c Kyle Quentin Lepage, 2009 Abstract Four contributions to the multitaper method of applied spectrum estimation are pre- sented. These are a generalization of the multitaper method of spectrum estimation to time-series possessing irregularly spaced samples, a robust spectrum estimate suit- able for cyclostationary, or quasi-cyclostationary time-series, an improvement over the standard, multitaper spectrum estimates using quadratic inverse theory, and finally a method of scan-free spectrum estimation using a rotational shear-interferometer. Each of these topics forms a chapter in this thesis. i Co-Authorship Chapters 2, 3, 4 are co-authored with my supervisor, David J. Thomson. Chapter 5, involving scan-free spectrum estimation with a rotational shear interferometer is co-authored with Shawn Kraut, David J. Brady, and David J. Thomson. The work is a continuation of work begun by David J. Brady, worked on by Shawn Kraut while a post-doctoral researcher at Duke University, and finished by myself and David J. Thomson. ii Acknowledgments I thank David J. Thomson, Shawn Kraut and Mark Halpern for invaluable mentor- ship. I would like to thank Ryan Graham, Karim Rahim, and Azadeh Moghtaderi for numerous enlightening discussions on time-series analysis and statistics, my loving parents Edmond Lepage and Irene Makar, and the rest of the Makar clan for their continual support, and finally my beautiful partner and companion, Elsa Hansen, who has, during these past four years, made the finer things sweeter and the lows not quite as deep.
    [Show full text]
  • Multitaper Spectral Estimation HDP-Hmms for EEG Sleep Inference
    Multitaper Spectral Estimation HDP-HMMs for EEG Sleep Inference Leon Chlon∗ Andrew H. Song∗ Sandya Subramanian MIT Department of BCS MIT Department of EECS Harvard-MIT HST Cambridge, MA 02139 Cambridge, MA 02139 Cambridge, MA 02139 [email protected] [email protected] [email protected] Hugo Soulat John Tauber Demba Ba MIT Department of BCS MIT Department of BCS SEAS Department of Harvard Cambridge, MA 02139 Cambridge, MA 02139 Cambridge, MA 02138 [email protected] [email protected] [email protected] Michael Prerau MGH Department of Anesthesia, Critical Care and Pain Medicine 55 Fruit st, GRJ 4, Boston, MA 02114 [email protected] Abstract Electroencephalographic (EEG) monitoring of neural activity is widely used for sleep disorder diagnostics and research. The standard of care is to manually classify 30-second epochs of EEG time-domain traces into 5 discrete sleep stages. Unfortunately, this scoring process is subjective and time-consuming, and the defined stages do not capture the heterogeneous landscape of healthy and clinical neural dynamics. This motivates the search for a data-driven and principled way to identify the number and composition of salient, reoccurring brain states present during sleep. To this end, we propose a Hierarchical Dirichlet Process Hidden Markov Model (HDP-HMM), combined with wide-sense stationary (WSS) time series spectral estimation to construct a generative model for personalized subject sleep states. In addition, we employ multitaper spectral estimation to further reduce the large variance of the spectral estimates inherent to finite-length EEG measurements. By applying our method to both simulated and human sleep data, we arrive at three main results: 1) a Bayesian nonparametric automated algorithm that recovers general temporal dynamics of sleep, 2) identification of subject-specific arXiv:1805.07300v1 [stat.ML] 18 May 2018 "microstates" within canonical sleep stages, and 3) discovery of stage-dependent sub-oscillations with shared spectral signatures across subjects.
    [Show full text]
  • Multitaper Analysis of Evolutionary Spectra from Multivariate Spiking Observations
    1 Multitaper Analysis of Evolutionary Spectra from Multivariate Spiking Observations Anuththara Rupasinghe and Behtash Babadi Abstract—Extracting the spectral representations of the neural proposed to quantify the energy-frequency-time distributions processes that underlie spiking activity is key to understanding [17]–[26]. One notable example is the evolutionary power how the brain rhythms mediate cognitive functions. While spec- spectral characterization [22], [23], which defines a non- tral estimation of continuous time-series is well studied, inferring the spectral representation of latent non-stationary processes stationary spectral density matrix in order to quantify the based on spiking observations is a challenging problem. In this local spectral energy distributions at each instant of time for paper, we address this issue by developing a multitaper spectral a multivariate process. This characterization has a physical estimation methodology that can be directly applied to multi- interpretation similar to that of stationary Fourier spectral variate spiking observations in order to extract the evolutionary analysis, and reduces to the classical power spectra when spectral density of the latent non-stationary processes that drive spiking activity, based on point process theory. We establish the processes are stationary [22]. A unified approach that theoretical bounds on the bias-variance trade-off of the proposed considers multivariate spiking observations driven by non- estimator. Finally, we compare the performance of our proposed stationary latent processes is lacking, but highly desired due technique with existing methods using simulation studies and to the emerging demands of modern neuronal data analysis. application to real data, which reveal significant gains in terms In this work, we close this gap by developing a framework of the bias-variance trade-off.
    [Show full text]
  • Arxiv:1906.08451V1 [Eess.SP] 20 Jun 2019 .Introduction 1
    Multitaper Spectral Analysis of Neuronal Spiking Activity Driven by Latent Stationary Processes Proloy Dasa, Behtash Babadia,b,∗ aDepartment of Electrical and Computer Engineering bInstitute for Systems Research University of Maryland, College Park, MD 20742, USA Abstract Investigating the spectral properties of the neural covariates that underlie spiking activity is an important problem in systems neuroscience, as it allows to study the role of brain rhythms in cognitive functions. While the spectral estimation of continuous time-series is a well-established domain, computing the spectral representation of these neural covariates from spiking data sets forth various challenges due to the intrinsic non-linearities involved. In this paper, we address this problem by proposing a variant of the multitaper method specifically tailored for point process data. To this end, we construct auxiliary spiking statistics from which the eigen-spectra of the underlying latent process can be directly inferred using maximum likelihood estimation, and thereby the multitaper estimate can be efficiently computed. Comparison of our proposed technique to existing methods using simulated data reveals significant gains in terms of the bias-variance trade-off. Keywords: Power spectral density, point process models, neural signal processing, multitaper analysis 1. Introduction Spectral analysis techniques are among the most important tools for extracting information from time- series data recorded from naturally occurring processes. Examples include speech [1], image and video [2], and electroencephalography (EEG) [3] data. Due to the exploratory nature of most of these applications, nonparametric techniques based on Fourier methods and Wavelets are among the most widely used. In arXiv:1906.08451v1 [eess.SP] 20 Jun 2019 particular, the multitaper method excels among the available nonparametric techniques due to both its sim- plicity and control over the bias-variance trade-off by means of bandwidth adjustment[4–6].
    [Show full text]
  • Multitaper Estimation on Arbitrary Domains∗
    Multitaper estimation on arbitrary domains∗ Joakim And´eny and Jos´eLuis Romeroz Abstract. Multitaper estimators have enjoyed significant success in estimating spectral densities from finite samples using as tapers Slepian functions defined on the acquisition domain. Unfortunately, the numerical calculation of these Slepian tapers is only tractable for certain symmetric domains, such as rectangles or disks. In addition, no perfor- mance bounds are currently available for the mean squared error of the spectral density estimate. This situation is inadequate for applications such as cryo-electron microscopy, where noise models must be estimated from irregular domains with small sample sizes. We show that the multitaper estimator only depends on the linear space spanned by the tapers. As a result, Slepian tapers may be replaced by proxy tapers spanning the same subspace (validating the common practice of using partially converged solutions to the Slepian eigenproblem as tapers). These proxies may consequently be calculated using standard numerical algorithms for block diagonalization. We also prove a set of performance bounds for multitaper estimators on arbitrary domains. The method is demonstrated on synthetic and experimental datasets from cryo-electron microscopy, where it reduces mean squared error by a factor of two or more compared to traditional methods. Key words. spectral estimation, multitaper estimators, spatiospectral concentration, irregular domains, block eigendecom- position, cryo-electron microscopy AMS subject classifications. 62G05, 62M15, 62M40, 33E10, 15B05, 15A18, 42B10, 94A12, 65F15, 92C55 1. Introduction. Estimating the frequency content of a stochastic process is crucial to many data processing tasks. For example, in several inverse problems, such as denoising, access to a good noise model is necessary for accurate reconstruction.
    [Show full text]
  • Minimum Bias Multiple Taper Spectral Estimation Arxiv:1803.04078V2
    Minimum bias multiple taper spectral estimation ∗ Kurt S. Riedel and Alexander Sidorenko Courant Institute of Mathematical Sciences, New York University New York, New York 10012-1185 EDICS: SP 3.1.1 Abstract Two families of orthonormal tapers are proposed for multitaper spectral analysis: minimum bias tapers, and sinusoidal tapers fv(k)g, (k) q 2 πkn where vn = N+1 sin N+1 , and N is the number of points. The re- ^ 1 PK sulting sinusoidal multitaper spectral estimate is S(f) = 2K(N+1) j=1 jy(f+ j j 2 2N+2 )−y(f − 2N+2 )j , where y(f) is the Fourier transform of the sta- tionary time series, S(f) is the spectral density, and K is the number of tapers. For fixed j, the sinusoidal tapers converge to the minimum bias tapers like 1=N. Since the sinusoidal tapers have analytic ex- pressions, no numerical eigenvalue decomposition is necessary. Both the minimum bias and sinusoidal tapers have no additional parame- ter for the spectral bandwidth. The bandwidth of the jth taper is 1 ±j simply N centered about the frequencies 2N+2 . Thus the bandwidth of the multitaper spectral estimate can be adjusted locally by simply arXiv:1803.04078v2 [stat.ME] 30 Mar 2018 adding or deleting tapers. The band limited spectral concentration, R w 2 −w jV (f)j df, of both the minimum bias and sinusoidal tapers is very close to the optimal concentration achieved by the Slepian tapers. In R 1=2 2 2 contrast, the Slepian tapers can have the local bias, −1=2 f jV (f)j df, much larger than of the minimum bias tapers and the sinusoidal ta- pers.
    [Show full text]
  • Spatiospectral Concentration on a Sphere∗
    SIAM REVIEW c 2006 Society for Industrial and Applied Mathematics Vol. 48,No. 3,pp. 504–536 Spatiospectral Concentration on a Sphere∗ Frederik J. Simons† F. A. Dahlen‡ Mark A. Wieczorek§ Abstract. We pose and solve the analogue of Slepian’s time-frequency concentration problem on the surface of the unit sphere to determine an orthogonal family of strictly bandlimited functions that are optimally concentrated within a closed region of the sphere or, alterna- tively, of strictly spacelimited functions that are optimally concentrated in the spherical harmonic domain. Such a basis of simultaneously spatially and spectrally concentrated functions should be a useful data analysis and representation tool in a variety of geophysi- cal and planetary applications, as well as in medical imaging, computer science, cosmology, and numerical analysis. The spherical Slepian functions can be found by solving either an algebraic eigenvalue problem in the spectral domain or a Fredholm integral equation in the spatial domain. The associated eigenvalues are a measure of the spatiospectral concen- tration. When the concentration region is an axisymmetric polar cap, the spatiospectral projection operator commutes with a Sturm–Liouville operator; this enables the eigenfunc- tions to be computed extremely accurately and efficiently, even when their area-bandwidth product, or Shannon number, is large. In the asymptotic limit of a small spatial region and a large spherical harmonic bandwidth, the spherical concentration problem reduces to its planar equivalent, which exhibits self-similarity when the Shannon number is kept invariant. Key words. bandlimited function, commuting differential operator, concentration problem, eigenvalue problem, multitaper spectral analysis, spherical harmonics AMS subject classifications.
    [Show full text]