Analysis of Process Data with Singular Spectrum Methods

Total Page:16

File Type:pdf, Size:1020Kb

Analysis of Process Data with Singular Spectrum Methods ANALYSIS OF PROCESS DATA WITH SINGULAR SPECTRUM METHODS by Marlize Barkhuizen Thesis presented in partial fulfilment of the requirements for the Degree of Masters of Science in Engineering (Chemical Engineering) in the Department of Process Engineering at the University of Stellenbosch Supervised by: Prof C Aldrich STELLENBOSCH SOUTH AFRICA DECEMBER 2003 DECLARATION I, the undersigned, hereby declare that the work contained in this thesis is my own original work and that I have not previously in its entirety or in part submitted it at any university for a degree. Marlize Barkhuizen October 2003 SYNOPSIS The analysis of process data obtained from chemical and metallurgical engineering systems is a crucial aspect of the operating of any process, as information extracted from the data is used for control purposes, decision making and forecasting. Singular spectrum analysis (SSA) is a relatively new technique that can be used to decompose time series into their constituent components, after which a variety of further analyses can be applied to the data. The objectives of this study were to investigate the abilities of SSA regarding the filtering of data and the subsequent modelling of the filtered data, to explore the methods available to perform nonlinear SSA and finally to explore the possibilities of Monte Carlo SSA to characterize and identify process systems from observed time series data. Although the literature indicated the widespread application of SSA in other research fields, no previous application of singular spectrum analysis to time series obtained from chemical engineering processes could be found. SSA appeared to have a multitude of applications that could be of great benefit in the analysis of data from process systems. The first indication of this was in the filtering or noise-removal abilities of SSA. A number of case studies were filtered by various techniques related to SSA, after which a number of neural network modelling strategies were applied to the data. It was consistently found that the models built on data that have been prefiltered with SSA outperformed the other models. The effectiveness of localized SSA and auto-associative neural networks in performing nonlinear SSA were compared. Both techniques succeeded in extracting a number of nonlinear components from the data that could not be identified from linear SSA. However, it was found that localized SSA was a more reliable approach, as the auto-associative neural networks would not train for some of the data or extracted nonsensical components for other series. Lastly a number of time series were analysed using Monte Carlo SSA. It was found that, as is the case with all other characterization techniques, Monte Carlo SSA could not succeed in correctly classifying all the series investigated. For this reason several tests were used for the classification of the real process data. In the light of these findings, it was concluded that singular spectrum analysis could be a valuable tool in the analysis of chemical and metallurgical process data. i OPSOMMING Die analise van chemise en metallurgiese prosesdata wat verkry is vanaf chemiese of metallurgiese ingenieursstelsels is ‘n baie belangrike aspek in die bedryf van enige proses, aangesien die inligting wat van die data onttrek word vir prosesbeheer, besluitneming of die bou van prosesmodelle gebruik kan word. Singuliere spektrale analise is ‘n relatief nuwe tegniek wat gebruik kan word om tydreekse in hul onderliggende komponente te ontbind. Die doelwitte van hierdie studie was om ‘n omvattende literatuuroorsig oor die ontwikkeling van die tegniek en die toepassing daarvan te doen, beide in die ingenieursindustrie en in ander navorsingsvelde, die navors van die moontlikhede van SSA aangaande die verwydering van geraas uit die data en die gevolglike modellering van die skoon data te ondersoek, ‘n ondersoek te doen na sommige van die beskikbare tegnieke vir nie-lineêre SSA en laastens ‘n studie te maak van die potensiaal van Monte Carlo SSA vir die karakterisering en identifikasie van data verkry vanaf prosesstelsels. Ten spyte van aanduidings in die literatuur dat SSA wydverspreid toegepas word in ander navorsingsvelde, kon geen vorige toepassings gevind word van SSA op chemiese prosesse nie. Dit wil voorkom asof die chemiese nywerhede groot baat kan vind by SSA van prosesdata. Die eerste aanduiding van hierdie voordele was in die vermoë van SSA om geraas te verwyder uit tydreekse. ‘n Aantal tipiese gevalle is ondersoek deur van verskeie benaderings tot SSA gebruik te maak. Nadat die geraas uit die tydreekse van die toetsgevalle verwyder is, is neurale netwerke gebruik om die prosesse te modelleer. Daar is herhaaldelik gevind dat die modelle wat gebou is op data wat eers deur SSA skoongemaak is, beter presteer as die wat slegs op die onverwerkte data gepas is. Die effektiwiteit van lokale SSA en auto-assosiatiewe neurale netwerke om nie- lineêre SSA toe te pas is ook vergelyk. Albei tegnieke het daarin geslaag om nie- lineêre hoofkomponente van die data te onttrek wat nie geïdentifiseer kon word deur die lineêre benadering nie. Daar is egter gevind dat lokale SSA ‘n meer betroubare tegniek is, aangesien die auto- assosiatiewe neurale netwerke nie op sommige van die datastelle wou leer nie en vir ander tydreekse sinnelose hoofkomponente onttrek het. Laastens is ‘n aantal tydreekse geanaliseer met behulp van Monte Carlo SSA. Soos met alle ander karakteriseringstegnieke, kon Monte Carlo SSA nie daarin slaag om al die tydreekse wat ondersoek is korrek te identifiseer nie. Om hierdie rede is ‘n kombinasie van toetse gebruik om die onbekende tydreekse te klassifiseer. In die lig van al hierdie bevindinge, is die gevolgtrekking gemaak dat singuliere spektrale analise ‘n waardevolle hulpmiddel kan wees in die analise van chemiese en metallurgiese prosesdata. ii This is not the end. It is not even the beginning of the end. But it is the end of the beginning. Winston Churchill iii ACKNOWLEDGEMENTS My heavenly Father for giving me the abilities, strength and opportunities to come this far. My study leader, Prof Chris Aldrich, for his patient guidance, continued support and invaluable input in my work. My parents for believing in me and always having a word of support, encouragement and praise, as the need arose. My friends for showing interest, giving advice and ‘just being there’, even though my work was largely Greek to them. Fred for promising everything would turn out all right (it did). Mintek for their continued financial support and interest in my work. Acknowledgements iv TABLE OF CONTENTS TABLE OF CONTENTS v LIST OF FIGURES viii LIST OF TABLES xv 1. INTRODUCTION 1 2. LITERATURE REVIEW 3 2.1 ORIGINS OF SINGULAR SPECTRUM ANALYSIS ........................................................................... 3 2.2 SSA COMPARED TO OTHER SPECTRAL TIME SERIES ANALYSIS TECHNIQUES ............................. 4 2.2.1 Fourier analysis 4 2.2.2 Wavelet analysis 4 2.2.3 Maximum Entropy Method (MEM) 4 2.2.4 Multi-taper method (MTM) 5 2.3 TRENDS ANALYSIS WITH SSA................................................................................................. 5 2.3.1 Climatology and paleoclimatology 5 2.3.2 Biosciences 8 2.3.3 Economics 10 2.3.4 Geophysics 11 2.3.5 Engineering applications 12 2.3.6 Solar physics 14 2.4 OTHER APPLICATIONS OF SSA............................................................................................. 15 2.4.1 Process modelling and forecasting 15 2.4.2 Change point detection 16 3. METHODOLOGY 18 3.1 BASIC SINGULAR SPECTRUM ANALYSIS ........................................................................... 18 3.1.1 Decomposition stage 19 3.1.2 Reconstruction stage 22 3.1.3 Literature review on embedding and window length 23 3.1.4 Noise reduction 24 3.1.5 Interpretation of eigenelements 25 3.1.6 Prediction 25 3.2 MULTICHANNEL SINGULAR SPECTRUM ANALYSIS ............................................................. 25 3.3 MONTE CARLO SSA........................................................................................................ 26 3.3.1 Monte Carlo in the literature 28 3.3.2 General approach to Monte Carlo SSA 30 3.3.3 Generation of surrogate data 31 3.3.4 Choosing a test statistic 32 3.3.5 Classes of hypothesis 33 Table of Contents v 3.4 NONLINEAR SINGULAR SPECTRUM ANALYSIS .................................................................... 33 3.4.1 Nonlinear Principal Component Analysis 33 3.4.2 Localized Principal Component Analysis 36 3.5 OTHER APPROACHES TO SSA.......................................................................................... 38 3.5.1 Multi-scale singular spectrum analysis 38 3.5.2 Random lag singular spectrum analysis 38 3.5.3 Approximate projectors 38 4. FILTERING OF DATA WITH SSA 39 4.1 IDENTIFICATION OF PROCESS SYSTEM ................................................................................... 39 4.2 FLOW BETWEEN TWO NONINTERACTING TANKS IN SERIES ...................................................... 42 4.2.1 Generation of series and performing SSA 42 4.2.2 Modelling of data series 44 4.2.3 Influence of window length 45 4.2.4 Reconstructed attractor 47 4.3 CARBON-IN-LEACH PROCESS................................................................................................ 48 4.3.1 Background 48 4.3.2 Grouping of eigenvalues 50 4.3.3 Filtering of data 52 4.4 BASE METAL FLOTATION PLANT – ROUGHER, CLEANER AND SCAVENGER CIRCUITS ................. 54 4.4.1 Background 54 4.4.2 Embedding of plant data 55 4.4.3 Modelling of the cleaner,
Recommended publications
  • An Overview of Psd: Adaptive Sine Multitaper Power Spectral Density Estimation in R
    An overview of psd: Adaptive sine multitaper power spectral density estimation in R Andrew J. Barbour and Robert L. Parker June 28, 2020 Abstract This vignette provides an overview of some features included in the package psd, designed to compute estimates of power spectral density (PSD) for a univariate series in a sophisticated manner, with very little tuning effort. The sine multitapers are used, and the number of tapers varies with spectral shape, according to the optimal value proposed by Riedel and Sidorenko (1995). The adaptive procedure iteratively refines the optimal number of tapers at each frequency based on the spectrum from the previous iteration. Assuming the adaptive procedure converges, this produces power spectra with significantly lower spectral variance relative to results from less-sophisticated estimators. Sine tapers exhibit excellent leakage suppression characteristics, so bias effects are also reduced. Resolution and uncertainty vary with the number of tapers, which means we do not need to resort to either (1) windowing methods, which inherently degrade resolution at low-frequency (e.g. Welch’s method); or (2) smoothing kernels, which can badly distort important features without careful tuning (e.g. the Daniell kernel in stats::spectrum). In this regards psd is best suited for data having large dynamic range and some mix of narrow and wide-band structure, features typically found in geophysical datasets. Contents 1 Quick start: A minimal example. 2 2 Comparisons with other methods 5 2.1 stats::spectrum ............................................5 2.2 RSEIS::mtapspec ............................................7 2.3 multitaper::spec.mtm ........................................ 15 2.4 sapa::SDF ................................................ 15 2.5 bspec::bspec .............................................
    [Show full text]
  • Spectral Estimation Using Multitaper Whittle Methods with a Lasso Penalty Shuhan Tang, Peter F
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. , NO. , MONTH YEAR 1 Spectral Estimation Using Multitaper Whittle Methods with a Lasso Penalty Shuhan Tang, Peter F. Craigmile, and Yunzhang Zhu Last updated July 25, 2019 Abstract—Spectral estimation provides key insights into the can compromise estimation (see [1], ch.9, and references frequency domain characteristics of a time series. Naive non- therein). parametric estimates of the spectral density, such as the peri- A popular alternative approach is to consider a semipara- odogram, are inconsistent, and the more advanced lag window or multitaper estimators are often still too noisy. We propose metric model for the SDF, in which the log SDF is expressed an L1 penalized quasi-likelihood Whittle framework based on in terms of a truncated basis expansion, where the number multitaper spectral estimates which performs semiparametric of basis functions are allowed to increase with the sample spectral estimation for regularly sampled univariate stationary size. The statistical problem then becomes how to enforce time series. Our new approach circumvents the problematic sparsity by selecting the basis functions and estimating the Gaussianity assumption required by least square approaches and achieves sparsity for a wide variety of basis functions. We model parameters so that we adequately estimate the SDF, present an alternating direction method of multipliers (ADMM) but also have computational efficiency as the sample size algorithm to efficiently solve the optimization problem, and is increased. Gao [3] [4], Moulin [5] and Walden et al. [6] develop universal threshold and generalized information criterion enforce sparsity using a penalized least square (LS) approach (GIC) strategies for efficient tuning parameter selection that for estimating the log SDF with wavelet soft thresholding.
    [Show full text]
  • Localized Spectral Analysis on the Sphere Mark Wieczorek, Frederik Simons
    Localized spectral analysis on the sphere Mark Wieczorek, Frederik Simons To cite this version: Mark Wieczorek, Frederik Simons. Localized spectral analysis on the sphere. Geophysical Jour- nal International, Oxford University Press (OUP), 2005, 162 (3), pp.655-675. 10.1111/j.1365- 246X.2005.02687.x. hal-02458533 HAL Id: hal-02458533 https://hal.archives-ouvertes.fr/hal-02458533 Submitted on 30 Jul 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Geophys. J. Int. (2005) 162, 655–675 doi: 10.1111/j.1365-246X.2005.02687.x Localized spectral analysis on the sphere ∗ Mark A. Wieczorek1 and Frederik J. Simons2, 1D´epartement de G´eophysique Spatiale et Plan´etaire, Institut de Physique du Globe de Paris, France. E-mail: [email protected] 2Geosciences Department, Guyot Hall, Princeton University, Princeton, NJ 08540, USA Accepted 2005 May 13. Received 2005 May 9; in original form 2004 October 12 SUMMARY Downloaded from https://academic.oup.com/gji/article-abstract/162/3/655/2097585 by guest on 30 July 2020 It is often advantageous to investigate the relationship between two geophysical data sets in the spectral domain by calculating admittance and coherence functions.
    [Show full text]
  • Comparing Open-Source Toolboxes for Processing and Analysis of Spike
    bioRxiv preprint doi: https://doi.org/10.1101/600486; this version posted April 12, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made Unakafova et al. available under aCC-BY 4.0 International licenseOpen-source. spike and LFP toolboxes 1 Comparing open-source toolboxes for 2 processing and analysis of spike and local field 3 potentials data 4 5 Valentina A. Unakafova1,* and Alexander Gail1,2,3,4 6 1Cognitive Neurosciences Laboratory, German Primate Center, Goettingen, Germany 7 2Leibniz ScienceCampus, Primate Cognition, Goettingen, Germany 8 3Georg-Elias-Mueller-Institute of Psychology, University of Goettingen, Goettingen, Germany 9 4Bernstein Center for Computational Neuroscience, Goettingen, Germany 10 *Correspondence: [email protected] 11 ABSTRACT. 12 Analysis of spike and local field potential (LFP) data is an essential part of neuroscientific research. Today 13 there exist many open-source toolboxes for spike and LFP data analysis implementing various functionality. 14 Here we aim to provide a practical guidance for neuroscientists in the choice of an open-source toolbox best 15 satisfying their needs. We overview major open-source toolboxes for spike and LFP data analysis as well as 16 toolboxes with tools for connectivity analysis, dimensionality reduction and generalized linear modeling. We 17 focus on comparing toolboxes functionality, statistical and visualization tools, documentation and support 18 quality. To give a better insight, we compare and illustrate functionality of the toolboxes on open-access 19 dataset or simulated data and make corresponding MATLAB scripts publicly available.
    [Show full text]
  • Multitaper Analysis of Fundamental Frequency Variations During Voiced Fricatives
    Shadle & Ramsay: Multitaper harmonic analysis of fricatives MULTITAPER ANALYSIS OF FUNDAMENTAL FREQUENCY VARIATIONS DURING VOICED FRICATIVES Christine Shadle & Gordon Ramsay School of Electronics & Computer Science, University of Southampton ABSTRACT: A method for tracking fundamental frequency variations in speech is proposed, based on multitaper anal- ysis. Using the multitaper technique, a statistical test is developed for detecting the presence of harmonic components at multiples of a fundamental frequency, embedded in coloured noise. It is shown that this can be applied to speech to estimate the fundamental frequency, when present, as well as the amplitude and phase of each harmonic. The method is validated on synthetic data, to determine accuracy and robustness, and evaluated on a small corpus of real speech data, comparing simultaneous acoustic and electroglottographic measurements to assess performance. Acoustic mea- surements are marginally less accurate than electroglottographic measurements, but often continue to provide useful fundamental frequency estimates in situations where electroglottography fails. INTRODUCTION Variations in the fundamental frequency of the voice (F0) are known to provide important perceptual cues in speech. Long- term fundamental frequency variations associated with different intonation patterns typically indicate higher-level linguistic or extra-linguistic information. However, short-term perturbations of the F0 contour also occur automatically when a con- striction is formed in the vocal tract, or when vocal fold vibration is affected by aerodynamic factors; these perturbations are known to provide acoustic cues for individual consonants (Ohde, 1984; Baken & Orlikoff, 1988). Previous studies have used both acoustic and electroglottographic measurements to examine short-term fundamental frequency variations and devoicing phenomena in fricatives (e.g.
    [Show full text]
  • State-Space Multitaper Spectrogram Algorithms: Theory and Applications
    State-Space Multitaper Spectrogram Algorithms: Theory and Applications by Michael K. Behr S.B., Massachusetts Institute of Technology (2015) Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements for the degree of Master of Engineering in Electrical Engineering and Computer Science at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY June 2016 © Massachusetts Institute of Technology 2016. All rights reserved. Author...................................................................... Department of Electrical Engineering and Computer Science May 20, 2016 Certified by. Dr. Emery N. Brown Professor Thesis Supervisor Accepted by................................................................. Dr. Christopher J. Terman Chairman, Masters of Engineering Thesis Committee State-Space Multitaper Spectrogram Algorithms: Theory and Applications by Michael K. Behr Submitted to the Department of Electrical Engineering and Computer Science on May 20, 2016, in partial fulfillment of the requirements for the degree of Master of Engineering in Electrical Engineering and Computer Science Abstract I present the state-space multitaper approach for analyzing non-stationary time series. Non- stationary time series are commonly divided into small time windows for analysis, but ex- isting methods lose predictive power by analyzing each window independently, even though nearby windows have similar spectral properties. The state-space multitaper algorithm com- bines two approaches for spectral analysis: the state-space approach models the relations between nearby windows, and the multitaper approach balances a bias-variance tradeoff in- herent in Fourier analysis of finite interval data. I illustrate an application of the algorithm to real-time anesthesia monitoring, which could prevent traumatic cases of intraoperative awareness. I discuss issues including a real-time implementation and modeling the system’s noise parameters.
    [Show full text]
  • Some Advances in the Multitaper Method of Spectrum Estimation
    Some Advances in the Multitaper Method of Spectrum Estimation by Kyle Quentin Lepage A thesis submitted to the Department of Mathematics and Statistics in conformity with the requirements for the degree of Doctor of Philosophy Queen’s University Kingston, Ontario, Canada February 2009 Copyright c Kyle Quentin Lepage, 2009 Abstract Four contributions to the multitaper method of applied spectrum estimation are pre- sented. These are a generalization of the multitaper method of spectrum estimation to time-series possessing irregularly spaced samples, a robust spectrum estimate suit- able for cyclostationary, or quasi-cyclostationary time-series, an improvement over the standard, multitaper spectrum estimates using quadratic inverse theory, and finally a method of scan-free spectrum estimation using a rotational shear-interferometer. Each of these topics forms a chapter in this thesis. i Co-Authorship Chapters 2, 3, 4 are co-authored with my supervisor, David J. Thomson. Chapter 5, involving scan-free spectrum estimation with a rotational shear interferometer is co-authored with Shawn Kraut, David J. Brady, and David J. Thomson. The work is a continuation of work begun by David J. Brady, worked on by Shawn Kraut while a post-doctoral researcher at Duke University, and finished by myself and David J. Thomson. ii Acknowledgments I thank David J. Thomson, Shawn Kraut and Mark Halpern for invaluable mentor- ship. I would like to thank Ryan Graham, Karim Rahim, and Azadeh Moghtaderi for numerous enlightening discussions on time-series analysis and statistics, my loving parents Edmond Lepage and Irene Makar, and the rest of the Makar clan for their continual support, and finally my beautiful partner and companion, Elsa Hansen, who has, during these past four years, made the finer things sweeter and the lows not quite as deep.
    [Show full text]
  • Multitaper Spectral Estimation HDP-Hmms for EEG Sleep Inference
    Multitaper Spectral Estimation HDP-HMMs for EEG Sleep Inference Leon Chlon∗ Andrew H. Song∗ Sandya Subramanian MIT Department of BCS MIT Department of EECS Harvard-MIT HST Cambridge, MA 02139 Cambridge, MA 02139 Cambridge, MA 02139 [email protected] [email protected] [email protected] Hugo Soulat John Tauber Demba Ba MIT Department of BCS MIT Department of BCS SEAS Department of Harvard Cambridge, MA 02139 Cambridge, MA 02139 Cambridge, MA 02138 [email protected] [email protected] [email protected] Michael Prerau MGH Department of Anesthesia, Critical Care and Pain Medicine 55 Fruit st, GRJ 4, Boston, MA 02114 [email protected] Abstract Electroencephalographic (EEG) monitoring of neural activity is widely used for sleep disorder diagnostics and research. The standard of care is to manually classify 30-second epochs of EEG time-domain traces into 5 discrete sleep stages. Unfortunately, this scoring process is subjective and time-consuming, and the defined stages do not capture the heterogeneous landscape of healthy and clinical neural dynamics. This motivates the search for a data-driven and principled way to identify the number and composition of salient, reoccurring brain states present during sleep. To this end, we propose a Hierarchical Dirichlet Process Hidden Markov Model (HDP-HMM), combined with wide-sense stationary (WSS) time series spectral estimation to construct a generative model for personalized subject sleep states. In addition, we employ multitaper spectral estimation to further reduce the large variance of the spectral estimates inherent to finite-length EEG measurements. By applying our method to both simulated and human sleep data, we arrive at three main results: 1) a Bayesian nonparametric automated algorithm that recovers general temporal dynamics of sleep, 2) identification of subject-specific arXiv:1805.07300v1 [stat.ML] 18 May 2018 "microstates" within canonical sleep stages, and 3) discovery of stage-dependent sub-oscillations with shared spectral signatures across subjects.
    [Show full text]
  • Multitaper Analysis of Evolutionary Spectra from Multivariate Spiking Observations
    1 Multitaper Analysis of Evolutionary Spectra from Multivariate Spiking Observations Anuththara Rupasinghe and Behtash Babadi Abstract—Extracting the spectral representations of the neural proposed to quantify the energy-frequency-time distributions processes that underlie spiking activity is key to understanding [17]–[26]. One notable example is the evolutionary power how the brain rhythms mediate cognitive functions. While spec- spectral characterization [22], [23], which defines a non- tral estimation of continuous time-series is well studied, inferring the spectral representation of latent non-stationary processes stationary spectral density matrix in order to quantify the based on spiking observations is a challenging problem. In this local spectral energy distributions at each instant of time for paper, we address this issue by developing a multitaper spectral a multivariate process. This characterization has a physical estimation methodology that can be directly applied to multi- interpretation similar to that of stationary Fourier spectral variate spiking observations in order to extract the evolutionary analysis, and reduces to the classical power spectra when spectral density of the latent non-stationary processes that drive spiking activity, based on point process theory. We establish the processes are stationary [22]. A unified approach that theoretical bounds on the bias-variance trade-off of the proposed considers multivariate spiking observations driven by non- estimator. Finally, we compare the performance of our proposed stationary latent processes is lacking, but highly desired due technique with existing methods using simulation studies and to the emerging demands of modern neuronal data analysis. application to real data, which reveal significant gains in terms In this work, we close this gap by developing a framework of the bias-variance trade-off.
    [Show full text]
  • Arxiv:1906.08451V1 [Eess.SP] 20 Jun 2019 .Introduction 1
    Multitaper Spectral Analysis of Neuronal Spiking Activity Driven by Latent Stationary Processes Proloy Dasa, Behtash Babadia,b,∗ aDepartment of Electrical and Computer Engineering bInstitute for Systems Research University of Maryland, College Park, MD 20742, USA Abstract Investigating the spectral properties of the neural covariates that underlie spiking activity is an important problem in systems neuroscience, as it allows to study the role of brain rhythms in cognitive functions. While the spectral estimation of continuous time-series is a well-established domain, computing the spectral representation of these neural covariates from spiking data sets forth various challenges due to the intrinsic non-linearities involved. In this paper, we address this problem by proposing a variant of the multitaper method specifically tailored for point process data. To this end, we construct auxiliary spiking statistics from which the eigen-spectra of the underlying latent process can be directly inferred using maximum likelihood estimation, and thereby the multitaper estimate can be efficiently computed. Comparison of our proposed technique to existing methods using simulated data reveals significant gains in terms of the bias-variance trade-off. Keywords: Power spectral density, point process models, neural signal processing, multitaper analysis 1. Introduction Spectral analysis techniques are among the most important tools for extracting information from time- series data recorded from naturally occurring processes. Examples include speech [1], image and video [2], and electroencephalography (EEG) [3] data. Due to the exploratory nature of most of these applications, nonparametric techniques based on Fourier methods and Wavelets are among the most widely used. In arXiv:1906.08451v1 [eess.SP] 20 Jun 2019 particular, the multitaper method excels among the available nonparametric techniques due to both its sim- plicity and control over the bias-variance trade-off by means of bandwidth adjustment[4–6].
    [Show full text]
  • Multitaper Estimation on Arbitrary Domains∗
    Multitaper estimation on arbitrary domains∗ Joakim And´eny and Jos´eLuis Romeroz Abstract. Multitaper estimators have enjoyed significant success in estimating spectral densities from finite samples using as tapers Slepian functions defined on the acquisition domain. Unfortunately, the numerical calculation of these Slepian tapers is only tractable for certain symmetric domains, such as rectangles or disks. In addition, no perfor- mance bounds are currently available for the mean squared error of the spectral density estimate. This situation is inadequate for applications such as cryo-electron microscopy, where noise models must be estimated from irregular domains with small sample sizes. We show that the multitaper estimator only depends on the linear space spanned by the tapers. As a result, Slepian tapers may be replaced by proxy tapers spanning the same subspace (validating the common practice of using partially converged solutions to the Slepian eigenproblem as tapers). These proxies may consequently be calculated using standard numerical algorithms for block diagonalization. We also prove a set of performance bounds for multitaper estimators on arbitrary domains. The method is demonstrated on synthetic and experimental datasets from cryo-electron microscopy, where it reduces mean squared error by a factor of two or more compared to traditional methods. Key words. spectral estimation, multitaper estimators, spatiospectral concentration, irregular domains, block eigendecom- position, cryo-electron microscopy AMS subject classifications. 62G05, 62M15, 62M40, 33E10, 15B05, 15A18, 42B10, 94A12, 65F15, 92C55 1. Introduction. Estimating the frequency content of a stochastic process is crucial to many data processing tasks. For example, in several inverse problems, such as denoising, access to a good noise model is necessary for accurate reconstruction.
    [Show full text]
  • Minimum Bias Multiple Taper Spectral Estimation Arxiv:1803.04078V2
    Minimum bias multiple taper spectral estimation ∗ Kurt S. Riedel and Alexander Sidorenko Courant Institute of Mathematical Sciences, New York University New York, New York 10012-1185 EDICS: SP 3.1.1 Abstract Two families of orthonormal tapers are proposed for multitaper spectral analysis: minimum bias tapers, and sinusoidal tapers fv(k)g, (k) q 2 πkn where vn = N+1 sin N+1 , and N is the number of points. The re- ^ 1 PK sulting sinusoidal multitaper spectral estimate is S(f) = 2K(N+1) j=1 jy(f+ j j 2 2N+2 )−y(f − 2N+2 )j , where y(f) is the Fourier transform of the sta- tionary time series, S(f) is the spectral density, and K is the number of tapers. For fixed j, the sinusoidal tapers converge to the minimum bias tapers like 1=N. Since the sinusoidal tapers have analytic ex- pressions, no numerical eigenvalue decomposition is necessary. Both the minimum bias and sinusoidal tapers have no additional parame- ter for the spectral bandwidth. The bandwidth of the jth taper is 1 ±j simply N centered about the frequencies 2N+2 . Thus the bandwidth of the multitaper spectral estimate can be adjusted locally by simply arXiv:1803.04078v2 [stat.ME] 30 Mar 2018 adding or deleting tapers. The band limited spectral concentration, R w 2 −w jV (f)j df, of both the minimum bias and sinusoidal tapers is very close to the optimal concentration achieved by the Slepian tapers. In R 1=2 2 2 contrast, the Slepian tapers can have the local bias, −1=2 f jV (f)j df, much larger than of the minimum bias tapers and the sinusoidal ta- pers.
    [Show full text]