Particle Filters and Data Assimilation Arxiv:1709.04196V1 [Stat.CO] 13
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Neural Implementation of the Kalman Filter
A Neural Implementation of the Kalman Filter Robert C. Wilson Leif H. Finkel Department of Psychology Department of Bioengineering Princeton University University of Pennsylvania Princeton, NJ 08540 Philadelphia, PA 19103 [email protected] Abstract Recent experimental evidence suggests that the brain is capable of approximating Bayesian inference in the face of noisy input stimuli. Despite this progress, the neural underpinnings of this computation are still poorly understood. In this pa- per we focus on the Bayesian filtering of stochastic time series and introduce a novel neural network, derived from a line attractor architecture, whose dynamics map directly onto those of the Kalman filter in the limit of small prediction error. When the prediction error is large we show that the network responds robustly to changepoints in a way that is qualitatively compatible with the optimal Bayesian model. The model suggests ways in which probability distributions are encoded in the brain and makes a number of testable experimental predictions. 1 Introduction There is a growing body of experimental evidence consistent with the idea that animals are some- how able to represent, manipulate and, ultimately, make decisions based on, probability distribu- tions. While still unproven, this idea has obvious appeal to theorists as a principled way in which to understand neural computation. A key question is how such Bayesian computations could be per- formed by neural networks. Several authors have proposed models addressing aspects of this issue [15, 10, 9, 19, 2, 3, 16, 4, 11, 18, 17, 7, 6, 8], but as yet, there is no conclusive experimental evidence in favour of any one and the question remains open. -
Introduction to Ensemble Kalman Filters and the Data Assimilation Research Testbed
Introduction to Ensemble Kalman Filters and the Data Assimilation Research Testbed Jeffrey Anderson, Tim Hoar, Nancy Collins NCAR Institute for Math Applied to Geophysics ICAP Workshop; 11 May 2011 pg 1 What is Data Assimilation? Observations combined with a Model forecast… + …to produce an analysis (best possible estimate). ICAP Workshop; 11 May 2011 pg 2 Example: Estimating the Temperature Outside An observation has a value ( * ), ICAP Workshop; 11 May 2011 pg 3 Example: Estimating the Temperature Outside An observation has a value ( * ), and an error distribution (red curve) that is associated with the instrument. ICAP Workshop; 11 May 2011 pg 4 Example: Estimating the Temperature Outside Thermometer outside measures 1C. Instrument builder says thermometer is unbiased with +/- 0.8C gaussian error. ICAP Workshop; 11 May 2011 pg 5 Example: Estimating the Temperature Outside Thermometer outside measures 1C. The red plot is P ( T | T o ) , probability of temperature given that To was observed. € ICAP Workshop; 11 May 2011 pg 6 Example: Estimating the Temperature Outside We also have a prior estimate of temperature. The green curve is P ( T | C ) ; probability of temperature given all available prior information C . € ICAP Workshop; 11 May 2011 pg 7 € Example: Estimating the Temperature Outside Prior information C can include: 1. Observations of things besides T; € 2. Model forecast made using observations at earlier times; 3. A priori physical constraints ( T > -273.15C ); 4. Climatological constraints ( -30C < T < 40C ). ICAP Workshop; 11 May 2011 pg 8 Combining the Prior Estimate and Observation Prior Bayes P(To | T,C)P(T | C) Theorem: P(T | To,C) = P(To | C) Posterior: Probability of T given Likelihood: Probability that To is observations and observed if T is true value and given Prior. -
Kalman and Particle Filtering
Abstract: The Kalman and Particle filters are algorithms that recursively update an estimate of the state and find the innovations driving a stochastic process given a sequence of observations. The Kalman filter accomplishes this goal by linear projections, while the Particle filter does so by a sequential Monte Carlo method. With the state estimates, we can forecast and smooth the stochastic process. With the innovations, we can estimate the parameters of the model. The article discusses how to set a dynamic model in a state-space form, derives the Kalman and Particle filters, and explains how to use them for estimation. Kalman and Particle Filtering The Kalman and Particle filters are algorithms that recursively update an estimate of the state and find the innovations driving a stochastic process given a sequence of observations. The Kalman filter accomplishes this goal by linear projections, while the Particle filter does so by a sequential Monte Carlo method. Since both filters start with a state-space representation of the stochastic processes of interest, section 1 presents the state-space form of a dynamic model. Then, section 2 intro- duces the Kalman filter and section 3 develops the Particle filter. For extended expositions of this material, see Doucet, de Freitas, and Gordon (2001), Durbin and Koopman (2001), and Ljungqvist and Sargent (2004). 1. The state-space representation of a dynamic model A large class of dynamic models can be represented by a state-space form: Xt+1 = ϕ (Xt,Wt+1; γ) (1) Yt = g (Xt,Vt; γ) . (2) This representation handles a stochastic process by finding three objects: a vector that l describes the position of the system (a state, Xt X R ) and two functions, one mapping ∈ ⊂ 1 the state today into the state tomorrow (the transition equation, (1)) and one mapping the state into observables, Yt (the measurement equation, (2)). -
Lecture 19: Wavelet Compression of Time Series and Images
Lecture 19: Wavelet compression of time series and images c Christopher S. Bretherton Winter 2014 Ref: Matlab Wavelet Toolbox help. 19.1 Wavelet compression of a time series The last section of wavelet leleccum notoolbox.m demonstrates the use of wavelet compression on a time series. The idea is to keep the wavelet coefficients of largest amplitude and zero out the small ones. 19.2 Wavelet analysis/compression of an image Wavelet analysis is easily extended to two-dimensional images or datasets (data matrices), by first doing a wavelet transform of each column of the matrix, then transforming each row of the result (see wavelet image). The wavelet coeffi- cient matrix has the highest level (largest-scale) averages in the first rows/columns, then successively smaller detail scales further down the rows/columns. The ex- ample also shows fine results with 50-fold data compression. 19.3 Continuous Wavelet Transform (CWT) Given a continuous signal u(t) and an analyzing wavelet (x), the CWT has the form Z 1 s − t W (λ, t) = λ−1=2 ( )u(s)ds (19.3.1) −∞ λ Here λ, the scale, is a continuous variable. We insist that have mean zero and that its square integrates to 1. The continuous Haar wavelet is defined: 8 < 1 0 < t < 1=2 (t) = −1 1=2 < t < 1 (19.3.2) : 0 otherwise W (λ, t) is proportional to the difference of running means of u over successive intervals of length λ/2. 1 Amath 482/582 Lecture 19 Bretherton - Winter 2014 2 In practice, for a discrete time series, the integral is evaluated as a Riemann sum using the Matlab wavelet toolbox function cwt. -
Localization in the Ensemble Kalman Filter
Department of Meteorology Localization in the ensemble Kalman Filter Ruth Elizabeth Petrie A dissertation submitted in partial fulfilment of the requirement for the degree of MSc. Atmosphere, Ocean and Climate August 2008 Abstract Data assimilation in meteorology seeks to provide a current analysis of the state of the atmosphere to use as initial conditions in a weather forecast. This is achieved by using an estimate of a previous state of the system and merging that with observations of the true state of the system. Ensemble Kalman filtering is one method of data assimilation. Ensemble Kalman filters operate by using an ensemble, or statistical sample, of the state of a system. A known prior state of a system is forecast to an observation time, then observation is assimilated. Observations are assimilated according to a ratio of the errors in the prior state and the observations. An analysis estimate of the system and an analysis estimate of the of the errors associated with the analysis state are also produced. This project looks at some problems within ensemble Kalman filtering and how they may be overcome. Undersampling is a key issue, this is where the size of the ensemble is so small so as to not be statistically representative of the state of a system. Undersampling can lead to inbreeding, filter divergence and the development of long range spurious correlations. It is possible to implement counter measures. Firstly covariance inflation is used to combat inbreeding and the subsequent filter divergence. Covariance localization is primarily used to remove long range spurious correlations but also has the benefit increasing the effective ensemble size. -
Applying Particle Filtering in Both Aggregated and Age-Structured Population Compartmental Models of Pre-Vaccination Measles
bioRxiv preprint doi: https://doi.org/10.1101/340661; this version posted June 6, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Applying particle filtering in both aggregated and age-structured population compartmental models of pre-vaccination measles Xiaoyan Li1*, Alexander Doroshenko2, Nathaniel D. Osgood1 1 Department of Computer Science, University of Saskatchewan, Saskatoon, Saskatchewan, Canada 2 Department of Medicine, Division of Preventive Medicine, University of Alberta, Edmonton, Alberta, Canada * [email protected] Abstract Measles is a highly transmissible disease and is one of the leading causes of death among young children under 5 globally. While the use of ongoing surveillance data and { recently { dynamic models offer insight on measles dynamics, both suffer notable shortcomings when applied to measles outbreak prediction. In this paper, we apply the Sequential Monte Carlo approach of particle filtering, incorporating reported measles incidence for Saskatchewan during the pre-vaccination era, using an adaptation of a previously contributed measles compartmental model. To secure further insight, we also perform particle filtering on an age structured adaptation of the model in which the population is divided into two interacting age groups { children and adults. The results indicate that, when used with a suitable dynamic model, particle filtering can offer high predictive capacity for measles dynamics and outbreak occurrence in a low vaccination context. We have investigated five particle filtering models in this project. Based on the most competitive model as evaluated by predictive accuracy, we have performed prediction and outbreak classification analysis. -
Adaptive Ensemble Kalman Filtering of Nonlinear Systems
SERIES A DATA ASSIMILATION AND PREDICTABILITY PUBLISHED BY THE INTERNATIONAL METEOROLOGICAL INSTITUTE IN STOCKHOLM Adaptive ensemble Kalman filtering of non-linear systems By TYRUS BERRY and TIMOTHY SAUER*, Department of Mathematical Sciences, George Mason University, Fairfax, VA 22030, USA (Manuscript received 22 December 2012; in final form 4 June 2013) ABSTRACT A necessary ingredient of an ensemble Kalman filter (EnKF) is covariance inflation, used to control filter divergence and compensate for model error. There is an on-going search for inflation tunings that can be learned adaptively. Early in the development of Kalman filtering, Mehra (1970, 1972) enabled adaptivity in the context of linear dynamics with white noise model errors by showing how to estimate the model error and observation covariances. We propose an adaptive scheme, based on lifting Mehra’s idea to the non-linear case, that recovers the model error and observation noise covariances in simple cases, and in more complicated cases, results in a natural additive inflation that improves state estimation. It can be incorporated into non- linear filters such as the extended Kalman filter (EKF), the EnKF and their localised versions. We test the adaptive EnKF on a 40-dimensional Lorenz96 model and show the significant improvements in state estimation that are possible. We also discuss the extent to which such an adaptive filter can compensate for model error, and demonstrate the use of localisation to reduce ensemble sizes for large problems. Keywords: ensemble Kalman filter, data assimilation, non-linear dynamics, covariance inflation, adaptive filtering 1. Introduction white noise model error and observation error. One goal of this article is to introduce an adaptive scheme in the The Kalman filter is provably optimal for systems where the Gaussian white noise setting that is a natural generalisation dynamics and observations are linear with Gaussian noise. -
Stochastic Ensemble Kalman Filter with Ensemble Generation Using a Square Root Form
Stochastic ensemble Kalman filter with ensemble generation using a square root form: ensemble π-algorithm Ekaterina Klimova Institute of Computational Technologies SB RAS, Ac. Lavrentjev Ave., 6, Novosibirsk, 630090, Russia [email protected] 1. Introduction The Kalman filter algorithm is one of the most popular approaches to observation data assimilation. To implement the ensemble algorithm, the number of ensemble members must not be too large. Also, the ensemble must have a covariance matrix consistent with the covariances of analysis errors. There exist two approaches to the ensemble Kalman filter: a “stochastic filter” and a “deterministic filter”. A large body of research has been made to compare stochastic and deterministic filters. As shown in several studies, the stochastic ensemble Kalman filter has advantages over a deterministic filter. In [6], a stochastic version of the ensemble Kalman filter, which is implemented in the square root form (the ensemble π-algorithm) is proposed. The ensemble π–algorithm is what is called a stochastic filter in which an ensemble of analysis errors is generated by transforming an ensemble of forecast errors using a square root form. The transformation matrix does not depend on grid nodes. Therefore, the algorithm can be used locally, in a similar way as the LETKF algorithm [4] and can be implemented involving operations with matrices of size not greater than that of the ensemble. A new numerical implementation scheme of the ensemble π-algorithm is proposed. In particular, we consider a more general approach than that proposed in [6] to calculate the square root of a non-symmetric matrix. -
Alternative Tests for Time Series Dependence Based on Autocorrelation Coefficients
Alternative Tests for Time Series Dependence Based on Autocorrelation Coefficients Richard M. Levich and Rosario C. Rizzo * Current Draft: December 1998 Abstract: When autocorrelation is small, existing statistical techniques may not be powerful enough to reject the hypothesis that a series is free of autocorrelation. We propose two new and simple statistical tests (RHO and PHI) based on the unweighted sum of autocorrelation and partial autocorrelation coefficients. We analyze a set of simulated data to show the higher power of RHO and PHI in comparison to conventional tests for autocorrelation, especially in the presence of small but persistent autocorrelation. We show an application of our tests to data on currency futures to demonstrate their practical use. Finally, we indicate how our methodology could be used for a new class of time series models (the Generalized Autoregressive, or GAR models) that take into account the presence of small but persistent autocorrelation. _______________________________________________________________ An earlier version of this paper was presented at the Symposium on Global Integration and Competition, sponsored by the Center for Japan-U.S. Business and Economic Studies, Stern School of Business, New York University, March 27-28, 1997. We thank Paul Samuelson and the other participants at that conference for useful comments. * Stern School of Business, New York University and Research Department, Bank of Italy, respectively. 1 1. Introduction Economic time series are often characterized by positive -
Time Series: Co-Integration
Time Series: Co-integration Series: Economic Forecasting; Time Series: General; Watson M W 1994 Vector autoregressions and co-integration. Time Series: Nonstationary Distributions and Unit In: Engle R F, McFadden D L (eds.) Handbook of Econo- Roots; Time Series: Seasonal Adjustment metrics Vol. IV. Elsevier, The Netherlands N. H. Chan Bibliography Banerjee A, Dolado J J, Galbraith J W, Hendry D F 1993 Co- Integration, Error Correction, and the Econometric Analysis of Non-stationary Data. Oxford University Press, Oxford, UK Time Series: Cycles Box G E P, Tiao G C 1977 A canonical analysis of multiple time series. Biometrika 64: 355–65 Time series data in economics and other fields of social Chan N H, Tsay R S 1996 On the use of canonical correlation science often exhibit cyclical behavior. For example, analysis in testing common trends. In: Lee J C, Johnson W O, aggregate retail sales are high in November and Zellner A (eds.) Modelling and Prediction: Honoring December and follow a seasonal cycle; voter regis- S. Geisser. Springer-Verlag, New York, pp. 364–77 trations are high before each presidential election and Chan N H, Wei C Z 1988 Limiting distributions of least squares follow an election cycle; and aggregate macro- estimates of unstable autoregressive processes. Annals of Statistics 16: 367–401 economic activity falls into recession every six to eight Engle R F, Granger C W J 1987 Cointegration and error years and follows a business cycle. In spite of this correction: Representation, estimation, and testing. Econo- cyclicality, these series are not perfectly predictable, metrica 55: 251–76 and the cycles are not strictly periodic. -
Lecture 8 the Kalman Filter
EE363 Winter 2008-09 Lecture 8 The Kalman filter • Linear system driven by stochastic process • Statistical steady-state • Linear Gauss-Markov model • Kalman filter • Steady-state Kalman filter 8–1 Linear system driven by stochastic process we consider linear dynamical system xt+1 = Axt + But, with x0 and u0, u1,... random variables we’ll use notation T x¯t = E xt, Σx(t)= E(xt − x¯t)(xt − x¯t) and similarly for u¯t, Σu(t) taking expectation of xt+1 = Axt + But we have x¯t+1 = Ax¯t + Bu¯t i.e., the means propagate by the same linear dynamical system The Kalman filter 8–2 now let’s consider the covariance xt+1 − x¯t+1 = A(xt − x¯t)+ B(ut − u¯t) and so T Σx(t +1) = E (A(xt − x¯t)+ B(ut − u¯t))(A(xt − x¯t)+ B(ut − u¯t)) T T T T = AΣx(t)A + BΣu(t)B + AΣxu(t)B + BΣux(t)A where T T Σxu(t) = Σux(t) = E(xt − x¯t)(ut − u¯t) thus, the covariance Σx(t) satisfies another, Lyapunov-like linear dynamical system, driven by Σxu and Σu The Kalman filter 8–3 consider special case Σxu(t)=0, i.e., x and u are uncorrelated, so we have Lyapunov iteration T T Σx(t +1) = AΣx(t)A + BΣu(t)B , which is stable if and only if A is stable if A is stable and Σu(t) is constant, Σx(t) converges to Σx, called the steady-state covariance, which satisfies Lyapunov equation T T Σx = AΣxA + BΣuB thus, we can calculate the steady-state covariance of x exactly, by solving a Lyapunov equation (useful for starting simulations in statistical steady-state) The Kalman filter 8–4 Example we consider xt+1 = Axt + wt, with 0.6 −0.8 A = , 0.7 0.6 where wt are IID N (0, I) eigenvalues of A are -
Generating Time Series with Diverse and Controllable Characteristics
GRATIS: GeneRAting TIme Series with diverse and controllable characteristics Yanfei Kang,∗ Rob J Hyndman,† and Feng Li‡ Abstract The explosion of time series data in recent years has brought a flourish of new time series analysis methods, for forecasting, clustering, classification and other tasks. The evaluation of these new methods requires either collecting or simulating a diverse set of time series benchmarking data to enable reliable comparisons against alternative approaches. We pro- pose GeneRAting TIme Series with diverse and controllable characteristics, named GRATIS, with the use of mixture autoregressive (MAR) models. We simulate sets of time series using MAR models and investigate the diversity and coverage of the generated time series in a time series feature space. By tuning the parameters of the MAR models, GRATIS is also able to efficiently generate new time series with controllable features. In general, as a costless surrogate to the traditional data collection approach, GRATIS can be used as an evaluation tool for tasks such as time series forecasting and classification. We illustrate the usefulness of our time series generation process through a time series forecasting application. Keywords: Time series features; Time series generation; Mixture autoregressive models; Time series forecasting; Simulation. 1 Introduction With the widespread collection of time series data via scanners, monitors and other automated data collection devices, there has been an explosion of time series analysis methods developed in the past decade or two. Paradoxically, the large datasets are often also relatively homogeneous in the industry domain, which limits their use for evaluation of general time series analysis methods (Keogh and Kasetty, 2003; Mu˜nozet al., 2018; Kang et al., 2017).