Inference for Nonlinear Dynamical Systems

2022-45 Workshop on Theoretical Ecology and Global Change 2 - 18 March 2009 Inference for nonlinear dynamical systems PASCUAL SPADA Maria Mercedes University of Michigan Department of Ecology and Evolutionary Biology 2019 Natural Science Bldg., 830 North University Ave. Ann Arbor MI 48109-1048 U.S.A. Inference for nonlinear dynamical systems E. L. Ionides†‡, C. Breto´ †, and A. A. King§ †Department of Statistics, University of Michigan, 1085 South University Avenue, Ann Arbor, MI 48109-1107; and §Department of Ecology and Evolutionary Biology, University of Michigan, 830 North University Avenue, Ann Arbor, MI 48109-1048 Edited by Lawrence D. Brown, University of Pennsylvania, Philadelphia, PA, and approved September 21, 2006 (received for review April 19, 2006) Nonlinear stochastic dynamical systems are widely used to model (16, 17). Important climatic fluctuations, such as the El Ninõ systems across the sciences and engineering. Such models are natural Southern Oscillation (ENSO), affect temperature and salinity, to formulate and can be analyzed mathematically and numerically. and operate on a time scale comparable to that associated with However, difficulties associated with inference from time-series data loss of immunity (18, 19). Therefore, it is critical to disentangle about unknown parameters in these models have been a constraint the intrinsic dynamics associated with cholera transmission on their application. We present a new method that makes maximum through the two main pathways and with loss of immunity, from likelihood estimation feasible for partially-observed nonlinear sto- the extrinsic forcing associated with climatic fluctuations (20). chastic dynamical systems (also known as state-space models) where We consider a model for cholera dynamics that is a continuous- this was not previously the case. The method is based on a sequence time version of a discrete-time model considered by Koelle and of filtering operations which are shown to converge to a maximum Pascual (20), who in turn followed a discrete-time model for likelihood parameter estimate. We make use of recent advances in measles (21). Discrete-time models have some features that are nonlinear filtering in the implementation of the algorithm. We apply accidents of the discretization; working in continuous time avoids the method to the study of cholera in Bangladesh. We construct this, and also allows inclusion of covariates measured at disparate confidence intervals, perform residual analysis, and apply other time intervals. Maximum likelihood inference has various conve- diagnostics. Our analysis, based upon a model capturing the intrinsic nient asymptotic properties: it is efficient, standard errors are nonlinear dynamics of the system, reveals some effects overlooked by available based on the Hessian matrix, and likelihood can be previous studies. compared between different models. The transformation- maximum likelihood ͉ cholera ͉ time series invariance of maximum likelihood estimates allows modeling at a natural scale. Non-likelihood approaches typically require a variance-stabilizing transformation of the data, which may confuse tate space models have applications in many areas, including scientific interpretation of results. Some previous likelihood-based Ssignal processing (1), economics (2), cell biology (3), mete- methods have been proposed (22–25). However, the fact that orology (4), ecology (5), neuroscience (6), and various others non-likelihood-based statistical criteria such as least square predic- (7–9). Formally, a state space model is a partially observed tion error (26) or gradient matching (27) are commonly applied to Markov process. Real-world phenomena are often well modeled ecological models of the sort considered here is evidence that as Markov processes, constructed according to physical, chem- ical, or economic principles, about which one can make only likelihood-based methods continue to be difficult to apply. Recent noisy or incomplete observations. advances in nonlinear analysis have brought to the fore the need for It has been noted repeatedly (1, 10) that estimating parameters improved statistical methods for dealing with continuous-time for state space models is simplest if the parameters are time-varying models with measurement error and covariates (28). random variables that can be included in the state space. Estimation Maximum Likelihood via Iterated Filtering of parameters then becomes a matter of reconstructing unobserved random variables, and inference may proceed by using standard A state space model consists of an unobserved Markov process, xt, techniques for filtering and smoothing. This approach is of limited called the state process and an observation process yt. Here, xt takes ޒdx ޒdy value if the true parameters are thought not to vary with time, or values in the state space , and yt in the observation space . The ␪ to vary as a function of measured covariates rather than as random processes depend on an (unknown) vector of parameters, ␪,inޒd . variables. A major motivation for this work has been the observa- Observations take place at discrete times, t ϭ 1,...,T;wewritethe tion that the particle filter (9–13) is a conceptually simple, flexible, vector of concatenated observations as y1:T ϭ (y1, ..., yT); y1:0 is and effective filtering technique for which the only major drawback defined to be the empty vector. A model is completely specified by was the lack of a readily applicable technique for likelihood the conditional transition density f(xt͉xtϪ1, ␪), the conditional dis- maximization in the case of time-constant parameters. The contri- tribution of the observation process f(yt͉y1:tϪ1, x1:t, ␪) ϭ f(yt͉xt, ␪), and bution of this work is to show how time-varying parameter algo- the initial density f(x0͉␪). Throughout, we adopt the convention that rithms may be harnessed for use in inference in the fixed-parameter f(⅐͉⅐) is a generic density specified by its arguments, and we assume case. The key result, Theorem 1, shows that an appropriate limit of that all densities exist. The likelihood is given by the identity ͉␪ ϭ͟T ͉ ␪ time-varying parameter models can be used to locate a maximum f(y1:T ) tϭ1 f(yt y1:tϪ1, ). The state process, xt, may be defined of the fixed-parameter likelihood. This result is then used as the in continuous or discrete time, but only its distribution at the basis for a procedure for finding maximum likelihood estimates for discrete times t ϭ 1,..., T directly affects the likelihood. The previously intractable models. challenge is to find the maximum of the likelihood as a function We use the method to further our understanding of the of ␪. mechanisms of cholera transmission. Cholera is a disease en- demic to India and Bangladesh that has recently become rees- tablished in Africa, south Asia, and South America (14). It is Author contributions: E.L.I., C.B., and A.A.K. performed research, analyzed data, and wrote highly contagious, and the direct fecal–oral route of transmission the paper. is clearly important during epidemics. A slower transmission The authors declare no conflict of interest. pathway, via an environmental reservoir of the pathogen, Vibrio This article is a PNAS direct submission. cholerae, is also believed to be important, particularly in the Abbreviations: ENSO, El Nin˜ o Southern Oscillation; MIF, maximum likelihood via iterated initial phases of epidemics (15). The growth rate of V. cholerae filtering; MLE, maximum likelihood estimate; EM, expectation—maximization. depends strongly on water temperature and salinity, which can ‡To whom correspondence should be addressed. E-mail: [email protected]. fluctuate markedly on both seasonal and interannual timescales © 2006 by The National Academy of Sciences of the USA 18438–18443 ͉ PNAS ͉ December 5, 2006 ͉ vol. 103 ͉ no. 49 www.pnas.org͞cgi͞doi͞10.1073͞pnas.0603181103 The basic idea of our method is to replace the original model with a closely related model, in which the time-constant parameter ␪ is replaced by a time-varying process ␪t. The densities f(xt͉xtϪ1, ␪), f( yt͉xt, ␪), and f(x0͉␪) of the time-constant model are replaced by f(xt͉xtϪ1, ␪tϪ1), f( yt͉xt, ␪t), and f(x0͉␪0). The process d␪ ␪t is taken to be a random walk in ޒ . Our main algorithm (Procedure 1 below) and its justification (Theorem 1 below) depend only on the mean and variance of the random walk, which Fig. 1. Diagrammatic representation of a model for cholera population dynamics. Each individual is in S (susceptible), I (infected), or one of the classes are defined to be Rj (recovered). Compartments B, C, and D allow for birth, cholera mortality, ͓␪ ͉␪ ͔ ϭ ␪ ͑␪ ͉␪ ͒ ϭ ␴2͚ and death from other causes, respectively. The arrows show rates, interpreted E t tϪ1 tϪ1 Var t tϪ1 as described in the text. ͓␪ ͔ ϭ ␪ ͑␪ ͒ ϭ ␴2 2͚ [1] E 0 Var 0 c . In practice, we use the normal distributions specified by Eq. 1. Here, natural selection without mutation, is rather ineffective. This ex- ␴ and c are scalar quantities, and the new model in Eq. 1 is identical plains heuristically why Procedure 1 is necessary to permit inference ␴ ϭ to the fixed-parameter model when 0. The objective is to obtain for fixed parameters via particle filtering. However, Procedure 1 ␪ ␴ 3 ͚ an estimate of by taking the limit as 0. is typically a diagonal and the theory given below apply more generally, and could be ␪ matrix giving the respective scales of each component of ; more implemented using any suitable filter. generally, it can be taken to be an arbitrary positive–definite symmetric matrix. Procedure 1 below is standard to implement, as Example: A Compartment Model for Cholera the computationally challenging step 2(ii) requires using only well In a standard epidemiological approach (29, 30), the population is studied filtering techniques (1, 13) to calculate divided into disease status classes. Here, we consider classes labeled 1 ␪ ϭ ␪ ͑␪ ␴͒ ϭ ͓␪ ͉ ͔ susceptible (S), infected and infectious (I), and recovered (R ,..., ˆt ˆt , E t y1:t k [2] R ).

Inference for Nonlinear Dynamical Systems

Comparison of Filtering Methods for the Modeling and Retrospective Forecasting of Influenza Epidemics

Application of Iterated Filtering to Stochastic Volatility Models Based on Non-Gaussian Ornstein-Uhlenbeck Process

Iterated Filtering 3

Comparison of Filtering Methods for the Modeling and Retrospective Forecasting of Influenza Epidemics

Iterated Filtering Methods for Markov Process Epidemic Models Arxiv

Application of Iterated Filtering to Stochastic Volatility Models Based on Non-Gaussian Ornstein-Uhlenbeck Process

Statistical Inference for Partially Observed Markov Processes Via the R Package Pomp

Iterated Filtering Algorithms

Statistical Inference for Partially Observed Markov Processes Via the R Package Pomp

Iterated Filtering and Smoothing with Application to Infectious Disease Models

Inference for Dynamic and Latent Variable Models Via Iterated, Perturbed Bayes Maps

Pomp’ July 31, 2016 Type Package Title Statistical Inference for Partially Observed Markov Processes Version 1.7 Date 2016-07-26