Comput. Methods Appl. Mech. Engrg. 196 (2007) 5063–5083 www.elsevier.com/locate/cma
New forms of extended Kalman filter via transversal linearization and applications to structural system identification
Shuva J. Ghosh, D. Roy *, C.S. Manohar
Structures Lab, Department of Civil Engineering, Indian Institute of Science, Bangalore 560 012, India
Received 28 September 2006; received in revised form 2 July 2007; accepted 5 July 2007 Available online 25 July 2007
Abstract
Two novel forms of the extended Kalman filter (EKF) are proposed for parameter estimation in the context of problems of interest in structural mechanics. These filters are based on variants of the derivative-free locally transversal linearization (LTL) and multi-step transversal linearization (MTrL) schemes. Thus, unlike the conventional EKF, the proposed filters do not need computing Jacobian matrices at any stage. While the LTL-based filter provides a single-step procedure, the MTrL-based filter works over multiple time-steps and finds the system transition matrix of the conditionally linearized vector field through Magnus’ expansion. Other major advantages of the new filters over the conventional EKF are in their superior numerical accuracy and considerably less sensitivity to the choice of time- step sizes. A host of numerical illustrations, covering single- as well as multi-degree-of-freedom oscillators with time-invariant parameters and those with discontinuous temporal variations, are presented to confirm the numerical advantages of the novel forms of EKF over the conventional one. 2007 Elsevier B.V. All rights reserved.
Keywords: Extended Kalman filter; Transversal linearization; Magnus’ expansion; State estimation; Structural system identification
1. Introduction filter. More recent developments have encompassed studies on evaluation of posterior pdf using Monte-Carlo simula- Dynamic state estimation techniques offer an effective tion procedures. These methods, styled as particle filters, framework for time-domain identification of structural sys- are computationally intensive but have wide-ranging capa- tems. These methods have their origin in the area of control bilities in terms treating nonlinearity and non-Gaussian of dynamical systems with the Kalman filter [19] being one features (see, for example, [9,34]). of the well-known tools of this genre. The filter provides In the area of structural engineering problems, dynamic the exact posterior probability density function (pdf) of state estimation methods have been developed and applied the states of linear dynamical systems with linear measure- in the context of structural system identification [49,15,10], ment model, with additive Gaussian noise processes. input identification [18,26], and active structural control Recent research in this area is focused on extending the (see, for example, [40]). In these applications, the process state estimation capabilities to cover nonlinear dynamical equations are typically derived based on the finite element systems and non-Gaussian additive/multiplicative noises. method (FEM) of discretizing structural systems. These Early efforts in dealing with nonlinear dynamical systems equations represent the condition of dynamic equilibrium were focused on application of linearization techniques and often constitute a set of ordinary differential equations which have led to the development of the extended Kalman (ODEs). The measurements are typically made on displace- ments, velocities, accelerations, strains and/or support reactions. Furthermore, the process and measurement * Corresponding author. Tel.: +91 8022933129; fax: +91 8023600404. equations are often taken to be contaminated by additive E-mail address: [email protected] (D. Roy). Gaussian noise processes. These noises themselves are
0045-7825/$ - see front matter 2007 Elsevier B.V. All rights reserved. doi:10.1016/j.cma.2007.07.004 5064 S.J. Ghosh et al. / Comput. Methods Appl. Mech. Engrg. 196 (2007) 5063–5083 modeled as a vector of white-noise processes or, more gen- stage. In addition to the laborious and possibly inaccurate erally, as a vector of filtered white-noise processes. In this nature of such an exercise in the context of finite element context the classical Kalman filter itself has limited applica- applications, this assumes importance in systems having tions since interest is often focused on systems with nonlin- vector fields with C0 continuity. Also, it has been validated ear mechanical properties. Consequently, studies dealing by the authors [12] that, due to less propagation of local with application of EKF methods to structural system errors, these methods work for considerably higher step identification have been undertaken [49,15,10]. Though in sizes. Moreover, a certain class of these methods (MTrL- most of the studies, the parameters to be estimated are I, [12,13]) qualifies as one of geometric integrators and essentially time-invariant, time varying parameter identifi- hence may be elegantly applied to preserve the invariants cation problems have also been studied [43,24,48,7]. One of motion. Thus, they offer an effective solution for a more of the important issues in the identification of structural general class of problems than methods based on tangential parameters is the interplay of the various uncertainties in (path-wise) linearization. Consequently, it is of consider- modeling and measurements and their effect on the vari- able interest to explore the application of these integration ability of the identified parameters. The variance of the sys- schemes in the context of deriving discrete equations for tem parameters, conditioned on measurements, essentially process equations in dynamic state estimation problems captures this feature. If this variance itself is computed in arising in structural system identification. an approximate manner, as is the case, for instance, when The present work accordingly proposes two alternative EKF or any suboptimal filtering strategy are used, satisfac- forms of the extended Kalman filter on the basis of the the- tory interpretation of the parameter variability becomes ories of local and multi-step transversal linearizations. The difficult. The works of Koh and See [22], Kurita and Mat- LTL method, originally proposed as a numeric-analytic sui [23] and Sanayei et al. [39] contain discussions on principle for solutions of nonlinear ODEs, can be consid- related issues. Recently applications of particle filter tech- ered as a bridge between the classical integration tools niques to structural system identification have been and the geometric methods. The essence of the LTL reported [29,7]) and control [38]. method is to exactly reproduce the vector fields of the gov- One of the main ingredients of filtering techniques erning nonlinear ODEs at a chosen set of grid points and applied to structural dynamic system is the discretization then to determine the discretized solutions at the grid of the governing equations of motion into a set of discrete points through a transversality argument. The condition- maps. In this context, it is of interest to note that the solu- ally linearized vector fields may be non-uniquely chosen tion of equilibrium equations in the finite element frame- in such a way that their form precisely matches the given work has been widely researched (see, for example, [3]). nonlinear vector field and are transversal to this field at Various methods such as central difference, Wilson-theta, least at the grid points. The recent multi-step extension of Newmark-beta, etc., have been developed and are routinely the LTL, referred to as the MTrL method, uses the solu- available in commercial finite element packages. Questions tions of transversally linearized ODEs in closed-form and on spectral stability and accuracy of these schemes have hence distinguishable from other multi-step, purely numer- thus been discussed. Most studies that combine finite ele- ical, implicit methods. Owing to the condition of transver- ment structural modeling with dynamic state estimation sality, the need for any differentiation of the original vector methods employ either forward or central difference-based field is eminently avoided in this class of methods. schemes for discretizing the governing equations (see, for It is theoretically impossible to derive a time-invariant example [6]). Alternatively, in the context of nonlinear linear dynamical system whose evolution will, in general, structural mechanics problems, a closed-form expression precisely match that of a given nonlinear system over any of the state transition matrix (STM) is employed on the lin- finite time interval. Over a sufficiently small time interval, earized equations of motion to obtain a discrete map for fil- however, the nonlinear equations may be conditionally lin- tering applications (see, for example [49]). Thus it appears earized which is presently accomplished using the still- that the relationship between the methods of integrating unknown, instantaneous estimates to be determined at governing equation of motion and the formulation of dis- the grid points over the interval. Once these conditionally crete maps for dynamic state estimation techniques has linearized equations are derived, a discrete Kalman filtering not been adequately explored in structural engineering algorithm may be applied to get estimates of states of the applications. The present work is in the broad domain of system. It should be understood that, the LTL-based Kal- addressing this issue. man filter, dealing with conditionally linearized equations, In the recent years, the present authors have been results in implicit and nonlinear algebraic equations involved in research related to the development of thereby necessitating the use of either a Newton–Raphson numeric-analytic solution techniques for nonlinear ODEs or a fixed-point iteration scheme to solve for the instanta- and stochastic differential equations (SDE) based on the neous estimates. The basic idea is that the constructed con- concept of transversality, namely, locally transversal linear- ditionally linearized estimate manifold is made to ization (LTL) [37,35] and multi-step transversal lineariza- transversally intersect the nonlinear (true) estimate mani- tion (MTrL) [36,12,13] methods. These methods fold at time instants where the estimate vectors need to completely avoid gradient (Jacobian) calculations at any be determined. While choices for conditionally linearized S.J. Ghosh et al. / Comput. Methods Appl. Mech. Engrg. 196 (2007) 5063–5083 5065
LTL vector fields are non-unique (i.e., infinitely many), the external (non-parametric and deterministic) force vec- solutions are, more or less, insensitive to the specific chosen tor. The components of the process noise vector m forms. The condition of intersection of manifolds at the nsðtÞ2R are taken to be independently evolving Gaussian grid points provides a set of constraint equations whose white noise processes. Gb is an n · m noise distribution ma- zeros are the instantaneous estimates. For the MTrL-based trix. The noise term in Eq. (1) represents the effects of mod- Kalman Filter, the linearization is implicitly achieved using eling errors and errors in applying the test signals. an interpolating expansion of part(s) of the nonlinear vec- Presently, we assume that elements of QðX ; X_ ; tÞ and FbðtÞ tor functions in terms of the unknown estimates at p (for an are Ck functions, k 2 Zþ,inX and t with k P 1. Define order p MTrL) grid points. We emphasize that this scheme the new state vector as yields the estimates at p points simultaneously (via a single T T T linearization step) and, to the authors’ knowledge, it is not ZðtÞ¼fZsðtÞ ZpðtÞ g ; ð2Þ an aspect that has been explored earlier. However, the pro- where ZðtÞ2R2nþl is the extended state vector at time t, posed filtering algorithms still use a sequential linear esti- Z (t) is the vector of states of the original system, i.e., mation theory in the form of the Kalman filter and thus s Z ðtÞ¼fX TX_ TgT, Z 2 Rl is the vector of model parame- complete information about the posterior probability den- s p ters (e.g., stiffness, damping or any nonlinear stiffness or sity function of the states is not obtainable. We have used a damping parameters to be identified). Furthermore, n is weighted global iteration procedure (WGI) [41] with the the number of degrees of freedom (DOF) and l is the total proposed filters replacing the conventional EKF. Several number of model parameters that need to be identified. For numerical examples, covering both linear and nonlinear a general time-invariant parameter identification problem, structural behavior, are considered to verify the potential the state space equations for the dynamical system in Eq. and relative merits of the method. Along with a linear (1) has the following representation: multi-degree-of-freedom (MDOF) shear building, a hard- ening Duffing oscillator and a hysteric oscillator (Bouc– _ e e ZsðtÞ¼g~ðZsðtÞ; ZpðtÞ; tÞþF ðtÞþGðtÞnsðtÞ; ð3Þ Wen hysteresis model) have been used for numerical study. _ In each case, the forward system model is simulated on a ZpðtÞ¼0 þ npðtÞ; ð4Þ computer using the stochastic Heun’s method to generate where g~ : R2n Rl R ! R2n, Fe : R ! R2n and Ge 2 R2n m synthetic noisy measurements, which are fed to the filtering are respectively the augmented state-dependent drift vec- algorithm to arrive at estimates of the model parameters. tor, forcing vector and process noise distribution matrix with the last two being appropriately padded by zeros 2. Process and measurement equations wherever needed. Note that the model parameters are ex- pressed in terms of elements of Zp(t), which are declared l Identification of any dynamical system requires, in gen- as the additional states of the system. npðtÞ2R is a vector eral, two models. Firstly, a mathematical model describing of independent white noise processes that are small random the evolution of state with time, i.e., the process equations, disturbances to states and may be interpreted as an artifi- to idealize the dynamics of the physical system, and, sec- cial evolution of the model parameters in time [8,25].It ondly, measurement (or observation) equations, relating should be noted that once the system identification prob- the measured quantities to the state variables, should be lem is formulated by declaring the unknown model param- available. Both these models include noise terms that serve eters as additional states of the system, even a problem to account for unmodeled dynamics in the process equa- without any mechanical nonlinearity is transferred to the tions and measurement errors in the observation equations. domain of nonlinear filtering. It is common practice to model the noise terms as Gaussian The two Eqs. (3) and (4) may be combined to obtain white noise processes. In time-domain system identification _ problems, the parameters of the mathematical model are ZðtÞ¼gðZðtÞ; tÞþF ðtÞþGðtÞnðtÞ; ð5Þ declared as additional states so that any acceptable algo- 2nþl ð2nþlÞ ð2nþlÞ where g : R R ! R ; F : ½t0; T R ! R ; rithm for state estimation becomes applicable for these 2n l G(t)isa(2n + l) · (2n + l) matrix and nðtÞ2R þ . The problems as well. In the general context of engineering observation equation may be written as dynamical systems, one generally starts with the following set of second order, nonlinear ODEs, derivable via a spatial Y ðtÞ¼h~ðZðtÞ; tÞþgðtÞ: ð6Þ projection technique (such as the Petrov–Galerkin) applied Here Y ðtÞ2Rr; h~ : Rð2nþlÞ R ! Rr; gðtÞ2Rr is a vector to the governing PDEs: of independent white noise processes and r 6 2n. The noise € _ _ b b term g(t) in Eq. (6) is used to model measurement errors MX þ CX þ KX þ QðX ; X ; tÞ¼F ðtÞþGðtÞnsðtÞð1Þ and possible errors in relating the observed quantities with n with X 2 R , t 2½t0; T R, M being the mass matrix, C the state variables. The above representation of the obser- the viscous damping matrix, K the stiffness matrix, vation equation is formal and observations, in practice, QðX ; X_ ; tÞ : Rn Rn R ! Rn the nonlinear and/or para- would be made at a set of discrete time instants so the mea- b n metric part of the vector field, and F : ½t0; T R ! R surement equation may be realistically represented as 5066 S.J. Ghosh et al. / Comput. Methods Appl. Mech. Engrg. 196 (2007) 5063–5083
~ Y k ¼ Y ðtkÞ¼hðZðtkÞ; tkÞþgðtkÞ; ð7Þ where Rk+1 is the covariance matrix of the measurement noise where tk = kDt, Dt being the sampling interval and g(tk) "# represents a sequence of r independent Gaussian random oh~ðZ; tÞ variables with covariance matrix R . As has been men- H kþ1 ¼ : ð15Þ k oZ b tioned in Section 1, the objective of the present study is ZðtÞ¼Z kþ1jk ;tkþ1 to develop two new forms of EKF based on transversal 4. Obtain the new estimate Zb of state and the associ- intersection concepts, namely the LTL-based EKF and kþ1jkþ1 ated error covariance matrix P as the MTrL-based EKF. As a prelude to that we briefly out- k+1jk+1 b b ~ b line the steps involved in the estimation of unknown struc- Z kþ1jkþ1 ¼ Z kþ1jk þ Kkþ1½Y kþ1 hðZ kþ1jk; tkÞ ; ð16Þ tural parameters by the conventional EKF. T P kþ1jkþ1 ¼½I Kkþ1H kþ1 P kþ1jk½I Kkþ1H kþ1 T 3. Structural system identification by EKF þ Kkþ1Rkþ1Kkþ1: ð17Þ Estimation of model parameters by EKF has been widely We begin by considering the process Eq. (1) and the studied. Asymptotic convergence results are also available measurement Eq. (7) and summarize the recursive algo- [27]. Since EKF is based on a linearization of the state rithm involved in implementing the EKF [49]. Given an ini- b and observation equations over each time-step (only up tial estimate Z 0j0 of the state and the error covariance P0j0, to first-order terms in the Taylor expansion are retained) start the loop. and a subsequent usage of the linear estimation theory in For k ¼ 0; 1; 2; 3 ...; m and ðt 2ð0; tm Þ, perform the fol- the form of Kalman filter, the deviation of the estimated lowing steps: trajectory form the true trajectory increases with time b and the significance of higher order terms in the Taylor 1. Begin with the filter estimate Z kjk and its error covari- expansion also increases. Due to the accumulation and ance matrix Pk k. j b propagation of local errors, there is a distinct possibility 2. Evaluate the predicted state Z kþ1jk and the predicted of divergence, especially if the EKF starts off with poor ini- error covariance matrix Pk+1jk by Z tial estimates (see, for instance, [44,14,4]). This also implies t no b b kþ1 b that an increased sensitivity of numerical solutions to step- Z kþ1jk ¼ Z kjk þ gðZ tjkðtÞÞ þ F ðtÞ dt; ð8Þ sizes can potentially lead to biased estimates [6]. tk T P kþ1jk ¼ Uðtkþ1; tkÞP kjkU ðtkþ1; tkÞþQkþ1; ð9Þ
where Uðtkþ1; tkÞ is the state transition matrix of the lin- 4. Formulations of LTL- and MTrl-based EKFs earized system and is given by We will presently employ the concepts of transversal Uðt ; t Þ¼exp½ðr Þðt t Þ : ð10Þ kþ1 k k kþ1 k intersection to develop two new forms of the EKF, which Here bear a resemblance to a collocation method in the sense ogðZ; tÞ that the filter estimates will be equal to the ‘optimal’ esti- r ¼ ð11Þ mates at the points of discretization and not necessarily k oZ b ZðtÞ¼Z kjk ;tk over the entire time interval. Nevertheless, we note that is the (2n + l) · (2n + l) system Jacobian matrix (that the estimate for a general nonlinear system via the pro- obtains the first-order Taylor increment of the nonlinear posed as well a conventional forms of the EKF is not function g) and the mean square estimate, as obtainable (in principle) Z Z through a strictly nonlinear filter (such as a Monte-Carlo tkþ1 tkþ1 T filter). Indeed the ‘optimal’ solution presently refers to a Qkþ1 ¼ Uðtkþ1; uÞGðuÞE½nðuÞn ðvÞ tk tk minimum mean square estimator (MMSE) based on the T T G ðvÞU ðtkþ1; vÞdudv; ð12Þ assumption of a Gaussian posterior probability density function. where Qk+1 is the process error covariance matrix (E(Æ) denotes the expectation operator). Note that, following 4.1. The LTL-based EKF linearization over the time interval ðtk; tkþ1 , the approx- imate solution to Eq. (5) is given by Z The general idea of the LTL-based EKF for system t b identification is to replace the nonlinear vector field of ZðtÞ¼Uðt; tkÞZ kjk þ Uðtk; uÞ½F ðuÞþGðuÞnðuÞ du: the process equation by a time-invariant conditionally lin- t k earized vector field over a time-step and an application of 13 ð Þ Kalman filter based on the estimates at the right end of 3. The Kalman gain is given by the time-step. Consider Eq. (5) corresponding to a nonlin- ear dynamical system. Let the time interval of interest T T 1 Kkþ1 ¼ P kþ1jkH kþ1 H kþ1P kþ1jkH kþ1 þ Rkþ1 ; ð14Þ ð0; T R be divided into m sub-intervals with S.J. Ghosh et al. / Comput. Methods Appl. Mech. Engrg. 196 (2007) 5063–5083 5067
0 ¼ t0 < t1 < t2 < ti < < tm ¼ T and hi = ti+1 ti, gðZ; tÞ¼BðZ; tÞZ; ð22Þ i Zþ. It is now required to replace the process equation 2 h~ðZ; tÞ¼HðZ; tÞZ ð23Þ by a suitably chosen set of m transversally linearized vector equations, wherein the solution to the i-th element of the with BðZ; tÞ and HðZ; tÞ being finite quantities. Eqs. (20) set should be, in a sense, a representative of the nonlinear and (21) may be viewed as being conditionally linear with flow over the i-th time-step. Note that when the measure- constant coefficients, given the value of the still-unknown m b b ment relationship given by Eq. (6) is nonlinear, addi- estimated quantities Z i ¼ ZðtiÞ. Now the proposed filter tional sets of conditionally linearized algebraic vector may be summarized as follows: equations have also to be chosen. We define T : (t ,t ]. b i ¼ i i+1 Given initial estimates of state Z 0j0 and the error covari- For a given set of measurements with the process equations ance matrix P0j0, loop over k ¼ 0; 1; 2; 3 ...; m for precisely known, the optimal estimate of the vector state ðt 2ð0; tm Þ using the following steps: t(Æ) (the sense of optimality has been enunciated earlier) may be thought of as a flow evolving on a compact mani- b 1. Start with filter estimates Z kjk and its error covariance fold M such that matrix Pk k. j b 2. Evaluate the predicted state Z kþ1jk as the LTL-based t : M R ! M ð18Þ solution of the following nonlinear differential equation k k b is C (k P 0) on M. Then, for any t, t(Z)isaC diffeomor- at t = tk+1 subject to the initial condition Z kjk at t = tk: phism M ! M. Let ðZi; tiÞ and ðZiþ1; tiþ1Þ2M denote a pair of points on the estimate manifold (with a local struc- b_ b Z tjtk ¼ gðZ tjtk ; tÞþF ðtÞ; ð24Þ 2nþlþ1 ture of R ). Now one canP constructP a couple of 2n + l- dimensional cross-sections i and iþ1 transverse to the which is the process Eq. (5) without noise. This is be- vectorP field at ðZi; tiÞPand ðZiþ1; tiþ1Þ, respectively, such that cause the prediction stage basically provides the ex- Zi 2 i and Ziþ1 2 iþ1. At this point we can view the pected values of the state at the forward time-points P P and hence it is reasonable to ignore the additive zero- Kalman filter as a map Ke : Rr R R ! such i iþ1 mean white noise at this stage. that 3. Write down the exact solution of the conditionally line- e Ziþ1 ¼ KðZi; Y iþ1; ti; hiÞ; ð19Þ arized equation as Z where Y 2 Rr is the observation vector at the (i + 1)th t iþ1 ZðtÞ¼Uðt; t ; Zb ÞZb þ UðZb ; t ; uÞ time instant. Ke may be thought of as a Poincare (strobo- k kþ1jkþ1 kjk kþ1jkþ1 k tk scopic) map as in the case of a nonlinear flow. The principle ½F ðuÞþGðuÞnðuÞ du; ð25Þ of an LTL-based EKF is thus to generate a set of condi- tionally linear equations which, when used with a Kalman where t 2ðt ; t and Uðt; t ; Zb Þ is the state transi- filter, gives rise to a map Kb that should be (ideally) identical k kþ1 k kþ1jkþ1 e tion matrix (or the fundamental solution matrix) of the to K for the same initial condition ðZi; tiÞ and the same linearized system. The latter is given by interval hi. It is conceivable that the estimate manifold of the proposed filter has to transversally intersect the optimal b b Uðt; tk; Z kþ1jkþ1Þ¼exp½ðBðZ kþ1jkþ1; tiÞðt tkÞ : ð26Þ estimate manifold at least at the grid points in 4. Estimate the predicted error covariance matrix P kþ1jk by S ¼ftiji ¼ 0; 1; 2; 3; ...g and not necessarily elsewhere. Remark 1: From a straightforward application of the P ðZb Þ¼Uðt ; t ; Zb ÞP UTðt ; t ; Zb Þ weak transversality theorem [2], it follows that mappings kþ1jk kþ1jkþ1 kþ1 k kþ1jkþ1 kjk kþ1 k kþ1jkþ1 b transversal to M at a given instant ti form an open and þ Qkþ1ðZ kþ1jkþ1Þ: everywhere dense set in the space of smooth maps ð27Þ f i : M ! M i, where M i denotes the set of all transversally intersecting smooth manifolds. Hence the derivation of a Here the process error covariance matrix Qkþ1 is given by conditionally linear system and therefore, the estimates of Z Z the proposed filter are non-unique and infinitely many. tkþ1 tkþ1 b T For instance, Eqs. (5) and (6) may give rise to the fol- Qkþ1 ¼ Uðtkþ1; u; Z kþ1jkþ1ÞGðuÞE½nðuÞn ðvÞ t t lowing LTL system (with conditionally constant coeffi- k k T T b G ðvÞU ðtkþ1; v; Z kþ1jkþ1Þdudv: ð28Þ cients) over the interval ðti; tiþ1 for purposes of state estimation using the Kalman filter: 5. The Kalman gain is given by _ b b b T b Z ¼ BðZ i; tiÞZ þ F ðtÞþGðtÞnðtÞ; ð20Þ Kkþ1ðZ kþ1jkþ1Þ¼P kþ1jkðZ kþ1jkþ1ÞH kþ1ðZ kþ1jkþ1; tkþ1Þ b b b Y ¼ HðZ i; tiÞZ þ gðtÞð21Þ ½P kþ1jkðZ kþ1jkþ1ÞH kþ1ðZ kþ1jkþ1; tkþ1ÞP kþ1jk ðZb ÞH T ðZb ; t ÞþR 1; provided that the vector functions g and h~ are decompos- kþ1jkþ1 kþ1 kþ1jkþ1 kþ1 kþ1 able as ð29Þ 5068 S.J. Ghosh et al. / Comput. Methods Appl. Mech. Engrg. 196 (2007) 5063–5083
where Now consider the sub-interval I1 and let it be ordered into p b smaller intervals as Y kþ1 ¼ H kþ1ðZ kþ1jkþ1; tkþ1ÞZ þ gðtÞ: ð30Þ b ð0 ¼ t0; t1 ; ðt1; t2 ; ...; ðtp 1; tp ¼ T 1 6. Obtain the new estimate of the state Z kþ1jkþ1 via with the time-step h = t t , where i 2 Zþ; i 6 p. Simi- b b ~ b i i+1 i Z kþ1jkþ1 ¼ Z kþ1jk þ Kkþ1½Y kþ1 hðZ kþ1jk; tkÞ : ð31Þ larly each succeeding interval is subdivided into p sub- Now the conditionally linearized estimate and the opti- intervals and so ordered that mal estimate will become instantaneously identical at ð0; T ¼fðt0; t1; t2; ...; tp ; ðtp; tpþ1; tpþ2; ...; t2p ;...; t if a root ðZb Þ of Eq. (31) can be found such k+1 kþ1jkþ1 ðt ; t ; t ; ...; t g; ð36Þ that ðM 1Þp ðM 1Þpþ1 ðM 1Þpþ2 Mp b b where tjp ¼ T j; j ¼ 1; 2; 3; ...; M. Z kþ1jkþ1 ¼ Z kþ1jkþ1: ð32Þ Let Si ¼fði 1Þp; ði 1Þp þ 1; ði 1Þp þ 2; ...; ipg b Zþ. The idea is to derive a linearized system for each In other words,T the constraint condition i.e. Z kþ1jkþ1; b sub-interval Ii,16 i 6 M. Moreover, following the concept Z kþ1jkþ1 2 M M i is automatically satisfied if a real and physically admissible root of the following transcen- of transversal linearization, it is intended that the state esti- dental equation can be found: mate computed through an application of the Kalman filter to the linearized equations and the ‘best’ estimates remain b b ~ b Z kþ1jkþ1 Z kþ1jk þ Kkþ1½Y kþ1 hðZ kþ1jk; tkÞ ¼ 0: ð33Þ identical at all points of discretization in that interval, viz. at t ; t ; t ; ...; t 2 I . This can be done through any standard root-finding ði 1Þp ði 1Þpþ1 ði 1Þpþ2 ip i For the problem of parameter estimation at hand based algorithm as the fixed-point iteration, Newton–Raphson on Eqs. (5) and (7), it is naturally sought to derive the lin- method, etc. earized system such that it is also (2n + l)-dimensional and 7. Once the true estimate is obtained as the solution of Eq. is obtainable from the given nonlinear system with simple (33), the error covariance matrix P associated k+1jk+1 and least alterations. Firstly, the part of time-axis, over with the new state may be estimated as which the estimates are sought, needs to be discretized T P kþ1jkþ1 ¼½I Kkþ1H kþ1 P kþ1jk½I Kkþ1H kþ1 and ordered as mentioned above. The integer M denotes the number of times the MTrL-based linearization proce- þ K R KT ; ð34Þ kþ1 kþ1 kþ1 dure has to be applied to get the estimates over the entire where the quantities (without bar) are computed by time interval of interest (0,T]. Each application of the substituting in the corresponding quantities (with bar) MTrL procedure produces the estimates of (2n + l)p dis- the true value of the estimates obtained from Eq. (33). cretized (scalar) state variables at p grid points over each sub-interval Ii. Thus it is a multi-step procedure for the esti- 4.2. The MTrL-based EKF mation problem at p points over each linearization step. The estimate at the right end of the interval Ii will provide The second class of the EKF, referred to as the MTrL- an initial estimate for the linearized system over Ii+1. based EKF, may be viewed as an extension and further For further elaboration of the methodology, Eqs. (5) generalization of the LTL-based EKF. The proposed filter and (6) are considered again and linearized over the inter- is based on the new and more flexible MTrL method [12] val Ii using the unknown estimates at p + 1 points as instead of the earlier version of the MTL method [36] follows: wherein the only way to treat the nonlinear terms was to Z_ ¼ BðZ ; tÞZ þ F ðtÞþGðtÞnðtÞ; ð37Þ convert them into conditional forcing functions. With the latest version of the MTrL applied to nonlinear structural Y ¼ HðZ ; tÞZ þ gðtÞð38Þ dynamics, the nonlinear damping and stiffness terms in provided that the vector functions g and h~ are decompos- the original system appear respectively as conditionally able as determinable damping and stiffness coefficients in the line- arized system as well. The fundamental solution matrix gðZ; tÞ¼BðZ; tÞZ; ð39Þ (FSM) or the state transition matrix of the linearized sys- h~ðZ; tÞ¼HðZ; tÞZ ð40Þ tem may be obtained through Magnus’ expansion [28], which is derivable based on the assumption that the linear- with BðZ; tÞ and HðZ; tÞ bounded over Ii. In the present ized system matrix is a sufficiently smooth Lie element. work Z is obtained through an interpolation over the b For the purpose of implementation, let the subset of the known initial vector estimate fZ i 1 p i 1 pg and unknown b ð Þ jð Þ time-axis over ð0; T be divided into M closed–open sub- vectors fZ kjkjk 2 Si and k 6¼ði 1Þpg at tk : k 2 Si. This intervals as expansion leads to an approximation of BðZ; tÞ and HðZ; tÞ over I and hence a conditionally linearized set of ð0; T ¼fI ; I ; I ; ...; I g i 1 2 3 1 equations over the i-th time interval. For instance, if one ¼fð0 ¼ T 0; T 1 ; ðT 1; T 2 ; ...; ðT N 1; T N ¼ T g: ð35Þ chooses these interpolating functions to be Lagrangian S.J. Ghosh et al. / Comput. Methods Appl. Mech. Engrg. 196 (2007) 5063–5083 5069 polynomials of degree p, Zb is approximated in a vector Recast, for notational convenience, Eqs. (37) and (38) as space Vp of polynomials spanned by the Lagrange polyno- mials such that the dimension of this approximating space Z_ ¼ WðtÞZ þ F ðtÞþGðtÞnðtÞ; ð43Þ is p. The set of interpolating Lagrange polynomials Y ¼ WH Z þ gðtÞ; ð44Þ fP kjk ¼ 0; 1; 2; ...; pg over Ii are given by Yip where WðtÞ¼BðZ; tÞ and WH ¼ HðZ; tÞ. Also let t tj P kðtÞ¼ ð41Þ tk tj b b b b j¼ði 1Þp Z I ¼fZ ði 1Þpþ1jði 1Þpþ1; Z ði 1Þpþ2jði 1Þpþ2; ...; Z ipjipgð45Þ and one may write denote the set of unknown estimates over Ii. Given initial b estimates of state Z 0j0 and the error covariance matrix Xip P , use a loop over i ¼ 1; 2; 3 ...; M,withðt 2ð0; T Þ,to b 0j0 Z ¼ P kðtÞZ kjk: ð42Þ implement the following steps: k¼ði 1Þp b 1. Start with filter estimates Z ði 1Þpjði 1Þp and its error One is not restricted in choosing only Lagrangian polyno- covariance matrix P . mials to arrive at the vector function Z . For instance, one (i 1)pj(i 1)p 2. Evaluate the predicted states fZb jj 2 S ; j 6¼ði 1Þ may use distributed approximating functionals [47] or jjði 1Þp i pg as solution of the following nonlinear differential wavelet-based [42] interpolating functions to obtain the equation at {tjjj 2 Si; k 5 (i 1)p} subject to the initial function. However, irrespective of the specific form of condition Zb at t = t interpolation, the MTrL-based linearized system corre- ði 1Þpjði 1Þp (i 1)p sponding to the nonlinear system (5) and (6) takes the form b_ b Z tjt i 1 p ¼ gðZ tjt i 1 p ; tÞþF ðtÞ; ð46Þ given by Eqs. (37) and (38). It must however be recorded ð Þ ð Þ that the use of such expansions is not quite consistent with which is the process Eq. (5) without noise. This is feasi- the nature of strong solutions of a stochastic differential ble as the prediction stage basically gives the expected equation under white noise. It is well-known that the solu- values of the estimate at the forward time-points and tion of an SDE is a semi-martingale and admits an expan- hence the additive zero-mean white noise (vector) term sion through a stochastic Taylor expansion (STE) (see, for is left out. The simultaneous prediction at p points instance, [21,30]). For instance, an STE over a time-step Dt may be achieved using the class II MTrL [12]. The class has terms with fractional powers of Dt and such an expan- II MTrL avoids the computationally cumbersome Mag- sion is therefore not spanned by the monomials 1, t, t2 and nus’ expansion by treating the nonlinear functions in the so on. However, we are presently interested in obtaining a process equation as a conditional forcing function and reasonably accurate and numerically stable state estimate, thereafter solves for the discretized state variables ensur- which is itself non-stochastic. Indeed, as substantiated with ing a transversal intersection of the original and the con- extensive numerical simulations, the errors in the computed ditionally linearized flows at the grid points through estimates owing to the relaxation of the mathematical rigor appropriate constraints. are not substantial as long as the additive noise intensity is 3. Discretize the conditionally linearized equation as in Eq. not too high. The rest of the procedure is quite similar to (25) and derive p sets of STMs and process noise covari- the LTL-based EKF with the exception that the state tran- ance matrices. The STMs are given by sition matrix for the linearized system with time dependent U :¼ Uðt ; t ; Zb ; Zb Þ¼expðXðtÞÞ; ð47Þ coefficients has to computed in a different manner (see j ði 1Þp j ði 1Þpjði 1Þp I
Appendix A) in the form of a series expansion following where {j 2 Si; j 5 (i 1)p} and are obtainable by substi- Magnus [28]. It should be noted that the STM U(t) for tuting W(t) for A(t) in Eq. (A.4), the limits of integration the linearized system over the interval Ii is obtained from being t(i 1)p to tj in the jth case. Denoting the error b b the originally nonlinear system of equations through condi- covariance matrix as Qj :¼ QjðZ ði 1Þpjði 1Þp; Z I Þ may be tional linearization using the state estimates at discrete obtained in an identical manner like Eq. (28) by using b b time-points i.e. Z ði 1Þp (known) and fZ kjk 2 Si; k 6¼ the STMs as obtained above.
ði 1Þpg (unknown) through Eq. (A.5) and thus the 4. Estimate the predicted error covariance matrices P jjði 1Þp, STM is a function of these unknown estimates at the grid where {j 2 Si; j 5 (i 1)p}, by points. Following a similar line of transversality argument b b T as in the LTL-based EKF, it is evident that for a given set P j :¼ P jjði 1ÞpðZ ði 1Þpjði 1Þp; Z I Þ¼UjP ði 1Þpjði 1ÞpUj þ Qj: of observations, the ‘best’ estimate manifold and the esti- ð48Þ mate manifold through the application of the Kalman filter to the linearized system may be made instantaneously iden- 5. The Kalman gains at {tjjj 2 Si; j 5 (i 1)p} are given by tical at the p grid points over I through enforcement of 1 i K :¼ K ðZb ; Zb Þ¼P WT P W P WT þ R : appropriate constraint equations (algebraic). The whole j j ði 1Þpjði 1Þp I j H j H j H j procedure can be summarized as follows. ð49Þ 5070 S.J. Ghosh et al. / Comput. Methods Appl. Mech. Engrg. 196 (2007) 5063–5083 b 6. Obtain new estimates of states fZ jjjjj 2 Si; j 6¼ði 1Þpg 5.1. Example-1 – a hardening, single-well Duffing oscillator via: Consider a hardening, single-well Duffing oscillator sub- b b b b b ~ b Z jjj :¼ Z jjjðZ ði 1Þpjði 1Þp; Z I Þ¼Z jjði 1Þp þ Kj½Y j hðZ jjði 1Þp;tjÞ : jected to sinusoidal driving force corrupted by a white- ð50Þ Gaussian noise: 3 Now the conditionally linearized estimates and the opti- m€x þ cx_ þ kx þ ax ¼ F cosðktÞþrpnðtÞ: ð54Þ mal estimates would become instantaneously identical at The problem at hand consists of estimating the parameters {t jj 2 S ; j 5 (i 1) p} if and only if zeros of the Eq. j i fc; k; ag from a set of observations related directly to the (31) can be found such that states. Accordingly, these are declared as the additional b states corresponding to an augmented system. The mass Z ¼ Zb s:t: fj 2 S ; j 6¼ði 1Þpg: ð51Þ jjj jjj i m has been set to 1 kg and hence does not appear in the In other words, the above constraint condition is auto- expressions to follow. For the present problem, unless men- matically satisfied if p sets of (2n + p) dimensional real tioned otherwise, the measured component (Y) will be the and physically admissible vector root of the following displacement (x) of the oscillator. Writing Eq. (54) in a (2n + l)p dimensional transcendental equation can be state space form and including the measurement model, found: the mathematical model for the estimation problem is pro- vided by the equations: b b ~ b Z jjj Z jjði 1Þp þ Kj½Y j hðZ jjði 1Þp; tjÞ ¼ 0 z_1 ¼ z2; s:t: fj 2 Si; j 6¼ði 1Þpg: ð52Þ 3 z_2 ¼ z4x1 z3z2 z5z1 þ F cosðktÞþrpnpðtÞ;
In order to avoid taking derivatives of the vector func- z_3 ¼ 0 þ rcncðtÞ; ð55Þ tions at all stages, we have presently used fixed-point z_4 ¼ 0 þ rkn ðtÞ; iteration in preference to the Newton–Raphson method. k 7. Once the true estimate is obtained as the solution of Eq. z_5 ¼ 0 þ ranaðtÞ (52), the associated error covariance matrix Pipjip associ- and ated with the new state at the last point of discretization in Ii can be estimated via Y ¼ z1 þ rY nY ðtÞ: ð56Þ
T T Here z1 = x and z2 ¼ x_ are the displacement and the veloc- P ipjip ¼½I KipWH P ipjip½I KipWH þ KipRipKip; ð53Þ ity states, respectively, z3, z4, z5 are the states in the ex- where the quantities (without bar) are computed by tended state space corresponding to fc; k; ag. rp is the substituting in the corresponding quantities (with bar) enveloping factor for the original process noise np(t) the true value of the estimate obtained from Eq. (53). whereas rc, rk, ra are the enveloping factors for the artifi- cial evolution of the respective parameters. rY is the envel- oping factor for the measurement noise. As has been 5. Numerical examples mentioned earlier, Eq. (56) is just a representative form and in practice the measurement can be represented as in Numerical illustrations on single and multi-DOF sys- Eq. (7). The specific forms of the conditionally linearized tems are provided in this section. The synthetic measure- equations used for different cases will be indicated at ment data have been generated on a computer by the appropriate places. However, the solution procedure by integrating the process SDE using the stochastic Heun’s the conventional EKF is well known and will not be elab- method. With these measurements and the associated sys- orated presently. tem models, the problem of parameter identification has For the LTL-based EKF, over the time-interval ðti 1; ti , been taken up by the three kinds of filters, viz. the conven- the conditionally-linearized system has the following state tional EKF, the LTL-based EKF and the MTrL-based space representation: 8 9 2 38 9 EKF. While the first problem involving a hardening Duf- > _ > > > > z1 > 01 0 0 0 > z1 > fing oscillator is extensively studied for a number of sub- > > 6 3 7> > <