An Improved Portmanteau Test for Autocorrelated Errors in Interrupted Time-Series Regression Models

An Improved Portmanteau Test for Autocorrelated Errors in Interrupted Time-Series Regression Models

Behavior Research Methods 2007, 39 (3), 343-349 An improved portmanteau test for autocorrelated errors in interrupted time-series regression models BRADLEY E. HUITEMA AND JOSEPH W. M CKEAN Western Michigan University, Kalamazoo, Michigan A new portmanteau test for autocorrelation among the errors of interrupted time-series regression models is proposed. Simulation results demonstrate that the inferential properties of the proposed QH–M test statistic are considerably more satisfactory than those of the well known Ljung–Box test and moderately better than those of the Box–Pierce test. These conclusions generally hold for a wide variety of autoregressive (AR), moving averages (MA), and ARMA error processes that are associated with time-series regression models of the form described in Huitema and McKean (2000a, 2000b). Several approaches are available for the analysis of in- on intervention-effect coefficients (Huitema, McKean, & terrupted time-series (or time-series intervention) experi- McKnight, 1994). Interestingly, the problem of distorted mental designs. The simplest version of this design has Type I error rates is also the main reason that these alter- two phases. The first phase (sometimes called the base- native estimation methods are recommended in the first line or preintervention phase) has n1 observations, and place. Hence, there are two reasons why it is important to the second phase (postintervention) has n2 observations. know whether the errors are autocorrelated. First, if the The purpose of the statistical analysis is to quantitatively errors are not autocorrelated, OLS is the preferred method describe and evaluate possible intervention effects. of estimation, and corrective methods should be avoided. Autoregressive integrated moving average (ARIMA) Second, if the errors are autocorrelated, corrective meth- models and time-series regression models are frequently ods are called for in order to maintain error rates at the satisfactory for this purpose. In the case of ordinary least- nominal level. For these reasons, it is appropriate to care- squares (OLS) regression models, one assumes that the fully scrutinize the residuals of the fitted OLS regression errors of the regression are independent. If this assump- equation for evidence of autocorrelated errors. tion is violated, the conventional hypothesis tests and Many tests have been developed to identify autocor- confidence intervals on intervention effect coefficients related errors; the classical Durbin–Watson (D–W) test are suspect (because they may be either conservative or (Durbin & Watson, 1950, 1951) is the best known. Two liberal, depending on the sign of the autocorrelation), and frequently cited problems with the D–W approach are that an alternative method of regression analysis designed to (1) the test statistic often falls in an inconclusive decision correct for dependency of the errors should be considered. region, and (2) the test was not designed to identify au- Common alternatives widely discussed in the econometric tocorrelated errors that are generated by processes other literature (see, e.g., Greene, 2000) and provided in popular than a first-order autoregressive model. software (such as SPSS 14.0) include several well-known Discussions of the inconclusive region problem are versions of feasible generalized least-squares estima- contained in virtually all econometrics and regression tors. These include the Cochrane–Orcutt, Prais–Winston, textbooks (see, e.g., Greene, 2000; Johnston, 1984; Kut- and maximum-likelihood methods. The purpose of these ner, Nachtsheim, & Neter, 2004). The problem is that the methods is to transform the regression equation in order to exact critical value for the test is a function of the specific remove first-order autocorrelation among the residuals of values in the design matrix. Because these values change the fitted equation. Although some researchers routinely with each application, the critical value is generally un- apply these methods without evaluating the need for them, known, but it is bounded. Durbin and Watson provided there is a price to be paid for using such methods when the tables (reproduced in many econometrics and regression errors are not autocorrelated. texts) that contain these bounds. These tables contain both Simulation work has shown that the application of these an upper and a lower critical value for each sample size methods in situations where there is no autocorrelation and number of predictors. If the obtained D–W test sta- among the errors leads to an increase in Type I error for tests tistic falls between the upper and lower critical values, B. E. Huitema, [email protected] 343 Copyright 2007 Psychonomic Society, Inc. 344 HUITEMA AND MCKEAN . the result is declared inconclusive. Solutions to the incon- hypothesis H0: ¬1 ¬2 ¬K 0. As is the case with the clusive region problem have been provided in the form B–P and L–B tests, the test statistic QH–M is essentially the sum of of special purpose computer routines (e.g., White, 1993) K ratios of squared autocorrelation estimates divided by their re- that are not found in most software packages, as well as spective error variance estimates. The proposed test differs from the conventional portmanteau tests in both the method used to estimate alternative tests (e.g., Huitema & McKean, 2000b). the autocorrelation coefficients and the method used to estimate the The second perceived problem (i.e., inadequate sensitiv- error variances. ity to errors generated by higher order error models) has led Previous work (Huitema & McKean, 2000b) has demonstrated to recommendations (see, e.g., Shumway, 1988) to apply that under the null hypothesis, the expected value of the conven- portmanteau autocorrelation tests that incorporate coeffi- tional autocorrelation estimator r1 is negatively biased by a term that cients computed at many lags in the autocorrelation func- is a function of the number of parameters in the intervention regres- sion model and the sample size. This term is added to each of the tion. Two well-known methods that do this are the Box– lag-1 through lag-K conventional autocorrelation estimates included Pierce (B–P) test (Box & Pierce, 1970) and the Ljung–Box in the test in order to reduce bias. It has also been demonstrated (L–B) test (Ljung & Box, 1978); both tests were originally that the error variance estimators incorporated in the conventional developed to evaluate errors of ARIMA models rather than B–P portmanteau test are positively biased with small samples (see errors of time-series regression models. Many popular soft- Huitema & McKean, 1991). Slightly modified versions of the re- ware packages (such as Minitab, Version 14.0, and SPSS, duced bias variance estimator used in the zH–M test statistic (Huitema & McKean, 2000b) are incorporated in the proposed QH–M test sta- Version 14.0) have implemented at least one of these tests. tistic. Because the modified autocorrelation and error variance esti- Although it has been frequently stated that the L–B test mators have been shown to be quite satisfactory in the zH–M test, we is superior to the B–P test in the context of conventional conjectured that the use of similar estimators in the joint-test context ARIMA models (see, e.g., Bowerman, O’Connell, & Koeh- would lead to small sample performance that is superior to that of ler, 2005; Mills, 1990; Yaffee & McGee; 2000), we have the B–P and L–B tests. recently shown (Huitema & McKean, 2007) that this is not true when these tests are applied to the residuals of certain B–P and L–B Test Statistics The expressions for the B–P and L–B test statistics are: regression models. Indeed, the L–B test has unacceptable K Type I error properties regardless of sample size when it 2 QNrB–P £ l is used in the context of interrupted time-series regression l1 models using design matrices of the form described in and K Huitema and McKean (2000a, 2000b). 1 QNNNlr2 2, The present article introduces and evaluates a portman- L–B £ l l 1 teau test for interrupted time-series regression models. It respectively. These test statistics are distributed approximately as chi was developed to provide more satisfactory small sample square with K p q degrees of freedom when they are applied to properties than the conventional L–B and B–P portman- the residuals of ARMA models fitted to sample data. teau tests. One can see that QB–P is simply N times the sum of the K squared autocorrelation coefficients, but that the expression for QL–B appears METHOD to be considerably more complex. It turns out, however, that QL–B is also N times the sum of K squared autocorrelation coefficients, but the autocorrelation coefficients are now redefined as Proposed Test Statistic QH–M The expression for the proposed test statistic Q is described */12 H–M rNll [(2 )/ ( Nlr )] . below. 2 The logic for the modification is that the conventional formula ª § PN() l 1 ¶¹ for these coefficients contains only N l terms for the the autoco- ««r º NN3() 1 K l ¨ 2 · variance, whereas there are N terms for the variance, regardless of Q ¬ © N ¸» , H–M 2 £ the lag. Hence, as the lag increases, the resulting coefficients are ()N 2 l1 ()Nl 1 increasingly biased toward zero. The modification was designed to where correct for this bias. Unfortunately, there are other sources of bias K is the number of lags included in the test (generally between in both the conventional and modified coefficients when they are N/15 and N/10), computed on the residuals of time-series regression models. The N is the total number of observations in the series, proposed test statistic was designed to reduce the large negative P is the number of parameters in the regression model, bias that is introduced by estimating multiple parameters in these models. rl is the autocorrelation coefficient computed on the residuals at lag-l; that is, The proposed testing procedure involves both a new expression for the test statistic and degrees of freedom that differ from those Nl ee used with the B–P and L–B methods.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    7 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us