Robust Weighted Likelihood Estimation of Exponential Parameters Ejaz S

IEEE TRANSACTIONS ON RELIABILITY, VOL. 54, NO. 3, SEPTEMBER 2005 389 Robust Weighted Likelihood Estimation of Exponential Parameters Ejaz S. Ahmed, Andrei I. Volodin, and Abdulkadir. A. Hussein Abstract—The problem of estimating the parameter of an ex- • sample mean ponential distribution when a proportion of the observations are • -trimmed sample mean outliers is quite important to reliability applications. The method • quadratic risk of a statistics of weighted likelihood is applied to this problem, and a robust estimator of the exponential parameter is proposed. Interestingly, • , Convergence as fast as a, convergence faster the proposed estimator is an -trimmed mean type estimator. than The large-sample robustness properties of the new estimator are examined. Further, a Monte Carlo simulation study is conducted I. INTRODUCTION showing that the proposed estimator is, under a wide range of contaminated exponential models, more efficient than the usual HE EXPONENTIAL distribution is the most commonly maximum likelihood estimator in the sense of having a smaller T used model in reliability & life-testing analysis [1], [2]. risk, a measure combining bias & variability. An application of the We propose a method of weighted likelihood for robust estima- method to a data set on the failure times of throttles is presented. tion of the exponential distribution parameter. We establish that Index Terms—Exponential distribution, maximum likelihood es- the suggested weighted likelihood method provides -trimmed timation, robust estimation, weighted likelihood method. mean type estimators. Further, we investigate the robustness properties of the weighted likelihood estimator (WLE) in com- ACRONYMS1 parison with the usual maximum likelihood estimator (MLE). This can be viewed as an application of the classical likelihood • a.s. almost surely that was introduced by [3] as the relevance weighted likelihood • WLE weighted likelihood estimator to problems of robust estimation of parameters. The weighted • MLE maximum likelihood estimator likelihood method was introduced as a generalization of the local likelihood method. For further discussion of the local like- NOTATION lihood method in the context of nonparametric regression, see • random sample size [4]–[6]. In contrast to the local likelihood, the weighted likeli- • parameter of interest hood method can be global, as demonstrated by one of the ap- • parametric space plications in [7] where the James-Stein estimator is found to • a parametric family of cumulative be a maximum weighted likelihood estimator with weights es- distribution functions timated from the data. • a parametric family of probability The theory of weighted likelihood enables a bias-precision density functions trade of to be made without relying on a Bayesian approach. • the usual maximum likelihood estimator The latter permits the bias-variance trade of to be made in a of parameter conceptually straightforward manner. However, the reliance on • the proposed estimator of the parameter empirical Bayes methods softens the demands for realistic prior • the natural logarithm function modeling in complex problems. The weighted likelihood theory • special weights that we use for the esti- offers a simple alternative to the empirical Bayesian approach mation method for many complex problems. At the same time, it links within • an obstructing parameter a single formal framework a diverse collection of statistical do- • mains, such as weighted least squares, nonparametric regres- the set of obstructing distributions where , and sion, meta-analysis, and shrinkage estimation. The weighted likelihood principle comes with an underlying general theory • , , fixed positive constants that extends Wald’s theory for maximum likelihood estimators, as shown in [8]. Manuscript received August 28, 2003. This research was supported by the We propose applying the method of weighted likelihood to Natural Sciences and Engineering Research Council of Canada (NSERC). As- the problem of the robust estimation of the parameter of an sociate Editor: M. Xie. exponential distribution by assigning zero weights to observa- S. E. Ahmed and A. A. Hussein are with the Department of Mathematics and Statistics, University of Windsor, Windsor, Ontario, N9B 3P4 Canada. tions with small likelihood. Interestingly, the weighted likeli- A. I. Volodinis with the Department of Mathematics and Statistics, University hood method yields -trimmed mean type estimators of the of Regina, Regina, Saskatchewan, S4S 4G2 Canada. parameter of interest. The statistical asymptotic properties of Digital Object Identifier 10.1109/TR.2005.853276 the WLE is developed, and a simulation study is conducted to 1The singular and plural of an acronym are always spelled the same. appraise the behavior of the proposed estimators for moderate 0018-9529/$20.00 © 2005 IEEE 390 IEEE TRANSACTIONS ON RELIABILITY, VOL. 54, NO. 3, SEPTEMBER 2005 sample sizes in comparison with the usual maximum likelihood tion function . That is, is defined by the given small estimator. probability in the equation An analogous idea appears in [9]. However, these authors use a different type of likelihood equation in which they do not weigh the likelihood of each observation, but rather the loga- rithmic derivatives. Hence, the proposed estimation strategy is a completely different approach. From this equation, we obtain . This approximation of is reasonable because in our calculations we II. THE PROPOSED WEIGHTED LIKELIHOOD ESTIMATOR will replace by the limit in probability (even almost surely) Let in general be a parametric family of cu- of this estimator, i.e., by : So, we choose the value of as mulative distribution functions with corresponding probability density functions . Estimators of obtained by maximizing the weighted log- likelihood function Hence, we reject an observation from the sample if . The probability is user-dependent as in [9], and can be considered as a measure of the level of desired robustness. Example: Here we consider a data set from [1] (also used where depends on the sample ; are called by [10]) on the failure times of throttles. The data reports the Weighted Likelihood Estimators (WLE) of . If all the weights failure times of 25 units where time is measured in Kilome- are equal to unity, then the resulting estimator is the usual ters driven prior to failure. According to exponential model fit, maximum likelihood estimator. the estimated rate of failure for this data is .We In this paper, we suggest a special choice of weights ; con- added 5% contamination from a different exponential with rate nected with the theory of robust estimation, and based on the .000 05. That is equivalent to augmenting the data by one ob- maximum likelihood method with rejection of spurious obser- servation. The estimate of after contamination is . vations. Let be the usual maximum likelihood At level of robustness, the rejection criteria is estimator of the parameter . The weight that corresponds to . Thus, only the one extra observation will be Re- the observation is assumed to be 1, if its estimated likelihood moved, and now . is suffciently large, and 0 elsewhere; that is Furthermore, to illustrate that the proposed procedure & the if -trimmed estimators do not necessarily coincide, we notice elsewhere. that a .01-trimming procedure would not reject the contaminant observation, whereas our robust procedure with would Consequently, we delete all improbable observations from the reject it. sample (the choice of is discussed in the next paragraph). Not In the following section, we study the robustness properties of surprisingly, it can be seen that in the case of a uni-modal proba- the & by using contaminated (obstructing) distributions. bility density function we reject only extreme order statistics. However, this may not be the case for multi-modal probability III. ROBUSTNESS OF THE PROPOSED ESTIMATOR density functions. The proposed estimator of the parameter is defined as In general, define a set of obstructing distributions as the solution of the equation (1) where denotes the cumulative distribution function with density function , , and : As it is commonly done where are the remaining observations in the in the classical robustness studies (c.f., [11]), under the assump- sample after the rejection procedure. tion that the sample is taken from the distribution with fixed In the case of the exponential distribution, these estimators one would find the limits in probability of the estimates & are obtained by using in the above equations the appropriate for , and compare their biases. The values of for distribution, and density functions given by, respectively which is less than , indicate the regions in the parameter space where the estimate is to be favored over the estimate . Along the same lines, we shall also compare the quadratic risks of these estimates. The quadratic risk of an es- where , . timator is the expected (average) squared distance between the Regarding the issue of the choice of , we suggest ; estimator, and the parameter being estimated. In other words, where can be chosen as in the selection of the critical constant the quadratic risk is an omnibus measure of the performance of in the criterion of elimination of outliers. Therefore, in the ex- an estimator which takes into consideration the bias & the pre- ponential case, is chosen from the condition of a small proba- cision (variance) of the estimator. bility of rejection of an observation assuming that the observa- Let ; , and be any positive

Robust Weighted Likelihood Estimation of Exponential Parameters Ejaz S

Lecture 12 Robust Estimation

Robustness of Parametric and Nonparametric Tests Under Non-Normality for Two Independent Sample

Robust Methods in Biostatistics

A Guide to Robust Statistical Methods in Neuroscience

Robust Statistical Methods in R Using the WRS2 Package

Robustbase: Basic Robust Statistics

Robust Statistics in BD Facsdiva™ Software

Application of Robust Regression and Bootstrap in Productivity Analysis of GERD Variable in EU27

Robust Statistics

Robust Statistics: a Brief Introduction and Overview

Robust Improper Maximum Likelihood: Tuning, Computation, and a Comparison with Other Methods for Robust Gaussian Clustering

Robust Statistics Part 1: Introduction and Univariate Data General References