Robust Location Estimators for the X Control Chart

mss # 1500.tex; art. # 06; 43(4) Robust Location Estimators for the X Control Chart MARIT SCHOONHOVEN Institute for Business and Industrial Statistics of the University of Amsterdam (IBIS UvA), Plantage Muidergracht 12, 1018 TV Amsterdam, The Netherlands HAFIZ Z. NAZIR Department of Statistics, University of Sargodha, Sargodha, Pakistan MUHAMMAD RIAZ King Fahad University of Petroleum and Minerals, Dhahran 31261, Saudi Arabia Department of Statistics, Quad-i-Azam University, 45320 Islamabad 44000, Pakistan RONALD J. M. M. DOES IBIS UvA, Plantage Muidergracht 12, 1018 TV, Amsterdam, The Netherlands This article studies estimation methods for the location parameter. We consider several robust location estimators as well as several estimation methods based on a phase I analysis, i.e., the use of a control chart to study a historical dataset retrospectively to identify disturbances. In addition, we propose a new type of phase I analysis. The estimation methods are evaluated in terms of their mean-squared errors and their e↵ect on the X control charts used for real-time process monitoring (phase II). It turns out that the phase I control chart based on the trimmed trimean far outperforms the existing estimation methods. This method has therefore proven to be very suitable for determining X phase II control chart limits. Key Words: Estimation; Mean; Phase I; Phase II; Robust; Shewhart Control Chart; Statistical Process Control; Trimean. Introduction ters, and an optimal performance requires that any change in these parameters should be detected as HE PERFORMANCE of a process depends on the soon as possible. To monitor a process with respect T stability of its location and dispersion parame- to these parameters, Shewhart introduced the idea of control charts in the 1920s. The present paper focuses on phase I location-estimation methods for Ms. Schoonhoven is a Consultant in Statistics at IBIS UvA. constructing the location control chart. She is working on her PhD, focusing on control-charting tech- niques. Her email address is [email protected]. Let Yij, i = 1, 2, 3, . and j = 1, 2, . , n, de- Mr. Nazir is a Lecturer of Statistics at the University of note phase II samples of size n taken in sequence Sargodha. His email address is hafi[email protected]. of the process variable to be monitored. We assume 2 the Yij’s to be independent and N(µ + δσ, σ ) dis- Dr. Riaz is an Assistant Professor in the Department tributed, where δ is a constant. When δ = 0, the of Mathematics and Statistics of King Fahad University of mean of the process is in control; otherwise, the pro- Petroleum and Minerals. His email address is riaz76qau@ n cess mean has changed. Let Y i= (1/n) Yij be yahoo.com. j=1 an estimate of µ + δσ based on the ith sample Yij, P Dr. Does is Professor in Industrial Statistics at the Uni- j = 1, 2, . , n. When the in-control µ and σ are versity of Amsterdam, Managing Director of IBIS UvA, and known, the process mean can be monitored by plot- fellow of ASQ. His email address is [email protected]. ting Y¯i on a control chart with respective upper and Vol. 43, No. 4, October 2011 363 www.asq.org mss # 1500.tex; art. # 06; 43(4) 364 MARIT SCHOONHOVEN ET AL. lower control limits obtained by averaging the conditional RL distribution over all possible values of the parameter esti- UCL = µ + Cnσ/pn, mates. The unconditional p is LCL = µ Cnσ/pn, (1) − p = E(P (F µˆ, σˆ)), (5) i | where C is the factor such that, for a chosen type I n and the unconditional average run length is error probability p, we have 1 ¯ ARL = E . (6) P (LCL Yi UCL) = 1 p. P (F µˆ, σˆ) − ✓ i | ◆ ¯ When Yi falls within the control limits, the process Quesenberry (1993) showed, for the X control chart, is deemed to be in control. We define Ei as the event that the unconditional in-control and out-of-control ¯ that Yi falls beyond the limits, P (Ei) as the probabil- ARL values are higher than in the case where the ity that sample i falls beyond the limits and RL as the process parameters are known. Furthermore, a higher run length, i.e., the number of samples until the first in-control ARL is not necessarily better because the ¯ Yi falls beyond the limits. When µ and σ are known, RL distribution will reflect an increased number of the events Ei are independent, and therefore RL is short RLs as well as an increased number of long RLs. geometrically distributed with parameter p = P (Ei). He concluded that, if limits are to behave like known It follows that the average run length (ARL) is given limits, the number of samples in phase I should be by 1/p and that the standard deviation of the run at least 400/(n 1). length (SDRL) is given by p1 p/p. − − Jensen et al. (2006) conducted a literature survey In practice, the process parameters µ and σ are of the e↵ects of parameter estimation on control- usually unknown. Therefore, they must be estimated chart properties and identified some issues for fu- from samples taken when the process is assumed to ture research. One of their main recommendations be largely in control. This stage in the control chart- is to study robust or alternative estimators for µ and ing process is denoted as phase I (cf., Woodall and σ (e.g., Rocke (1989, 1992), Tatum (1997), Vargas Montgomery (1999), Vining (2009)). The resulting (2003), Davis and Adams (2005)). The e↵ect of us- estimates determine the control limits that are used ing these robust estimators on phase II should also to monitor the location of the process in phase II. De- be assessed (Jensen et al. (2006, p. 360)). These rec- fine µˆ and σˆ as unbiased estimates of µ and σ, respec- ommendations are the subject of the present paper, tively, based on k phase I samples of size n, which are i.e., we will examine alternative location-estimation denoted by Xij, i = 1, 2, . , k and j = 1, 2, . , n. methods as well as the impact of these estimators on The control limits can be estimated by the phase II performance of the X control chart. UCL = µˆ + Cnσˆ/pn, So far the literature has proposed several alternative robust location estimators. Rocke (1989) pro- LCL = µˆ Cnσˆ/pn. (2) d − posed the 25% trimmed mean of the sample means, ¯ Let Fi denote thedevent that Yi is above UCL or the median of the sample means, and the mean of below LCL. We define P (F µˆ, σˆ) as the probability the sample medians. Rocke (1992) gave the practical i | that sample i generates a signal given µˆ and dσˆ, i.e., details for the construction of the charts based on d these estimators. Alloway and Raghavachari (1991) P (F µˆ, σˆ) = P (Y¯ < LCL or Y¯ > UCL µˆ, σˆ). i | i i | constructed a control chart based on the Hodges– (3) Lehmann estimator. Tukey (1997) and Wang et al. Given µˆ and σˆ, the distributiond of thedrun length (2007) developed the trimean estimator, which is de- is geometric with parameter P (F µˆ, σˆ). Conse- i | fined as the weighted average of the median and quently, the conditional ARL is given by the two other quartiles. Finally, Jones-Farmer et al. 1 (2009) proposed a rank-based phase I control chart. E(RL µˆ, σˆ) = . (4) Based on this phase I control chart, they define the | P (Fi µˆ, σˆ) | in-control state of a process and identify an in-control reference sample. The resultant reference sample can In contrast with the conditional RL distribution, be used to estimate the location parameter. the unconditional RL distribution takes into account the random variability introduced into the charting In this article, we compare existing and new meth- procedure through parameter estimation. It can be ods for estimating the in-control µ. The collection of Journal of Quality Technology Vol. 43, No. 4, October 2011 mss # 1500.tex; art. # 06; 43(4) ROBUST LOCATION ESTIMATOR FOR THE X CONTROL CHART 365 methods includes robust sample statistics for loca- We also consider three robust estimators proposed tion and estimation methods based on phase I control earlier by Rocke (1989): the median of the sample charts (cf., Jones-Farmer et al. (2009)). The methods means, are evaluated in terms of their mean-squared errors M(X) = median(X , X , . , X ); (8) (MSE) and their e↵ect on the X phase II control- 1 2 k chart performance. We consider situations where the the mean of the sample medians, phase I data are uncontaminated and normally dis- 1 k tributed, as well as various types of contaminated M= M , (9) k i phase I situations. i=1 X The remainder of the paper is structured as fol- with Mi the median of sample i; and the trimmed lows. First, we present several phase I sample statis- mean of the sample means, tics for the process location and assess the MSE k k↵ of the estimators. Then we describe some existing 1 d e X↵ = X , (10) phase I control charts and present a new algorithm k 2 k↵ ⇥ 2 (v)3 − d e v= k↵ +1 for phase I analysis. Following that, we present the dXe 4 5 design schemes for the X phase II control chart and where ↵ denotes the percentage of samples to be trimmed, z denotes the ceiling function, i.e., the derive the control limits.

Robust Location Estimators for the X Control Chart

M2 Mid-Module Assessment Task

Mgtop 215 Chapter 3 Dr. Ahn

Chap. 4: Summarizing & Describing Numerical Data

Modification of ANOVA with Various Types of Means

Stat 216 Excel/Phstat Homework Assignment #1

Math 366 Minitab Homework Assignment # 1 Due 28 February 2002

Rapid Assessment Method Development Update : Number 1

Summarizing Data Major Properties of Numerical Data

Mgtop 215 Chapter 3 Dr. Ahn

Descriptive Statistics .Pdf

Visualizing Multivariate Data

Mathematical Statistics