Maximum Multivariate Exponentially Weighted Moving Average And
Scientia Iranica E (2017) 24(5), 2605{2622
Sharif University of Technology Scientia Iranica Transactions E: Industrial Engineering www.scientiairanica.com
Maximum multivariate exponentially weighted moving average and maximum multivariate cumulative sum control charts for simultaneous monitoring of mean and variability of multivariate multiple linear regression pro les
R. Ghashghaei and A. Amiri
Department of Industrial Engineering, Faculty of Engineering, Shahed University, Tehran, Iran.
Received 24 November 2015; received in revised form 15 March 2016; accepted 13 August 2016
KEYWORDS Abstract. In some applications, quality of product or performance of a process is described by some functional relationships among some variables known as multivariate Multivariate multiple linear pro le in the literature. In this paper, we propose Max-MEWMA and Max- linear regression MCUSUM control charts for simultaneous monitoring of mean vector and covariance pro les; matrix in multivariate multiple linear regression pro les in Phase II. The proposed control Simultaneous charts also have the ability to diagnose whether the location or variation of the process monitoring; is responsible for out-of-control signal. The performance of the proposed control charts is Phase II; compared with that of the existing method through Monte-Carlo simulations. Finally, the Diagnosis aids. applicability of the proposed control charts is illustrated using a real case of calibration application in the automotive industry. © 2017 Sharif University of Technology. All rights reserved.
1. Introduction Woodall et al. [3], Wang and Tsung [4], Woodall [5], Zou et al. [6], and Amiri et al. [7]. A pro le monitoring Sometimes, quality of a process or product in the indus- problem consists of two phases, including Phase I try and non-industry is characterized by a relationship and Phase II. The purpose of Phase I monitoring between two or more variables, which is presented by is providing an analysis of the preliminary data for a `pro le' in the literature. In di erent applications, estimating the process parameters. The main purpose simple linear regression, multiple linear or polynomial of Phase II pro le monitoring is designing a control regression, or even more complicated models such as scheme to detect di erent out-of-control scenarios in nonlinear regressions are used to model the quality of the process parameters. Monitoring of di erent types processes. The applications of pro le monitoring in of linear regression pro les, such as simple linear, the literature have been introduced by several authors multiple linear, and polynomial regression pro les, has such as Kang and Albin [1], Mahmoud and Woodall [2], been investigated by many researchers. Kang and Albin [1], Kim et al. [8], Zou et al. [6], Keshtelia et al. [9], Noorossana and Ayoubi [10], and Chuang et *. Corresponding author. Tel.: +98 21 51212023; E-mail addresses: [email protected] (R. Ghashghaei); al. [11] have studied Phase II monitoring of simple [email protected] (A. Amiri) linear pro les. Researchers such as Kang and Albin [1], Mahmoud and Woodall [2], Mahmoud et al. [12], and doi: 10.24200/sci.2017.4385 Chuang et al. [11] have considered monitoring of simple 2606 R. Ghashghaei and A. Amiri/Scientia Iranica, Transactions E: Industrial Engineering 24 (2017) 2605{2622 linear pro les in Phase I. Multiple linear regression process parameters. Cheng and Thaga [31] proposed and polynomial pro les have been studied by Zou et Max-MCUSUM control chart to detect any changes al. [6], Kazemzadeh et al. [13,14], Amiri et al. [15], and in mean vector and covariance matrix simultaneously. Wang [16]. Zhang et al. [32] proposed a new control chart (ELR In some situations, quality of a process or a control chart) based on the combination of Generalized product can be e ectively characterized by two or more Likelihood Ratio (GLR) test and the Exponentially multiple linear regression pro les in which response Weighted Moving Average (EWMA) control chart for variables are correlated, referred to as multivariate simultaneous monitoring of mean vector and covari- multiple linear regression pro les [17]. Noorossana et ance matrix in the multivariate process. Wang et al. [18] proposed three control chart schemes for Phase al. [33] applied the generalized likelihood ratio test II monitoring of multivariate simple linear pro les. and the multivariate exponentially weighted moving They also used the term Average Run Length (ARL) covariance control chart to monitor the mean vector to evaluate the proposed control charts. Noorossana et and the covariance matrix of a multivariate normal al. [19] developed four control charts in Phase I for process, simultaneously. Other research for simulta- monitoring of multivariate multiple linear regression neous monitoring of multivariate process consists of pro les. They compared the performance of the Khoo [34], Hawkins and Maboudou-Tchao [35], Ramos proposed control charts through simulation studies in et al. [36], and Pirhooshyaran and Niaki [37]. For terms of probability of a signal. Zhang et al. [20] detailed information on simultaneous monitoring of the developed a new modelling and monitoring framework process location and dispersion, refer to the review for analysis of multiple linear pro les in Phase I. Their paper provided by McCracken and Chakraborti [38]. framework used the regression-adjustment method in According to the literature, in most of the control the functional principal component analysis. Eyvazian charts proposed for monitoring of multivariate multiple et al. [21] suggested four control charts to monitor mul- linear regression pro les, mean vector and covariance tivariate multiple linear regression pro les in Phase II. matrix are monitored with separate control charts. In They evaluated the performance of the proposed con- addition, these control charts are unable to diagnose trol charts through simulation studies in terms of the whether the location or the dispersion is responsible ARL criterion. They also used a numerical example for out-of-control signal. In this paper, we propose to assess the performance of the developed control two methods for simultaneous monitoring of mean charts. Amiri et al. [17] introduced a diagnosis method vector and covariance matrix in multivariate multiple to identify the pro les and parameters responsible linear regression pro les. Moreover, the proposed for out-of-control signal in multivariate multiple linear control charts are able to diagnose whether location or regression pro les in Phase II. dispersion is responsible for out-of-control signal. The In addition to the problems noted above, prac- performance of the proposed control charts is compared titioners are interested in developing some kinds of with ELRT control chart proposed by Eyvazian et control charts which can monitor mean and variabil- al. [21] in Phase II in terms of Average Run Length ity of processes simultaneously. In fact, the quality (ARL) and Standard Deviation Run Length (SDRL). engineers want to have a single control chart instead The structure of the rest of this paper is as follows: of two or more. In practice, the control charts for In Section 2, we express multivariate multiple linear monitoring both process mean and variability should regression pro les model. In Section 3, the ELRT be implemented together because assignable causes can control chart is described. In Section 4, the proposed a ect both of them. In recent years, joint monitoring control charts are expressed. In Section 5, diagnosing of process mean and dispersion in both univariate procedure is explained. In Section 6, the comparison and multivariate cases has been considered by some analysis of the proposed control charts is provided researchers. Simultaneous monitoring of the mean and through an illustrative example. In Section 7, the variance in univariate process has been studied by application of the proposed control charts is illustrated Khoo et al. [22], Zhang et al. [23], Guh [24], Memar by a real dataset. Finally, our concluding remarks and and Niaki [25], Teh et al. [26], Sheu et al. [27], Haq et future research are given in Section 8. al. [28], and Prajapati and Singh [29]. The simultane- ous monitoring of multivariate process mean vector and 2. Model and Assumptions covariance matrix has also been addressed by several authors. Chen et al. [30] proposed a control scheme Let us assume that for the kth random sample col- called Max-EWMA for monitoring both mean and lected over time, we have n observations given as variance simultaneously. They performed a comparison (x1i; x2i; : : : ; xqi; y1ik; y2ik; : : : ; ypik), i = 1; 2; : : : ; n analysis and found that their proposed control chart where p and q are the numbers of response variables performed better than the combination of chi-square and explanatory variables, respectively. When the and jSj control charts when small shifts occurred in the process is in statistical control, the model that relates R. Ghashghaei and A. Amiri/Scientia Iranica, Transactions E: Industrial Engineering 24 (2017) 2605{2622 2607
the response variables with explanatory variables is a in which is covariance matrix of error terms, ECk multivariate multiple linear regression model and is and ESk are corresponding exponentially weighted given as follows [21]: moving average statistics given by:
Yk = XB + Ek; (1) ECk = Ck + (1 ) ECk 1; (6) or equivalently: ESk = Sk + (1 ) ESk 1; (7) 2 3 y y : : : y Pn 11k 12k 1pk where C is equal to (y x B) 1(y x B)T , 6y y : : : y 7 k ik i ik i 6 21k 22k 1pk 7 i=1 6 . . . . 7 in which (y x B) is the ith row of matrix (Y XB). 4 . . .. . 5 ik i k Sk in Eq. (7) is computed as: yn1k yn2k : : : ynpk 2 3 2 3 ^ T ^ (Yk X(EBk)) (Yk X(EBk)) 1 x11 : : : x1q 01 02 : : : 0p Sk = ; (8) 6 7 6 7 n 61 x21 : : : x2q 7 6 11 12 : : : 1p7 = 6. . .. . 7 6 . . .. . 7 ^ 4. . . . 5 4 . . . . 5 where EBk is exponentially weighted moving average ^ 1 xn1 : : : xnq q1 q1 : : : qp statistic of Bk given by: 2 3 ^ ^ ^ "11k "12k :::"1pk EBk = Bk + (1 ) EBk 1: (9) 6 7 6"21k "22k :::"1pk 7 + 6 . . . . 7 ; B^ in Eq. (9) is the MLE of B for the kth sample, 4 . . .. . 5 K . . . (2) which is same as the OLS estimator given in Eq. (4). "n1k "n2k :::"npk where Yk is an n p matrix of response variables 4. Proposed control charts for the kth sample, X is an n (q + 1) matrix of In this section, we propose two control charts including explanatory variables, B is a (q + 1) p matrix of Max-MEWMA and Max-MCUSUM to monitor mean regression parameters, and E is an n p matrix of k vector and covariance matrix simultaneously in multi- error terms. Note that the vector of error terms follows variate multiple linear regression pro les. a p-variate normal distribution with mean vector 0 and p p covariance matrix as [21]: 4.1. Max-MEWMA control chart ^ 0 1 Matrix BK is rewritten as a 1 ((q + 1)p) multivariate 11 12 : : : 1p normal random vector denoted by ^ : B C k B21 22 : : : 2pC = ; (3) B . . .. . C ^ ^ ^ ^ ^ ^ ^ @ . . . . A k =( 01k; 11k; :::; q1k; 02k; 12k; :::; q2k; : : : p1 p2 pp ^ ^ ^ :::; 0pk; 1pk; :::; qpk); (10) where hj denotes the covariance between error vector terms of the hth and jth response variables at each when the process is in-control, the expected value and ^ observation. For the kth random sample, the Ordinary covariance matrix for k are given as follows [21]: Least Square (OLS) estimator of matrix B is given by: ^ E( k) = ( 01; 11; :::; q1; 02; 12; ::::; q2; ^ T 1 T Bk = (Xk Xk) Xk Yk: (4) :::; 0p; 1p; :::; qp); (11) 3. Existing work: ELRT control charts for 0 1 ::: simultaneous monitoring of mean vector 11 12 1p B ::: C and covariance matrix in Phase II B 21 22 2pC ^ = B . . . . C ; (12) k @ . . .. . A In this section, we give a brief review of the proposed ::: ELRT control chart by Eyvazian et al. [21]. The p1 p2 pp likelihood ratio statistic is given as: where ( 01; 11; : : : ; q1; 02; 12; : : : ; q2; : : : ; 0p; 1p; : : : ; qp) is denoted by . Eyvazian et al. [21] have ELRTk = n log jj n log jESkj + ECk np; (5) shown that hj is a (q + 1) (q + 1) matrix equal to T 1 [X X] hj where hj denotes the hjth element of k = 1; 2;:::; the covariance matrix in Eq. (3). 2608 R. Ghashghaei and A. Amiri/Scientia Iranica, Transactions E: Industrial Engineering 24 (2017) 2605{2622
In this method, we extend the proposed method control chart as: by Chen et al. [30] for simultaneous monitoring of mean vector and covariance matrix of the regression Mk = max fjCkj ; jSkjg k = 1; 2; ::: (21) parameters estimators in the multivariate multiple linear regression pro les. De ne: Since Mk is the maximum jCkj and jSkj, which are based on two Multivariate Exponentially Weighted ^ Moving Average (MEWMA) statistics, it is natural to zk = ( k ) + (1 ) zk 1; k = 1; 2; :::; (13) name the control chart, based on Mk, Max{MEWMA control chart Chen et al.[30]. A large value of Mk where z0 is the starting point and it is equal to zero means that the process mean vector and/or covariance vector. is the smoothing parameter satisfying 0 matrix has shifted away from and , respectively. 1. We have: Because Mk is non-negative, the initial state of the Max{MEWMA control chart is based only on an upper = [1 (1 )2k] ; (14) zk ^ control limit (h). If M > h, the control chart triggers 2 k k an out-of-control alarm, where h > 0 is chosen to (2 ) achieve a speci ed in-control ARL. T = zT 1 z 2 : (15) k [1 (1 )2k] k ^ k (q+1)p;2 4.2. Max-MCUSUM control chart In Eq. (15), (q +1)p and 2 are respectively the degrees In this section we explain the structure of Max{ of freedom and the non-centrality parameter of the non- MCUSUM control chart to use it for monitoring mul- central chi-square distribution with: tivariate multiple linear regression pro les in Phase II. The cumulative sum (CUSUM) procedure for monitor- 2 2k 0 = (2 )= 1 (1 ) ( b g) ing mean vector of multivariate quality characteristics signals when the value of Sk becomes greater than L. 1 2 ( b g) (q+1)p;2 ; (16) fb(xk) where is good mean (when the process is in-control) Sk = max 0;Sk 1 + log > L; (22) g fg(xk) and b is bad mean (when the process is out-of- control). We de ne the statistic for monitoring the where fg and fb are probability density functions cor- process mean vector as: responding to quality characteristics under in-control and out-of-control conditions, respectively, and L is (2 ) C = 1 H zT 1z ; a constant that determines the Upper Control Limit k (q+1)p [1 (1 )2k] k ^ k (17) (UCL) of the statistic in Eq. (22). In this section, we extend the method proposed by Cheng and Thaga [31]. where H (.) is the chi-square distribution function (q+1)p We assume that k comes from a multivariate with (q + 1)p degrees of freedom, (.) is the standard normal distribution with either a good mean, g, for normal cumulative distribution function, and 1 is in-control process or bad mean, b ( b= g + ), for the inverse of (:). out-of-control process. Note that the covariance matrix For monitoring process variability, we de ne: of error terms, , is assumed known. For a multivariate normal distribution, the CUSUM chart is developed Xn 1 T through the likelihood ratio given as: Wk = (yik xiB) (yik xiB) ; (18) i=1 f (x ) b k = such that Wk is the chi-square distribution with np fg(xk) degrees of freedom. 1=2 1 p=2 1 ^ 1 ^ 0 gk = (1 )gk 1 + fHnp(Wk)g ; (19) (2) exp( 0:5( ) ( )) ^ k b ^ k b k k : (23) 1=2 where g0 is the starting point and it is equal to zero. (2) p=2 1 exp( 0:5( ^ ) 1( ^ )0) ^ k g ^ k g is the smoothing parameter, 0 1. We have k k E(g ) = 0 and V ar(g ) = [1 (1 )2k]. k k 2 Taking natural logarithms (see Appendix A), we ob- The statistic for monitoring the process variability tain: is de ned as: s f (x ) log b k =( ) 1 ^0 0:5( + ) 2 b g ^ k b g S = g : (20) fg(xk) k k (1 (1 )2k) k 1( )0: (24) ^ b g Combining Ck and Sk de nes a statistic for a single k R. Ghashghaei and A. Amiri/Scientia Iranica, Transactions E: Industrial Engineering 24 (2017) 2605{2622 2609
Replacing Eq. (24) into Eq. (22), CUSUM statistic for The CUSUM control chart for detecting a shift in the the multivariate normal process is obtained. Note that covariance matrix of a multivariate multiple regression both sides of Eq. (24) are divided by a constant value. pro le is written as: CUSUM statistic for monitoring the multivariate mul- ^ 1 ^ 0 Sk = max (Sk 1 +( ) ( ) ; 0); tiple linear regression pro le is given as follows: k g ^ k g k (33) ^ Sk = max(Sk 1 + a k w; 0); (25) b = log(b) : (34) where: b 1 ( ) 1 b g ^ To design a single multivariate CUSUM control chart a = q k ; (26) 1 0 for simultaneous monitoring of mean vector and co- ( b g) ^ ( b g) k variance matrix in the multivariate multiple linear regression pro les in Phase II, we use the following and: transformation: ( ) 1( )0 h n oi b g ^ b g 1 1 0 q k Y = H ( ^ ) ( ^ ) ; (35) w = 0:5 : (27) k (q+1)p k g ^ k g 1 0 k ( b g) ^ ( b g) k where H(q+1)p() is the chi-square distribution function De ne the non-centrality parameter as: with (q + 1)p degrees of freedom, () is the standard q normal cumulative distribution function, and 1 is D = ( ) 1( )0; (28) the inverse of (). b g ^ b g k For monitoring the process variability, we de ne: and: Vk = max (0;Yk + Vk 1): (36) ^ 0 Zk = a( k g) : (29) Combining Uk and Vk de nes a statistic for multivari- The CUSUM control chart for detecting a shift in ate single control chart as: the multivariate multiple pro les regression coecients vector is written as: Mk = max (Uk;Vk): (37)
Uk = max (0;Uk 1 + Zk 0:5D): (30) The proposed control chart is called Maximum Mul- tivariate CUSUM (Max-MCUSUM) control chart, be- By using the likelihood ratio test technique above cause the maximum CUSUM statistic is applied. Since and assuming the good state and bad state, Healy [39] Mk is non-negative, the initial state of the Max- developed a CUSUM chart for the process standard MCUSUM control chart is based only on an upper deviation. When the process is in a good state, ^ control limit (L). If Mk > L, the control chart triggers is distributed as multivariate normal with good mean, an out-of-control alarm, where L > 0 is chosen to g and covariance matrix, ^. In the case of shift in achieve a speci ed in-control ARL. variability, we assume that the covariance matrix shifts to b for b > 0 and mean vector does not change. ^ 5. Diagnosing procedure The likelihood ratio for process standard devia- tion is given in Eq. (31) as shown in Box I. Taking In a diagnosing procedure for Max-MEWMA control natural logarithms (see Appendix B), we obtain: chart, the following algorithm is proposed to determine the source and the direction of the shift: fb(xk) 1 ^ log = log b + 0:5( k g) fg(xk) 2 - Case 1: Mk = jCkj > UCL and jSkj UCL. It indicates that only the process mean experiences a 1 shift. If C > 0, the shift is increasing and it is 1( ^ )0(1 ): (32) k ^ k g k b decreasing if Ck < 0,