V2504377 a Two-Sample Test of Equal Gamma Distribution Scale Parameters with Unknown Common Shape Parameter
Total Page:16
File Type:pdf, Size:1020Kb
TECHNOMETRICS 0, VOL. 25, NO. 4, NOVEMBER 1983 A Two-Sample Test of Equal Gamma Distribution Scale Parameters With Unknown Common Shape Parameter Wei-Kei Shiue Lee J. Bain Dept. of Mathematics and Computer Science Dept. of Mathematics and Statistics University of Nebraska-Omaha University of Missouri-Rolla Omaha, NE 68182-0243 Rolla, MO 65401 An approximate F-test for the equality of two gamma distribution scale parameters, given equal but unknown shape parameters, is proposed and investigated. The test is obtained by replacing the shape parameter by its maximum likelihood estimate. Monte Carlo and asymp- totic results show that in many cases this substitution does not seriously affect the nominal test size. Additionally, these results can be used to modify the test to more closely achieve the desired size. An example from a cloud seeding experiment is given. KEY WORDS: Gamma distribution; Two-sample tests; Tests of equal scale parameters; Approximate tests; Maximum likelihood. 1. INTRODUCTION distribution has been suggested,for example, as the failure time model for a system under continuous The gamma distribution is well known, dating back maintenance, where the reliability may experience at least to Laplace (1836),and it is Type III of Pear- some degradation (or growth) initially but then son’s system of distributions. Early applications were reachesa stable state as time goeson. involved more with its use as a derived distribution The gamma distribution has also receivedconsider- and its relationship to the chi-squared distribution, able attention in the area of weather analysis, and an but it has become increasingly important as a popu- example in this area is discussedlater. lation model, particularly now that its statistical Statistical techniques have been somewhat difficult properties are being developed. to develop for the gamma distribution, partly because Johnson and Kotz (1970) give a good general the parameters are not of the convenient location- review of the gamma distribution, including several scale type. Suppose that G(x; 0, IC)denotes a gamma referencesto applications in diverse fields. The gamma distribution with density model is consideredfrequently in the area of reliability 1 and life testing. This may be partly due to its relation- g(x; 8, K) =- x K-1,-x/s x >o; K, e > 0. ship to the Poisson process,since the waiting time to r(ti)eK ’ kth occurrence of a Poisson processfollows a gamma distribution. More importantly, however, it is a gener- (1) alization of the exponential distribution, and it pro- The mean is p = E(X) = 14. The parameters K and 0 vides a rather flexible skewed density defined over the are referred to as shape and scale parameters,respec- positive range. Its hazard function may be increasing tively. Optimum tests for 8 with K as an unknown or decreasing, but it approaches a constant as time nuisance parameter are derived by Engelhardt and approaches infinity. This represents a useful alter- Bain (1977), based on the conditional distribution of native to the Weibull distribution whose hazard func- X given 2, in which X and 1 denote the arithmetic and tion approaches either infinity or zero. The gamma geometric sample means,respectively. Tests for K with 377 378 WEI-KEI SHIUE AND LEE J. BAIN 0 unknown may be based on rl or, equivalently, on the true significancelevels S = In (X/g). Bain and Engelhardt (1975)provide ap- P(K, a) = Pr [X/p < &2n12)/2nCJ, proximate distributional results for S in the form (6) and show that the dependenceon K is small, although 2?lKCS i- x2(v), (2) the true level may be somewhat different from the where c and v are functions of n and K and x’(v) nominal level for small n. They provide information denotes a chi-squared random variable with v degrees for adjusting the test so that the approximate level will of freedom. For K 2 2, be near the prescribed nominal level. Almost identical procedurescan be applied to the two-sample case. 2nKS i, X*(n - 1). (3) Few other exact inferenceprocedures have beendevel- 3. TWO-SAMPLE TEST oped for the gamma distribution when both param- SupposeX denotesthe mean of a random sample of eters are assumed to be unknown. Grice and Bain size n, from G(x; 0i, K) and y is the mean of an (1980)provide approximate tests or confidence limits independent random sample of size n2 from G(y; eZ, for the mean of a gamma distribution when both K). If K is known, a size a test of H,: 8, = e2 against parameters are assumed to be unknown. Their ap- H,: 0i < 8, is to reject H, if proach is extended in this article to the problem of testing the equality of means (or scale parameters)for x -zr < F,(2rllK, 2n, K), (7) two independent gamma populations with the shape Y parametersassumed unknown but equal. Crow (1977) where F&vi, v2) denotes the lower a percentile of Sne- discussesthis problem for the log-normal and gamma decor’s F distribution. This follows becausewhen H, models for the shapeparameter known case,and con- is true, then siders an application to cloud seeding experiments. Barker (1981) discusses the optimum allocation of ---rvx R/K8, XZ{2rllK)/2rllK (8) sample sizesfor the sameproblem. P - VKo, X2(2n, K)/2rl,K' In the area of life testing, the limiting hazard rate is l/O, so that testing equal scale parameters is equiva- As for the one-samplecase, one expectsgood results lent to testing the equality of limiting hazard rates. if K is replaced by I? in (7), where Iz now denotes the Note that under the assumption of equal shape pa- MLE based on the combined sample data xi, . , x,,, rameters, the test of equal scale parametersis actually y,, ..., yn2. Crow (1977) uses this approach also in a test to determine whether the populations are identi- computing confidencelimits in a cloud seedingexperi- cal. Also, the assumption of equal shape parameters ment. The actual size of the approximate test is given may not always be appropriate and should not be by -- made automatically. For larger K values, the Equation P(K, a) = Pr [X/Y< F,(2n,rZ, 2n, <)I. (9) (3) approximation can easily be used to obtain an F This probability does not depend on 0 since the joint test ofH,: ICY= K~. - -. density of B and X/YIS free of 8, and it dependslittle on K. 2. ONE-SAMPLE TEST Values of P(K, &) have been estimated by Monte The approximate method studied by Grice and Carlo simulation for various values of K, ~1,nl, and n2. Bain (1980) for testing the mean in the one-sample Table 1 presentsthese for values of n, and n2 of 10 and case is briefly described first. Then, the method is 20. Limiting values of extendedto the two-sample case. P(0, a) = lim P(K, a) and P(co, CI)= lim P(K, a) It is well known that K-O K+cc are derived in Section 6 and included in Table 1. The 2nX/9 - X2(2nK), (4) values are helpful for verifying the Monte Carlo re- where x2(v) denotes a chi-squared distribution with v sults and for interpolation purposes outside the range degrees of freedom. When K is known, a uniformly of the Monte Carlo study. Similar results were ob- most powerful size c( test of H,: p 2 p0 against H,: tained for other combinations of n, and n2 and other c( p < p0 is to reject H, if levels. - Table 1 indicates that for fixed ~1,n,, and n2, the 2 < X;(2nK)/2nK. (5) P(K, a) are nearly constant except for small K and small n, and they are close to the limiting value P(co, It is natural to consider replacing K by R in (5) for c().Thus, (7) with K replaced by Iz provides an approxi- the K unknown case, where r?is the maximum likeli- mate test, particularly for moderate sample sizes,with hood estimator of K. Grice and Bain (1980) compute the true level being slightly above the prescribed level. TECHNOMETRICS $2, VOL. 25, NO. 4, NOVEMBER 1983 A TEST OF GAMMA SCALE PARAMETERS 379 Table 1. Values of P(Ic, a) 4. NUMERICAL EXAMPLE Crow (1977)considers hail data from a cloud seed- a ing experiment with n, = 16 and X = 13.366for non- n1 4 K .Ol .05 .lO seededdays and n2 = 17 and jj = 13.249for seeded days. Crow comments that the effect (if any) of seeding 10 10 0 ,017 ,063 .1 15 is expected to be an increase in the scale parameter. .l ,015 ,057 ,107 .2 ,014 ,057 ,107 He also computes the estimates R, = .382 and R, = .5 ,016 ,060 ,111 .466 with asymptotic standard errors of .109 and .131, 1 .o ,016 ,060 .llO respectively. Finding no significant difference he com- 2.0 ,017 ,062 ,113 co .018 ,063 ,113 putes the pooled estimate 12= .4211and computes the approximate 90% confidence interval for p = p2/p1 20 20 0 ,013 ,056 ,107 .l ,012 ,053 ,103 given by .2 ,012 ,053 ,102 ,. .5 .013 ,055 ,106 f F.,,(2n,k, 2n, 12) < p < f F,,,(2n,R, 2n, 2) 1.0 ,013 ,055 ,105 [- 2.0 ,014 ,056 ,107 1 cc ,014 ,056 .107 = (.39, 2.45). (11) 10 20 0 ,015 .060 ,113 .l ,013 ,055 ,105 Crow points out that the interval would presum- .2 ,013 ,055 ,105 ably be even longer if the uncertainty due to esti- .5 ,014 ,058 ,108 mating K were taken into account.