Fiducial Generalized Confidence Intervals

Jan HANNIG,HariIYER, and Paul PATTERSON

Generalized pivotal quantities (GPQs) and generalized confidence intervals (GCIs) have proven to be useful tools for making inferences in many practical problems. Although GCIs are not guaranteed to have exact frequentist coverage, a number of published and unpublished simulation studies suggest that the coverage probabilities of such intervals are sufficiently close to their nominal value so as to be useful in practice. In this article we single out a subclass of generalized pivotal quantities, which we call fiducial generalized pivotal quantities (FGPQs), and show that under some mild conditions, GCIs constructed using FGPQs have correct frequentist coverage, at least asymptoti- cally. We describe three general approaches for constructing FGPQs—a recipe based on invertible pivotal relationships, and two extensions of it—and demonstrate their usefulness by deriving some previously unknown GPQs and GCIs. It is fair to say that nearly every published GCI can be obtained using one of these recipes. As an interesting byproduct of our investigations, we note that the subfamily of fiducial generalized pivots has a close connection with fiducial inference proposed by R. A. Fisher. This is why we refer to the proposed generalized pivots as fiducial generalized pivotal quantities. We demonstrate these concepts using several examples. KEY WORDS: Asymptotic properties; Common problem; Conditional inference; ; Generalized pivot; Structural inference; Structural method.

1. INTRODUCTION Fisher (1935) gave a justification based on the fiducial argu- ment. Weerahandi (1993) derived a GPQ for µ − µ and re- Tsui and Weerahandi (1989) introduced the concept of 1 2 marked that the resulting interval coincided with the fiducial generalized p values and generalized test variables, which solution. are useful for developing hypothesis tests in situations where In this article we identify an important subclass of GPQs, traditional frequentist approaches do not provide useful solu- which we call fiducial generalized pivotal quantities (FGPQs) tions. Subsequently, Weerahandi (1993) introduced the concept for reasons that we discuss shortly. We also provide some gen- of a generalized pivotal quantity (GPQ) for a scalar parame- eral methods for constructing FGPQs for large classes of prob- ter θ, which can be used to construct an interval estimator lems. Nearly every published GCI can be obtained using these for θ in situations where standard pivotal quantity-based ap- methods. More important, and perhaps of greater interest to proaches may not be applicable. He referred to such intervals practitioners, we also show that, under reasonable assumptions, as generalized confidence intervals (GCIs). Since then, several GCIs based on FGPQs have asymptotically correct frequen- GCIs have been constructed in many practical problems (see, tist coverage. This result provides a frequentist justification for e.g., Weerahandi 1995; Chang and Huang 2000; Hamada and GCIs (and also for generalized tests, although our focus here is Weerahandi 2000; McNally, Iyer, and Mathew 2001; Burdick confidence intervals) when the GPQ is chosen appropriately. In and Park 2003; Krishnamoorthy and Lu 2003; Krishnamoorthy addition, we provide a number of examples to illustrate these and Mathew 2003; Iyer, Wang, and Mathew 2004; Mathew and results. Krishnamoorthy 2004; Weerahandi 2004; Burdick, Borror, and The reason that we chose the term “FGPQ” is that GCIs Montgomery 2005; Burdick, Park, Montgomery, and Borror based on FGPQs are in fact obtainable using the fiducial ar- 2005; Daniels, Burdick, and Quiroz 2005; Roy and Matthew gument of Fisher (1935) within a suitably chosen framework, 2005). These intervals do not always have exact frequentist cov- such as the pivotal quantity approach of Barnard (1977, 1981, erage. Nevertheless, results of simulation studies reported in the 1982, 1995), the structural inference of Fraser (1966, 1968), literature appear to support the claim that coverage probabilities and the functional model basis for fiducial inference discussed of GCIs are sufficiently close to their stated value so that they by Dawid and Stone (1982). We establish the connection be- are in fact useful procedures in practical problems. Despite the tween FGPQs and fiducial distributions by showing that, given large number of successful applications of GCIs reported in the a fiducial distribution for a , there is a systematic pro- literature, it is surprising that there are no published theoretical cedure for constructing a FGPQ whose distribution is the same results discussing either small-sample properties or asymptotic as the fiducial distribution. As a byproduct, this connection of behavior of GCIs. FGPQs with fiducial inference leads to a frequentist justifica- A simple test case for the application of GCIs is the Gaussian tion for fiducial inference in many settings. In fact, FGPQs pro- two-sample problem with heterogeneous , where one vide a natural framework for associating a distribution with a is interested in a confidence interval for the difference µ1 − µ2 parameter. between the two . This is the well-known BehrensÐFisher The article is organized as follows. In the next section we problem for which Behrens (1929) proposed a solution and give a brief introduction to GPQs and GCIs and also introduce the subclass of FGPQs. In Section 3 we prove a theorem to the effect that under fairly general conditions, GCIs obtained Jan Hannig is Assistant Professor, Department of , Colorado State from FGPQs have correct asymptotic coverage. Some familiar University, Fort Collins, CO 80523 (E-mail: [email protected]). Hari Iyer is Professor, Department of Statistics, Colorado State University, Fort examples are considered, and the application of the theorem is Collins, CO 80523 (E-mail: [email protected]). Paul Patterson is Gradu- illustrated. We also discuss an example where the conditions of ate Student, Department of Statistics, Colorado State University, Fort Collins, the theorem are not satisfied. CO 80523, and Mathematical , USFS-Rocky Mountain Research Station (E-mail: [email protected]). The authors thank the reviewers and the associate editor for their constructive comments and valuable suggestions. In the Public Domain Hannig’s research was supported in part by National Science Foundation grant Journal of the American Statistical Association DMS-05-04737, and Iyer’s research was partially supported by National Park March 2006, Vol. 101, No. 473, Theory and Methods Service contract H2380040002 T.O. 04-51. DOI 10.1198/016214505000000736

254 Hannig, Iyer, and Patterson: Fiducial Generalized Confidence Intervals 255

Section 4 is devoted to some general methods for construct- Property (GPQ2) implies that Rθ (s, s,ξ)= f (s,θ) for some ing FGPQs. First, we describe a recipe for constructing FGPQs function f . It turns out that the subclass of GPQs for which and discuss the scope of application of this recipe. This is a re- f (s,θ) is a function of θ only, say f (s,θ)= f (θ), have a spe- formulation, using the notation of this article, of the recipe for cial connection with fiducial inference. Generalized confidence constructing FGPQs given by Iyer and Patterson (2002). The regions obtained using such GPQs are not guaranteed to be in- procedure is illustrated with some examples of GPQs not pre- tervals unless the function f (θ) is invertible. In this case one viously discussed in the literature. We show that these GPQs may assume that f (θ) is identically equal to θ without loss in (actually FGPQs) satisfy the conditions of the main theorem, generality. Such GPQs exist in practically every application that so that the resulting GCIs are guaranteed to have the correct we have considered. This leads us to single out the following subclass of GPQs. frequentist coverage asymptotically. In Section 5 we introduce  two additional methods for constructing FGPQs that extend the Definition 2. AGPQRθ (S, S ,ξ)for a parameter θ is called of problems for which GCIs can be developed. We il- a fiducial generalized pivotal quantity (FGPQ) if it satisfies the lustrate the application of these methods with new confidence following two conditions: intervals for some well-known problems.  (FGPQ1) The conditional distribution of Rθ (S, S ,ξ), con- In Section 6 we discuss connections between GPQs and ditional on S = s,isfreeofξ. k fiducial inference. We also touch on nonuniqueness issues as- (FGPQ2) For every allowable s ∈ R , Rθ (s, s,ξ)= θ. sociated with GCIs and fiducial intervals. We provide some Remark 1. Condition (FGPQ1) of Definition 2 is the same concluding remarks in Section 7. as condition (GPQ1) of Definition 1, but condition (FGPQ2) 2. GENERALIZED PIVOTAL QUANTITIES AND is a stronger version of condition (GPQ2) in the definition of GENERALIZED CONFIDENCE INTERVALS aGPQ. In Section 6 we show that if R is a FGPQ for θ, then fre- 2.1 Generalized Pivotal Quantities θ quentist probability intervals associated with the distribution The definition that we present for a GPQs is superficially dif- of Rθ have a corresponding interpretation as fiducial proba- ferent from Weerahandi’s (1993) definition but is identical in bility intervals associated with the parameter θ. It is for this spirit. Our definition enables us to more clearly explain the con- reason that we use the term FGPQ to describe members of this nection between ordinary pivotal quantities and GPQs and also important subclass. facilitates a mathematically rigorous discussion of the asymp- 2.2 Some Well-Known Generalized Pivotal Quantities totic behavior of the resulting GCIs. Example 1: The BehrensÐFisher problem. Consider m iid S ∈ Rk = 2 Definition 1. Let denote an observable random vector observations Xi, i 1,...,m, from N(µX,σX) and n iid ob- = 2 whose distribution is indexed by a (possibly vector) parameter servations Yj, j 1,...,n, from N(µY ,σY ), where µX,µY ,σX, p ξ ∈ R . Suppose that we are interested in making inferences and σY are unknown . The problem is to obtain con- q  ¯ ¯ about θ = π(ξ) ∈ R (q ≥ 1). Let S represent an independent fidence bounds for the difference θ = µX − µY .LetX and Y   2 2 copy of S.Weuses and s to denote realized values of S and S . denote the sample means and let SX and SY denote the sample  ¯ ∼ 2 ¯ ∼ AGPQforθ, denoted by Rθ (S, S ,ξ) (or simply Rθ or R variances for the two samples. Then X N(µX,σX/m), Y  2 − 2 2 ∼ 2 − − 2 2 ∼ when there is no ambiguity) is a function of (S, S ,ξ) with the N(µY ,σY /n), (m 1)SX/σX χ (m 1), and (n 1)SY /σY 2 − S = ¯ ¯ 2 2 following properties: χ (n 1). The (X, Y, SX, SY ) is complete and suf- ficient for ξ = (µ ,µ ,σ ,σ ). R S S X Y X Y (GPQ1) The conditional distribution of θ ( , ,ξ), condi- A nontrivial exact confidence interval, in the frequentist S = tional on s, is free of ξ. sense, is unavailable for this problem (see, e.g., Linnik 1968). k (GPQ2) For every allowable s ∈ R , Rθ (s, s,ξ) depends A solution to this problem was put forward by Behrens (1929); on ξ only through θ. later, Fisher (1935) showed that this solution could be derived very simply using the fiducial argument. The fiducial distri- Note that (GPQ2) uses s in both the first and the second bution of µ − µ is known as the BehrensÐFisher distribu- argument positions of R (·, ·,ξ). This is an important aspect X Y θ tion. Critical values for this distribution have been tabulated by of the definition of GPQ and explains both its similarity to R S S Sukhatme (1958). The quantiles of this distribution lead to the and difference from an ordinary pivotal quantity. θ ( , ,ξ) lower and upper confidence bounds for µ − µ . S S X Y would indeed be an ordinary pivotal quantity based on ( , ) Many other approximate confidence interval procedures if in condition (GPQ2) we instead had the requirement that for this problem have been discussed in the literature (see, R  θ (s, s ,ξ)depend on ξ only through θ. The point is that we do e.g., Welch 1947; Satterthwaite 1942, 1946; Cochran 1964;  not intend to actually observe a realization of S and thus are not Graybill and Wang 1980). Weerahandi (1993) derived a GCI S S seeking a pivotal quantity for θ based on both and .How- for µX − µY and noted that it is the same as the BehrensÐFisher ever, heuristic reasoning suggests that under appropriate condi- solution. He derived the GCI for θ = µX − µY by starting with S S ¯ − ¯ 2 2 tions, and will be sufficiently close to each other, and hence the statistics X Y, SX, and SY . He justified this based on in- we can substitute S in both argument positions in Rθ (·, ·,ξ)and arguments. The GPQ proposed by Weerahandi is use the distribution from condition (GPQ1) to make approxi- R = R(S, S,ξ) mate confidence statements about the quantity Rθ (s, s,ξ), and hence about θ. Theorem 1 in Section 3 confirms that our heuris- σ 2S2 /(mS2) + σ 2S2 /(nS2) 1/2 = (X¯  − Y¯  − θ) X X X Y Y Y , tic reasoning is indeed valid. 2 + 2 σX/m σY /n 256 Journal of the American Statistical Association, March 2006

¯  ¯  2 2 ¯ ¯ − S S where (X , Y , SX , SY ) is an independent copy of (X, Y, the quantity θ Rθ ( , ,ξ)is automatically a generalized test 2 2 R S S R S SX, SY ). Although ( , ,ξ)is not itself a FGPQ for θ, ( , variable (Tsui and Weerahandi 1989) for testing the null hy-  ¯ ¯  S ,ξ)= (X − Y) − R(S, S ,ξ) is a FGPQ for θ. Moreover, it pothesis H0 : θ ≤ θ0 versus the alternative θ>θ0, where θ0 is a leads to the same confidence interval for θ as does the FGPQ user-specified value. Results that we prove in this article con- Rθ defined by cerning coverage properties of GCIs can be restated so that they become statements about type I error rates of general- S2 S2 R = (X¯ − Y¯ ) − (X¯  − µ ) X − (Y¯  − µ ) Y . ized tests or about the correctness of generalized p values. We θ X 2 Y 2 SX SY do not elaborate further on this here, but a detailed treatment of generalized tests will be the subject of a forthcoming arti- This latter FGPQ can be derived by a direct application of The- cle. orem 2 of Section 4. Example 2: Balanced one-way random-effects model. Con- 3. MAIN RESULT: FREQUENTIST JUSTIFICATION sider the one-way random-effects model Xij = µ + Ai + eij, FOR GENERALIZED CONFIDENCE INTERVALS i = ,...,a j = ,...,n A ∼ ( ,σ2) e ∼ ( ,σ2) 1 , 1 , where i N 0 A , ij N 0 e , As mentioned earlier, the literature abounds with examples and {A } and {e } are all mutually independent. Define i ij of GCIs, but no theoretical results exist that guarantee satis- n factory performance of these intervals. The only evidence of ¯ 1 Xi· = Xij, their acceptability for practical use is through simulation stud- n j=1 ies suggesting that almost all reported GCI methods appear to a have coverage probabilities close to their stated values. 1 X¯ ·· = X¯ , In the next section we state and prove a theorem that appears a i i=1 to be the first result that guarantees, under some fairly mild con- a ¯ ¯ 2 ditions, the asymptotic correctness of the coverage probability n = (X · − X··) S2 = i 1 i , of a GCI. Although we do not discuss generalized tests here, B a − 1 it is worth noting that, as a corollary to the theorem, one can and also establish asymptotic correctness of type I error rates for a n − ¯ 2 generalized tests of hypotheses. i=1 j=1(Xij Xi·) S2 = . Let us consider a parametric statistical problem where we W a(n − 1) observe X1,...,Xn, whose joint distribution belongs to some 2 2 ∈ ⊂ Rp Thus SB is the between-groups mean square and SW is the family of distributions parameterized by ξ .Let within-groups mean square. S = (S1,...,Sk) denote a statistic based on the Xi’s. In theory,   Suppose that one is interested in a confidence interval for we can consider an independent copy of X1,...,Xn and denote = 2  S θ σA . Approximate confidence interval procedures, such as the statistic based on the Xi ’s by . Finally, suppose that a  the TukeyÐWilliams procedure (Tukey 1951; Williams 1962), function Rθ (S, S ,ξ) is available that is a FGPQ for a scalar have been proposed for this problem. Weerahandi (1993, parameter θ = π(ξ). We first consider some notation and as- 2 p. 904) pointed out that a confidence bound for σA may be sumptions. obtained by inverting a generalized test (see Weerahandi 1991, p. 152) of the null hypothesis H : σ 2 = δ versus a one-sided Assumption A. 0 A ∈ Rk alternative. It is easily verified that this process results in a con- 1. Assume that there exists t(ξ) such that 2 fidence statement of the form Rβ <σ < ∞, where Rβ is the √ D A  −  − → = β- of the conditional distribution of the quantity R n S1 t1(ξ), . . . , Sk tk(ξ) N (N1,...,Nk), given by where N has a nondegenerate multivariate normal distrib- S2 S2 ution. R = B (σ 2 + nσ 2) − W σ 2, nS2 e A nS2 e 2. Assuming the existence and continuity of second partial B W   derivatives with respect to s of Rθ (s, s ,ξ),wehavethe 2 2 2 2 conditional on SB and SW , where (SB , SW ) is an independent following one-term Taylor expansion with a remainder 2 2 R 2 copy of (SB, SW ). Observe that is a FGPQ for σA . term: Two other FGPQs may be defined that are related to the fore- going FGPQ but are guaranteed to take on only nonnegative k R s S = g s + g s S − t values. These are |R| and max(0, R). Asymptotically, both of θ ( , ,ξ) 0,n( ,ξ) 1,j,n( ,ξ) j j(ξ) R 2 2 j=1 these modified FGPQs are equivalent to as long as σA and σe  are nonzero. We expect max(0, R) to have better small-sample + Rn(s, S ,ξ). (1) properties. Here 2.3 Fiducial Generalized Pivotal Quantities and Generalized Test Variables g0,n(s,ξ)= Rθ s, t(ξ), ξ , FGPQs have another useful property. Suppose that θ is a ∂ g s = R s s S S 1,j,n( ,ξ)  θ ( , ,ξ) , scalar parameter. If Rθ ( , ,ξ) denotes a FGPQ for θ, then ∂sj s=t(ξ) Hannig, Iyer, and Patterson: Fiducial Generalized Confidence Intervals 257

and for the one-way random model. However, our purpose here is to illustrate the application of Theorem 1 by considering some R (s, s,ξ) n familiar examples. To save space, we illustrate the process for k k 2 the one-way random model. The conditions may be verified for   1 ∂ = s − ti(ξ) s − tj(ξ) R (s, s˜,ξ), i j 2 ∂ss θ the BehrensÐFisher problem using analogous arguments. i=1 j=1 i j Proposition 1. In the one-way random model, the (1 − where s˜ lies on the line segment connecting t(ξ) and s. α)100% GCI for σ 2 based on the FGPQ discussed in Sec- Suppose that A ⊂ Rk is an open set containing t(ξ) with A tion 2 has asymptotically 100(1 − α)% frequentist coverage as the following properties: a →∞. (a) The functions g1,j,n(s,ξ) converge uniformly in ∈ A = s to a function g1,j(s,ξ)continuous at s t(ξ). Proof. We need to verify the conditions of Theorem 1. The ∈ A k | | 2 (b) For all s , j=1 g1,j(s,ξ) > 0. generalized pivot for σA given in Section 2 may be expressed as ∂2  (c) There is M < ∞ such that |   Rθ (s, s ,ξ)| < M ∂si sj S2 S2 for all n and all (s, s) ∈ A × A. R(S , S , S , S ,σ2,σ2) = B (σ 2 + nσ 2) − W σ 2. B W B W e α 2 e α 2 e nSB nSW We are now ready to state and prove the following theorem, the proof of which is given in Appendix A. We consider n fixed and a →∞. Combining well-known facts from the theory of linear models with a simple calculation, we Theorem 1. Suppose that Assumption A holds and that √ D −2 − 2 + 2 −1 −2 − −2 → for each fixed s, n, and γ ∈ (0, 1), there exists a real num- get a(SB (σe nσα ) , SW σe ) (N1, N2), where ber Cn(s,γ)such that N1 and N2 are independent nondegenerate Gaussian random variables. Thus we can set g = (s2 −s2 )/n, g = s2 (σ 2 + lim P R (s, S,ξ)≤ C (s,γ) = γ. (2) 0,a B W 1,1,a B e →∞ ξ θ n 2 =− 2 2 = n nσα )/n, g1,2,a sW σe /n, and Rn 0. The various conditions of Assumption A are immediately verified, and the proposition Then limn→∞ Pξ (Rθ (S, S,ξ) ≤ Cn(S,γ)) = γ . In partic- ular, because Rθ (S, S,ξ) = θ, it follows that the interval follows from Theorem 1. −∞ ≤ S <θ Cn( ,γ) is a one-sided confidence interval for θ In particular, because |R| and max(0, R) are asymptotically with asymptotic coverage probability equal to γ . R 2 2 equivalent to (provided that σA and σe are nonzero), we may Remark 2. The various conditions stated in Assumption A conclude that they also will lead to GCIs with correct asymp- could be weakened. For example, we do not have to assume totic coverage. that the limiting N in Assumption A.1 is nor- Next we discuss an example where the conditions of the mal. The proof of Theorem 1 would then have to be modified proposition do not hold. It is known that neither fiducial inter- accordingly. (For an exact statement of the more general ver- vals nor GCIs have satisfactory frequentist performance in this sion of Assumption A under which Theorem 1 still holds, see example. Hannig 2005.) Example 4: A situation where conditions of Theorem 1 Remark 3. Although we have stated the result of Theorem 1 do not hold. Consider k independent samples Xi1,...,Xin iid in terms of coverage probability of a GCI, the same result car- 2 = N(µi,σ ), i 1,...,k. Suppose that we are interested in one- ries over to generalized p values associated with generalized = k 2 = sided and two-sided confidence intervals for θ i=1 µi tests of hypotheses. T T  µ µ, where µ = (µ1,...,µk) .LetX be an independent X ¯ ¯  Remark 4. The result is easily generalized to the vector pa- copy of the vector . Furthermore, let Xi· and Xi· denote 2 2 rameter case. We state a vector parameter version of Theorem 1 the sample means of the ith sample and let Sν and Sν be the in Appendix B. pooled estimates of σ 2 with ν = k(n − 1) degrees of freedom, based on X and X, respectively. The obvious generalized pivot Remark 5. The statement of Theorem 1 holds for each (an FGPQ) for θ, given by fixed ξ. Sometimes it is of interest to establish a stronger con- clusion, k 2  2 ¯ Sν ¯  R (X, X , µ,σ ) = X · + (X · − µ ) , (4) lim sup Pξ Rθ (S, S,ξ)≤ Cn(S,γ) − γ = 0, (3) θ i  i i n→∞ Sν ξ∈ 0 i=1 where 0 ⊂ . A careful review of the proof of Theorem 1 does not lead to intervals for θ with good frequentist properties. reveals that (3) holds as long as Assumption A holds uniformly This same phenomenon is observed in more general problems ∈ in ξ 0. involving quadratic functions of fixed effects in mixed linear Example 3: Verification of assumptions of Theorem 1. Con- models. For instance, Daniels et al. (2005), in their work on sider the two examples of Section 2, the BehrensÐFisher prob- GCIs for a quadratic function of fixed effects in a two-factor lem and the balanced one-way random model. We may show , were led to consider alternative GPQs because that the GPQs in these two examples satisfy the assumptions of they found that the obvious FGPQ did not lead to satisfactory Theorem 1. Hence the resulting GCIs will have asymptotically intervals. correct coverage probabilities. This is, of course, well known It is easy to see that the assumptions of Theorem 1 are satis- for the BehrensÐFisher problem and may be directly verified fied for the FGPQ in (4) if and only if θ>0. Toward this end, 258 Journal of the American Statistical Association, March 2006 notice that k-dimensional pivotal quantity that bears an invertible pivotal relationship with the parameter ξ (see Definition 4). Nearly R (X, X,µ,σ2) θ every GCI in the published statistical literature may be ob- k k ¯ tained using this recipe, and the construction always yields ¯ 2 2SνXi· ¯   2 = X · + (X · − µ ) + R (X, X , µ,σ ), i σ i i n FGPQs. For instance, Burdick, Borror, and Montgomery (2005) i=1 i=1 and Burdick, Park, Montgomery, and Borror (2005) reported where GCIs for parameters of interest in Gage R&R studies using this  2 recipe. In Theorem 2 we give a reformulation of their construc- Rn(X, X , µ,σ ) tion using the notation of this article. k 2 Sν  2 −1 −1  Because the method underlying Theorem 2, as well as its = (X¯ − µ ) + 2S X¯ ·(S − σ )(X¯ − µ ) . 2 i· i ν i ν i· i generalizations given in Theorems 3 and 4, is inspired by, and = Sν i 1 very much related to, Fraser’s (1961, 1966, 1968) development Hence all assumptions of Theorem 1 are satisfied as long of structural inference, we refer to this method as the structural = as σ>0 and θ>0. But if θ 0, then Assumption A.2(b) method for constructing FGPQs. is clearly violated. Notice that, regardless of the value of θ, It is useful to first record the following definitions. Rθ > 0 almost surely. Therefore, if θ = 0, then any lower con- S = ∈ S ⊂ Rk fidence bound based on the distribution of Rθ will be positive, Definition 3. Let (S1,...,Sk) be a k-dimen- and hence will miss the true value θ = 0. sional statistic whose distribution depends on a p-dimensional This problem also causes difficulties when one attempts to parameter ξ ∈ . Suppose that there exist mappings f1,...,fq, k p obtain a fiducial interval for θ. Wilkinson (1977) considered with fj : R ×R → R, such that if Ei = fi(S; ξ),fori = 1,...,q, this example in the context of fiducial inference and noted that then E = (E1,...,Eq) has a joint distribution that is free of ξ. the point µ = 0 is a special point in the space of {µ}. Barnard We say that f(S,ξ) is a pivotal quantity for ξ, where f = (1982) also pointed out difficulties that arise when the fiducial ( f1,...,fq). distribution of µ is used to derive fiducial distributions for func- S tions of µ that are not one-to-one, such as µ2. We consider this Definition 4. Let f( ,ξ) be a pivotal quantity for ξ as de- = ∈ S example further in Example 5, where we propose a new FGPQ scribed in Definition 3 with q p. For each s , define that does not suffer the problems at θ = 0 discussed here for the E(s) = f(s, ). Suppose that the mapping f(s, ·) : → E(s) is naive FGPQ given in (4). invertible for every s ∈ S. We then say that f(S,ξ) is an in- vertible pivotal quantity for ξ. In this case we write g(s, ·) = 4. A STRUCTURAL METHOD FOR CONSTRUCTING (g1(s, ·),...,gp(s, ·)) for the inverse mapping, so that whenever FIDUCIAL GENERALIZED PIVOTAL QUANTITIES e = f(s,ξ),wehaveg(s, e) = ξ. As mentioned earlier, during the past few years, the idea of The following theorem gives a recipe for constructing GCIs and generalized tests have been used by many authors FGPQs based on the structural method when an invertible piv- to obtain useful inference procedures in nonstandard problems. otal quantity exists. Although Weerahandi (1993) provided a few examples illus- k trating the application of GPQs, he did not provide a systematic Theorem 2. Let S = (S1,...,Sk) ∈ S ⊂ R be a k-dimen- approach for finding GPQs. As a matter of fact, Weerahandi sional statistic whose distribution depends on a k-dimensional (1993) stated that “the problem of finding an appropriate piv- parameter ξ ∈ . Suppose that there exist mappings f1,...,fk, k k otal quantity is a nontrivial task...[and] ...furtherresearchis with fj : R × R → R, such that f = ( f1,...,fk) is an invertible necessary to develop simple methods of constructing general- pivotal quantity with inverse mapping g(s, ·). Define ized pivotals for classes of general problems, and this is beyond  the scope of this article.” Rθ = Rθ (S, S ,ξ) Until recently, a general method for constructing GPQs was   = π g1 S, f(S ,ξ) ,...,gk S, f(S ,ξ) not available in the literature, and each particular problem ap-   peared to require some ingenuity in constructing an appropriate = π g1(S, E ),...,gk(S, E ) , GPQ or a test variable. Chiang (2001) proposed the method of   surrogate variables for deriving approximate confidence inter- where E = f(S ,ξ) is an independent copy of E. Then Rθ is vals for functions of variance components in balanced mixed aFGPQforθ = π(ξ). When θ is a scalar parameter, an equal- linear models. Iyer and Mathew (2002) pointed out that his in- tailed two-sided (1 − α)100% GCI for θ is given by Rθ,α/2 ≤ tervals are identical to GCIs. Nevertheless, Chiang had indi- θ ≤ Rθ,1−α/2.HereRθ,γ = Rθ,γ (s) denotes the 100γ th per- rectly provided a systematic method for computing GCIs for centile of the distribution of Rθ conditional on S = s. One- the class of problems that he considered; however, he did not sided generalized confidence bounds are obtained in an obvious extend his method to any other class of problems, and also did manner. not discuss the connection between his method and either Weer- E = S ahandi’s GCIs or Fisher’s fiducial intervals. Proof. Note that, because the distribution of f( ,ξ) R S S In an unpublished technical report, Iyer and Patterson (2002) does not depend on ξ, the distribution of ( , ,ξ) given S = R S S = formulated a method for constructing GPQs and generalized s does not depend on ξ. In addition, θ ( , ,ξ) θ by test variables for a parameter θ = π(ξ) and proved that the definition of g and f. Therefore, Rθ satisfies the requirements method works for the class of problems where there exists a for it to be a FGPQ for θ. Hannig, Iyer, and Patterson: Fiducial Generalized Confidence Intervals 259

T Remark 6. Weerahandi (2004) described an approach termed µ = (µ1,...,µk) lies. We consider the problem of obtaining the substitution method, discussed also by Peterson, Berger, and a lower confidence bound for θ. n Weerahandi (2003), for constructing generalized test variables For i = 1,...,k,letX¯ · = X /n, the mean for the ith i j=1 ij and GPQs in certain classes of problems. This method is essen- 2 k ¯ 2 2 k n sample. We define S = = X · and S = ( = = (Xij − tially the same as the construction given by Iyer and Patterson H i 1 i W i 1 j 1 X¯ ·)2)/(k(n − 1)). We first observe that S2 and S2 are indepen- (2002). i W H − 2 2 ∼ 2 2 2 ∼ dent. In addition, ((n 1)kSW )/σ χ − and (nSH)/σ [ ≤ R S ]= [ [R S S k(n 1) Remark 7. Because PS θ θ,γ ( ) PS PS θ ( , , χ2 where χ2 denotes the noncentral chi-squared distrib- ≤ |S]≤ ] k,nθ/σ2 k,λ ξ) θ γ , it follows that a GCI for a parameter θ based ution with noncentrality parameter λ. We use the definition of on the GPQ Rθ is frequentist exact if and only if the distribu-  noncentrality as given by Rao (1973) that is consistent with the tion of U(S; θ)= PS [R (S, S ,ξ)≤ θ|S] is uniform on [0, 1] θ definition used by software packages SAS and R. Some authors for all θ. In particular, U(S; θ) is an ordinary pivotal quantity define the noncentrality parameter to be half the noncentrality for θ. It is also easy to see that U(S; θ) will have a uniform defined by Rao. [0, 1] distribution whenever the inequality R ≤ θ can be put θ Let F (x; λ) denote the value of the noncentral chi-squared in the form q(S,ξ) ≤ q(S,ξ) for some function q(·, ·).This ν latter sufficient condition is easy to check in many applica- cumulative distribution function (cdf ) with ν degrees of free- tions. An interesting example of this was reported by Roy and dom (df ) and noncentrality λ evaluated at x.Forx > 0, if ; = ∈ ; = Mathew (2005), who for a given constant t obtained a gener- Fν(x λ) t (0, 1), then we write Qν(t x) λ, so that Qν is alized confidence bound for τ = (t − µ)/θ based on a right- the inverse of Fν when Fν is viewed as a function of λ, keep- censored sample from a shifted exponential distribution with ing ν and x fixed. The existence of Qν is guaranteed by the monotonicity of F viewed as a function of λ. In what follows density f (x; θ,µ)= (1/θ) exp(−(x − µ)/θ)I[µ,∞)(x). They ob- ν ; ; served that the resulting GCI had exact frequentist coverage. it is important to notice that if t < Fν(x 0), then Qν(t x)>0 One might note that an ordinary pivotal quantity is available in is strictly decreasing continuous function, and if t ≥ Fν(x; 0), ˆ their problem and is given by the cdf of T = (t −ˆµ)/θ, where then Qν(t; x) = 0 by definition. We also write Gν(t; λ) = x ˆ µˆ and θ are the maximum likelihood estimators (MLEs) for for the inverse of Fν when Fν is viewed as a function of x, = 2 2; 2 µ and θ. keeping ν and λ fixed. Define E1 Fk(nSH/σ nθ/σ ) and E = ((n − 1)kS2 )/σ 2. Clearly, E and E are independent. Remark 8. In the case of models admitting a group struc- 2 W 1 2 Furthermore, E has a uniform distribution on (0, 1). Thus it ture where the pivotal mapping corresponds to group multipli- 1 follows that the statistic S = (S2 , S2 ) and the parameter (θ, σ 2) cation, the invertibility of the pivotal relationship is guaranteed W H have an invertible pivotal relationship. Applying the structural because of the invertibility of the group operation. As an il- lustration, consider the complete sufficient statistic (X¯ , S)for method of Theorem 2, we get the following FGPQ for θ: 2 the normal family N(µ, σ ). Consider the mapping (E1, E2) = 2 2 R 2 nS nθ nS f((X¯ , S), (µ, σ )) = ((X¯ − µ)/σ, S/σ ). First, note that all three R = σ Q F H ; ; H , θ k k 2 2 R sets E, X , and can be identified with R × R+, where R+ is n σ σ σ 2 the set of all positive real numbers. For (u , u ) and (v , v ) in 2 1 2 1 2 2 SW + where R 2 = σ . (5) G = R×R , define a binary operation ◦ by (u1, u2)◦(v1, v2) = σ 2 SW (u1 + u2v1, u2v2). It is easily verified that G is a group with this binary operation. The identity element is (0, 1), and the As usual, S2 and S2 are independent copies of S2 and S2 . − W H W H inverse of (u1, u2) is ( u1/u2, 1/u2). The pivotal relation- It is a known fact (see Johnson and Kotz 1970, chap. 28), that = − ◦ ¯ ship may be expressed as (E1, E2) ( µ/σ, 1/σ ) (X, S). if λ →∞, then the noncentral chi-squared cdf with noncentral- The invertibility of this relationship is guaranteed because + + ¯ −1 ity parameter λ approaches the cdf of N(k λ,2(k 2λ)).The (X, S) = (−µ/σ, 1/σ ) ◦ (E , E ) = (µ, σ ) ◦ (E , E ) = (µ + − 1 2 1 2 error of the approximation is of the order O(λ 1/2) uniformly σ E ,σE ). (For further discussion of this and other related is- 1 2 in x. Thus one can verify the conditions of Theorem 1 directly sues, see Fraser 1961; Dawid and Stone 1982.) as long as θ>0. We now discuss several examples that illustrate how the The more interesting case is θ = 0. Just as before, the con- structural method of Theorem 2 may be applied to obtain ditions of Theorem 1 do not apply. However, this time we can FGPQs in some important applications. The next example is prove directly that a lower confidence bound on θ derived from a continuation of Example 4 where the obvious FGPQ did not the FGPQ in (5) has the correct asymptotic coverage. Toward yield satisfactory GCIs. = 2 2; 2 this end, first notice that U Fk(nSH /σ nθ/σ ) has uniform − Example 5: Continuation of Example 4. We continue with distribution on (0, 1). Then fix α and consider a 100(1 α)% = the notation of Example 4. Recall that Xij, j = 1,...,n, lower confidence bound L. The true value θ 0 is included 2 [ ∞ R   i = 1,...,k,arek independent samples where Xij ∼ N(µi,σ ). in the interval L, ) if and only if P( θ (sH, sW , SH, SW , P = k 2 ξ)= 0)>α. Notice that S2 → σ 2, and therefore P(R (s , s , The parameter of interest is θ i=1 µi . This parameter pro- W θ H W   = 2 2 ≤ → vides a measure of the extent to which the k normal means SH, SW ,ξ) 0)>αif and only if nSH/SW Cn, where Cn deviate from the null hypothesis that the means are all 0. This Gk(1 − α; 0) by Slutsky’s theorem. Thus Pθ (0 ∈[L, ∞)) = 2 2 ≤ → 2 2 ≤ − ; = − θ appears in the noncentrality parameter of the distribution√ of Pθ (nSH/SW Cn) Pθ (nSH/σ Gk(1 α 0)) 1 α, the for testing this null hypothesis. Note that θ is establishing that the FGPQ of (5) leads to a generalized lower the radius of a sphere centered at 0 on whose surface the vector confidence bound for θ with correct asymptotic coverage. 260 Journal of the American Statistical Association, March 2006

We have examined the small-sample performance of the fore- submitting an approval request demonstrate that certain equiv- going generalized confidence bound in a small simulation study alence criteria are satisfied. One such criterion, the average and found the coverage probabilities to be close to the nominal bioequivalence criterion, requires that the ratio θ = µT/µR be value. The next example has important applications in industrial sufficiently close to 1, where µT denotes the mean response to and quality improvement. the test drug and µR denotes the mean for the reference drug. A confidence interval for the ratio θ = µ /µ is useful in this Example 6: Proportion conformance. Consider a popula- T R situation. (Readers interested in details may refer to U.S. FDA tion characteristic whose distribution may be assumed to be 2001.) N(µ, σ 2) (µ, σ are unknown). Let θ denote the proportion A key response variable in such studies is the area under the of this population contained in the interval (C , C ), where 1 2 curve (AUC) relating the plasma drug concentration in a patient C < C are real numbers (C may be negative infinity and 1 2 1 to the elapsed time after the drug is administered. In accordance C may be positive infinity). In many practical applications it 2 with the FDA guidelines, the analysis of AUC is carried out is of interest to obtain a lower or an upper confidence bound using the log scale. This is because the distribution of AUC for θ. In the manufacturing of machine parts, a part is said to is typically well modeled by a lognormal distribution. So the meet specifications provided that its performance characteris- parameter of interest is the ratio of means of two lognormal tic is contained in a prespecified interval (C , C ) determined 1 2 distributions. This approach, termed “average bioequivalence,” by engineering requirements. One would like to be assured that involves the calculation of a 90% confidence interval for the θ is large, so a lower confidence bound for θ would be of in- ratio of the averages of test and reference products. To establish terest. In environmental applications it is often of interest to bioequivalence, the calculated confidence interval should fall ensure that pollutant levels in soil, water, or air are well below within a bioequivalence limit, usually 80Ð125% for the ratio of an amount determined to be safe by the U.S. Environmental the product averages. Protection Agency. Here θ may represent the proportion of all possible water samples in which the concentration of arsenic The experimental design of choice in bioequivalence stud- exceeds 5 µg/L. One would like to be assured that this propor- ies is a two-period crossover design with an adequate washout tion is small and so an upper confidence bound for θ would period to minimize carryover effects. But a two-group design be of interest. In the case of arsenic concentrations, it may be (also called a parallel design) is the more appropriate design more reasonable to use a lognormal model, but the problem when the half-lives of drugs being tested is very long, and this can be restated using the log scale. The problem of comput- is recognized in the FDA guidelines. In this example we assume ing confidence bounds for θ has been considered by Chou and that a two-group design is used for the bioequivalence study. = Owen (1984) and also by Wang and Lam (1996). These au- Let Yij, j 1,...,n1, denote independent random variables = ∼ 2 = = thors have provided methods for computing approximate lower such that Xij ln(Yij) N(µi,σi ) for j 1,...,ni, i 1, 2. = − + 1 2 − 2 confidence bounds for θ. Here we exhibit a FGPQ for θ de- Then θ exp(µ1 µ2 2 (σ1 σ2 )) is the ratio of the means = rived using the structural method of Theorem 2. Suppose that of the two lognormal populations. When σ1 σ2, this expres- 2 sion simplifies, and we get θ = exp(µ − µ ). The problem of an iid sample X1,...,Xn is available from the N(µ, σ ) distri- 1 2 − − = C2 µ − C1 µ obtaining a confidence interval for θ is straightforward in this bution. The parameter of interest is θ ( σ ) ( σ ). S = ¯ 2 ¯ 2 = n − special case; however, the problem does not admit an exact so- Let (X, S ), where X is the sample mean, S i=1(Xi ¯ 2 − = 2 lution in the general case. X) /(n 1) is the sample variance, and ξ (µ, σ ). Applying n −R −R i = , X¯ = i X /n S2 = C2 µ C1 µ Let us define, for 1 2, i j=1 ij i and i the structural method, we get Rθ = ( R ) − ( R ) n σ σ ( i (X − X¯ )2)/(n − 1). The structural method may be ap- R = σ S R = ¯ − ¯  − S j=1 ij i i as a FGPQ for θ, where σ S and µ X (X µ) S . plied, and we obtain the following FGPQ for θ: It is straightforward to verify that the assumptions of Theorem 1 are satisfied, and hence the GCIs obtained using the proposed 1 2 2 Rθ = exp Rµ − Rµ + R − R , FGPQ have correct asymptotic coverage. Simulation results 1 2 2 σ1 σ2 pertaining to small-sample performance of the GCI for propor- where R = σ S /S, R = σ S /S, R = X¯ − (X¯  − tion conformance have been reported by Patterson, Hannig, and σ1 1 1 1 σ2 2 2 2 µ1 1 1 µ )S /S, and R = X¯ − (X¯  − µ )S /S. It is straightfor- Iyer (2004b) in an unpublished technical report. They found the 1 1 1 µ2 2 2 2 2 2 ward to check that the conditions of Theorem 1 hold. Hence the performance of the GCIs satisfactory for use in practical appli- resulting GCI has correct asymptotic coverage. cations. We have conducted a simulation study to evaluate the perfor- The method of this example is generalizable to more complex mance of the GCI for θ in small samples. Results of our study situations. Of particular interest is GCIs for the proportion of indicate that the GCI approach may be recommended for prac- conformance in the one-way random-effects model. This has tical applications. also been studied in detail by Patterson et al. (2004b). Krishnamoorthy and Mathew (2003) have discussed general- The next example illustrates a practical problem arising in ized inference for a single lognormal mean, and Krishnamoor- pharmaceutical statistics where GCIs provide a useful solution. thy (unpublished work) has considered generalized inference for the difference between two lognormal means. The general- Example 7: Average bioequivalence. In bioequivalence stud- ized pivotal quantities proposed by them are the same as what ies comparing a test drug to a reference drug, it is of interest one would get by applying the structural method of Theorem 2. to compare the mean responses of the two drugs to ensure that they are (more or less) equally effective. With this in mind, the In the next section we generalize the recipe of Theorem 2 in U.S. Food and Drug Administration (FDA) requires that the lab two different ways. Hannig, Iyer, and Patterson: Fiducial Generalized Confidence Intervals 261

5. TWO GENERALIZATIONS OF THE STRUCTURAL these pivotal quantities. More generally, whenever multiple piv- METHOD FOR FIDUCIAL GENERALIZED otal quantities, P1,...,Pm, are available for a parameter θ,any PIVOTAL QUANTITIES function of the Pi is again a pivotal quantity for θ. The same ob- The recipe given in Theorem 2 is best suited for the case servations holds for GPQs but not necessarily for FGPQs. But when a complete sufficient statistic is available for the infer- if P1,...,Pm are FGPQs for θ, then certainly any linear func- m m = ence problem under consideration, although this is not a pre- tion i=1 LiPi, with i=1 Li 1, is a FGPQ for θ.Wenow ¯ = ¯ +···+ ¯ requisite. First, in the interest of using all of the information in apply the recipe of Theorem 2. Define XL L1X1 LkXk. S = 2 2 ¯ the data, we seek inference methods based on sufficient statis- Let (S1,...,Sk , XL). We have the following pivotal rela- tics. It is then appropriate to restrict oneself to procedures based tionships: on a minimal sufficient statistic. When this is also complete, the 2 (ni − 1)S dimension of the minimal sufficient statistic will equal the di- E = i ∼ χ2 , i = 1,...,k, i 2 ni−1 mension of the parameter indexing the family of distributions σi under consideration. We can now directly apply the structural and method for constructing FGPQs, provided that we can find an ¯ appropriate invertible pivotal relationship between S and ξ. XL − µ E + = ∼ N(0, 1). When the minimal sufficient statistic is not complete, we may k 1 k L2σ 2/n still find a statistic S that has an invertible pivotal relationship i=1 i i i with the parameter ξ. In this case one can apply the structural The conditions needed to apply the structural method of Theo- method to get an FGPQ. In this situation there will usually be rem 2 are satisfied, and we conclude that several choices for the insufficient statistic S with which to con- struct an invertible pivotal quantity. Some choices can dominate k 2 2 2 2 = σ L S /(niS ) other choices, but no general guidelines are available to help R = ¯ − ¯  − i 1 i i i i µ XL (XL µ) with such a choice. The following example helps illustrate this k 2 2 i=1 σi Li /ni situation. is a FGPQ regardless of the choice of L1,...,Lk as long as they Example 8: Common mean problem. Suppose that Xi1,..., 2 sum to unity. Xin are iid N(µ, σ ), i = 1,...,k. It is of interest to con- i i ¯ struct a confidence interval for the common mean µ.LetXi = We revisit this common mean problem in Example 9 and ni 2 = ni − ¯ 2 − ( j=1 Xij)/ni and Si ( j=1(Xij Xi) )/(ni 1). Clearly, again in Example 11 and introduce two more FGPQs. In Sec- S = ¯ ¯ 2 2 tion 5 we give the results of a small simulation study comparing (X1,...,Xk, S1,...,Sk ) is a minimal sufficient statistic for 2 2 selected approaches from the literature and the FGPQs defined (µ, σ1 ,...,σk ), but it is not complete. Many authors have addressed the problem of point estima- in this article. tion as well as confidence for µ in this In the remainder of this section we present two systematic ap- setting using frequentist approaches based on pivotal quanti- proaches for constructing FGPQs for situations where the min- ties. (For some recent work on this problem, see Jordan and imal sufficient statistic is not complete. The first approach is Krishnamoorthy 1996; Yu, Sun, and Sinha 1999.) Some of the a two-stage approach for constructing a FGPQ where the struc- pivotal quantities considered by these authors include tural method of Theorem 2 is invoked at each stage. The second k approach offers a very general construction and has connections =− with fiducial inference and ancillary statistics. PQ1 2 loge(Pi(µ0)), i=1 5.1 Two-Stage Construction of a Fiducial k ¯ Xi − µ Generalized Pivotal Quality PQ2 = √ , and (6) Si/ ni i=1 The first alternative is a two-stage approach, which is out- lined in the following theorem. k X¯ − µ 2 = i √ PQ3 . Theorem 3. Let ξ = (ξ1,ξ2) and suppose that the following Si/ ni i=1 conditions hold: In PQ , P (µ ) is the p value for testing H : µ = µ versus 1 i 0 0 0 (a) Assuming that ξ is known, there is a statistic, H : µ>µ using the Student t test with data from the ith group 2 a 0 S = S (ξ ), that bears an invertible pivotal relationship with ξ ; only. The statistic PQ combines evidence from the k groups 1 1 2 1 1 see Definition 4. following Fisher’s method for combining p values (see Fisher (b) There is a statistic S such that S and ξ have an invert- 1970). The combined evidence against H is quantified by the 2 2 2 0 ible pivotal relationship. area to the right of PQ1 under a chi-squared-density with k df. Values of µ that lead to a combined p value greater than α R 0 Let ξ1|ξ2 beaFGPQforξ1 obtained based on the pivotal rela- form a one-sided confidence interval with upper bound equal tionship between S1(ξ2) and ξ1 using the structural method of to ∞. R Theorem 2. Likewise, let ξ2 beaFGPQforξ2 obtained using Clearly, each P (µ) is a pivotal quantity for µ. So any func- i its pivotal relationship with S2. Then Rξ |R is a FGPQ for ξ1. tion of these is also a pivotal quantity for µ. Thus exact con- 1 ξ2 fidence regions for µ may be constructed using any one of Proof. This theorem follows directly from Theorem 2. 262 Journal of the American Statistical Association, March 2006

Example 9: Two-stage construction of a FGPQ for the com- defined by e0 = f0(s,ξ) is invertible. Write E0 = f0(s, ). mon mean problem. Continuing with the notation of Exam- We denote this inverse mapping from E0 to by g0(s, ·) = S = ¯ ¯ 2 2 · · = ple8,wehavethat (X1,...,Xk, S1,...,Sk ) is a minimal (g1(s, ),...,gp(s, )); thus we have g0(s, f0(s,ξ)) ξ for each 2 2 ∈ S sufficient statistic for (µ, σ1 ,...,σk ), but it is not complete. s . S S = ¯  ¯  2 An independent copy of is denoted by (X1,...,Xk , S1 , Remark 9. It is not necessary that E0 consist of the first p el- ...,S2). A realization of S is denoted by s = (x¯ ,...,x¯ , s2, k 1 k 1 ements of E, but we can always achieve this by relabeling if 2 2 2 ...,s ). In Theorem 3, we take ξ1 = µ, ξ2 = (σ ,...,σ ), S k ¯ ¯ 1 k necessary. Note that condition (a) simply says that f( ,ξ) is a S = n1X1 + ··· + nkXk n1 + ··· + nk S = 1(ξ2) ( 2 2 )/( 2 2 ), and 2 pivotal quantity for ξ with a continuous cdf, and condition (b) σ1 σk σ1 σk 2 2 is a partial invertibility condition similar to what was required (S1,...,Sk ). This leads to the FGPQ given by in Definition 4 for a pivotal quantity to be invertible. ¯ R +···+ ¯ R n1X1/ σ 2 nkXk/ σ 2  1 k E = = Rµ(S, S ,ξ)= Now let c (Ep+1,...,Ek) and fc ( fp+1,...,fk). Sub- n1/R 2 +···+nk/R 2 = S E = S = σ1 σk stituting ξ g0( , 0) in the equations Ej fj( ,ξ), j + ¯  R +···+ ¯  R p 1,...,k, we get the identity n1X1/ σ 2 nkXk / σ 2 − 1 k − µ . n1/R 2 +···+nk/R 2 Ec = fc S, g0(S, E0) . (7) σ1 σk

Proposition 2. Let all n1,...,nk approach infinity in such a For any fixed s ∈ S,letM(s) denote the set of e = (e1,..., = +···+ way that cj lim nj/(n1 nk) exists and 0 < cj < 1. Then ek) ∈ E satisfying the 100(1 − α)% confidence interval for µ based on the FGPQ Rµ has asymptotically 100(1 − α)% frequentist coverage. ej = fj s, g0(s, e0) , j = p + 1,...,k, (8) The proof of this proposition is given in Appendix A. Next, where e0 = (e1,...,ep), the first p coordinates of e. Thus we describe a second alternative construction of FGPQs. This M(s) is a manifold in Rk. is a more general approach that in principle is applicable to any Given S, by virtue of (7), it follows that E must lie on the parametric problem, but the properties of the resulting GCIs manifold M(S). Note that the same manifold may be defin- are much more difficult to assess. Nonetheless, we can show able using a different set of equations. We use the notation that GCIs obtained using this approach have exact frequentist B(s, e) = 0 for the equations chosen to define M(s).Insome coverage in some special situations. situations it may be possible to express the equations defin- M − = 5.2 A General Recipe for Constructing Fiducial ing (s) in the form a(s) b(e) 0 for suitably chosen map- S Generalized Pivotal Quantities pings a and b. Clearly, a( ) is an , because its distribution is not dependent on ξ. The two-stage construction is a systematic approach for ob- We now make the following assumption concerning the ran- taining a FGPQ for a number of problems where the minimal dom vector E0. sufficient statistic is not complete. In many situations it is fairly straightforward to verify that the conditions of Theorem 1 hold Assumption C. Conditional on B(s, E), E0 has a jointly con- for the two-stage FGPQ so that the resulting interval will have, tinuous distribution. at least asymptotically, the correct frequentist coverage. How- The following lemma states the multivariate version of the ever, the two-stage construction is perhaps not general enough; probability integral transform. see Example 11. In this section we provide a general method for constructing FGPQs and illustrate its application through Lemma 1. Let F1(·|B(s, E)) denote the cdf of E1 given examples. B(s, E) and Fj(·|E1,...,Ej−1, B(s, E)) denote the cdf of Ej S = ∈ S ⊂ Rk Let (S1,...,Sk) denote a statistic whose dis- given E1,...,Ej−1, B(s, E), j = 2,...,p. The conditional joint ∈ ⊂ Rp = tribution is indexed by ξ .Letθ π(ξ) be the pa- distribution of E0 given B(s, E) is completely determined by rameter of interest. Estimability considerations suggest that it the univariate cdf’s F1,...,Fp. Let the random vector U be de- is reasonable to restrict ourselves to cases where k ≥ p. When fined by k = p, the structural method of Theorem 2 is applicable. In this section we outline a general construction for obtaining FGPQs U = F E0|B(s, E) in situations where p < k (more statistics than parameters). = F1 E1|B(s, E) , F2 E2|E1, B(s, E) ,..., We call this the general structural method. This construction reduces to the structural method of Theorem 2 when p = k. Fp Ep|E1,...,Ep−1, B(s, E) . (9) We make the following assumption. The distribution of U, conditional on B(s, E), is uniform S × → Rk Assumption B. (a) There exists a mapping f : on [0, 1]p. Hence this is also its unconditional distribution. such that E = f(S,ξ)has a continuous cdf that does not depend ·|B E ·|B E on ξ.LetE = f(S × ). Write f = ( f1,...,fk), so that Ei = We denote the inverse of F( (s, )) by G( (s, )).Both fi(S,ξ)for i = 1,...,k. F and G depend on ξ, but for clarity of notation we do not (b) Let E0 = (E1,...,Ep) and f0 = ( f1,...,fp).Weas- explicitly display this dependence. sume that for each fixed s ∈ S, the mapping f0(s, ·) : → E We now state our general construction for FGPQs. Hannig, Iyer, and Patterson: Fiducial Generalized Confidence Intervals 263

Theorem 4. Suppose that Assumptions B and C are satisfied. Namely, the generalized confidence region C(S,β) for ξ with Let θ = π(ξ) be a scalar parameter. Define confidence coefficient β has frequentist coverage also equal  to β. Rθ = Rθ (S, S ,ξ) Proof. We have = π g S, G F f (S,ξ)|B(S, E) 0 . 0 0  β = P[R(s, S ,ξ)∈ C(s,β)], Then Rθ is an FGPQ for θ. by the definition of C(s,β); Proof. This is obvious by construction. = E ∈ C |B E = P g0(s, 0) (s,β) (s, ) 0 , Remark 10. By the definition of F and G, it follows that the by the definition of F and G (see Remark 10); distribution of = E ∈ A | = E   P 0 (s,β) a(s) b( ) , G F f0(S ,ξ)|B(S, E ) 0 by assumption about the form of B(s, e); E B S E = is the same as the distribution of 0 given ( , ) 0. We use this fact in the proof of Theorem 5. = P E0 ∈ A(s,β)|a(s) = b(E) , Note that the FGPQ defined here will generally depend on because E is an independent copy of E; the choice of the defining equations for M(s). An example of = P E0 ∈ A(s,β)|a(s) = a(S) , this phenomenon is given later in this section. An obvious ques- tion now is whether a theorem analogous to Theorem 1 may be because b(E) = a(S) is an identity; established that would guarantee correct asymptotic coverage = P f0(S,ξ)∈ A(S,β)|a(s) = a(S) , for GCIs based on this general construction of FGPQs. We con- by definition of E0 and assumption about A(S,β); jecture that such a result exists under fairly general conditions, but we do not have an actual theorem at this time. This con- = P ξ ∈ C(S,β)|a(s) = a(S) , jecture is supported by the examples that we have considered. S E = S A S = C S In principle, we should be able to use Theorem 1 directly as because g0( , 0) ξ and g0( , ( ,β)) ( ,β). long as we can get the conditional cdf in closed form. However, Therefore, it follows that P[ξ ∈ C(S,β)]=β. there are problems caused by the fact that the statistics B often We now illustrate the general construction of this section us- converge to the edge of their sample space. This causes var- ing two examples. We also use the first example to demonstrate ious quantities in Assumptions A to be unbounded. We might that different choices of equations for defining M(S) may re- be able to overcome this difficulty by paying special attention to sult in different (nonequivalent) FGPQs. But in this example the quantities B(S, E) used in the determination of conditional they turn out to be asymptotically equivalent, thus leading us distributions. to conjecture that under certain conditions, different choices Example 10, discussed shortly, provides an illustration of the of B(s, E) may lead to asymptotically equivalent GCIs. The difficulties outlined in the previous paragraph. This example second example is an application of the foregoing general con- also illustrates a situation where√ one would need to use a nor- struction to the common mean problem, which yields an FGPQ malization factor n rather than n; in this connection, see the different from that obtained from the two-stage construction technical report of Hannig (2005). discussed earlier. 5.3 Exact Coverage of Generalized Confidence Example 10: Nonuniqueness of FGPQ arising due to dif- Intervals in Special Cases ferent choices of B(S, E). Suppose that E = (EM, Em) and S = (X , X ). We assume that X = θE , i = M, m,0<θ∈ R, For the case where k = p, Remark 7 pointed out that GCIs M m i i E and E are distributed as the maximum and the minimum of can have exact frequentist coverage under certain conditions. M m n iid uniform[1, 2] random variates. We wish to obtain a FGPQ When p = 1 (i.e., when ξ is a scalar parameter), one can show for θ. We now show that two different FGPQs arise from two that a generalized confidence region for ξ has exact frequentist different choices of defining equations for M(S), but the result- coverage under certain circumstances even when k > 1. This is ing GCIs both have asymptotically correct coverage. the content of the next theorem, whose proof is based on ideas Take f(S,θ) = (EM, Em) = E, where EM = XM/θ and given by Dawid and Stone (1982). Em = Xm/θ. Clearly, the distribution of E is free of θ.Let E = S = Theorem 5. Suppose that ξ is a scalar parameter. Assume 0 EM, so that f0( ,ξ) XM/θ. The mapping g0 may = S E = that the manifold M(S) can be defined using an equation of be defined by θ g0( , 0) XM/EM. Substituting this in = = the form a(s) = b(e), and let R(S, S,ξ) denote the FGPQ the relation Em Xm/θ gives the identity EM/Em XM/Xm. = = obtained using the general construction. Let P[·] denote the Let s (xM, xm) and e (eM, em) denote realizations of S E = = probability measure associated with S and let P [·] denote and . Write a(s) xM/xm and u(e) eM/em. The mani-  M the measure associated with S.LetC(S,β) beasetin fold (curve) (s) on which e must lie may be described by B = − = satisfying P [R(s, S,ξ) ∈ C(s,β)]=β.LetA(s,β) denote the equation (s, e) a(s) u(e) 0. The same manifold may  B = − = the image of C(s,β) under the mapping f (s, ·). Assume that also be described using the condition (s, e) eM aem 0, 0 = A(s ,β)= A(s ,β)whenever a(s ) = a(s ). Then where a xM/xm. It is easy to check that one gets nonequiva- 1 2 1 2 lent FGPQs depending on the choice of B for defining M(s). P[ξ ∈ C(S,β)]=β. We state only the results here omitting the detailed calculations. 264 Journal of the American Statistical Association, March 2006

• FGPQ using B(s, e) = eM/em − a. Applying the general Then e must take values on the manifold M(s) defined by the construction, we get the following generalized upper con- k − 1 equations fidence bound for θ: 2 2 n 1/n ei si ek sk XM 0 = − √ − (x¯ −¯x ) θ ≤ , − − i k α2n + (1 − α)An ek+i/(ni 1) ni e2k/(nk 1) nk where A = XM/Xm. This is the same upper confidence 2 2 si sk bound obtained by considering the conditional distribu- = ti − tk − (x¯i −¯xk) for i = 1,...,k − 1, ni nk tion of the pivot XM/θ conditional on the ancillary statistic A = XM/Xm. Thus by Theorem 5, this interval has correct where ti, i = 1,...,k, are realizations of independent Student t − frequentist coverage 1 α, conditionally on A, and hence random variables with√ni −1 degrees√ of freedom. For i = 1,..., 2 2 also unconditionally. k − 1, define Bi = Ti s /ni − Tk s /nk − (x¯i −¯xk). A fidu- • B = − i k FGPQ using (s, e) eM aem. Here the general con- cial distribution for µ √may be defined as the conditional distrib- struction leads to the upper confidence bound given by ution of µ =¯x − T s2/n given B = B =···=B − = 0 k k k k 1 2 k 1 n−1 1/(n−1) B = = X (see, e.g., Fisher 1961a,b). Let (B1,...,Bk−1) and b θ< M , α2n−1 + (1 − α)An−1 (b1,...,bk−1). First, we note that the joint probability density function (pdf ) of B1,...,Bk−1,µis given by where A = XM/Xm. Unlike the earlier fiducial interval, which was based on conditioning on an ancillary statis- fB1,...,Bk−1,µ(e1,...,ek−1, ek) tic, this fiducial interval does not have exact frequentist (x¯ − e − e )2 n1/2 coverage. Although it is based on the pivotal quantity = K 1 + 1 k 1 ··· ˆ 2 n−1 σ1 = (XM/θ) W − − , 2 nk−1/2 α2n 1 + (1 − α)An 1 (x¯k−1 − ek − ek−1) × 1 + ˆ 2 it uses an incorrect percentile of the distribution of the σk−1 pivot. This is seen by noting that the fiducial interval is ob- − (x¯ − e )2 nk/2 1 tained from the probability statement Pr[W > 1]=1 − α, × 1 + k k , ˆ 2 butthevalue1usedastheα-percentile of W is incorrect. σk Instead, using the exact probability statement ˆ 2 = − 2 where K is the normalizing constant and σi (ni 1)si /ni is [α + (A/2)n(1 − α)](n−1)/n the MLE of σ 2. Hence the conditional pdf of µ given B = b Pr W > i i i [α + (A/2)n−1(1 − α)] for i = 1,...,k − 1, is 2 n1/2 would lead to an exact frequentist confidence interval. This (x¯1 − t − b1) f | (t) = C 1 + ··· observation also helps us demonstrate that the fiducial in- µ b1,...,bk−1 ˆ 2 σ1 terval has the correct frequentist coverage asymptotically, n − /2 because as n →∞ (x¯ − − t − b − )2 k 1 × 1 + k 1 k 1 [α + (A/2)n(1 − α)](n−1)/n σˆ 2 → k−1 − 1, [α + (A/2)n 1(1 − α)] − (x¯ − t)2 nk/2 1 × 1 + k , and W converges in distribution to a nondegenerate limit. ˆ 2 σk The issue of nonuniqueness observed in this example is re- where lated to the Borel paradox described by, for example, Casella ∞ 2 n1/2 and Berger (2002, sec. 4.9.3). (For a more in-depth discussion − (x¯ − t − b ) C 1 = dt 1 + 1 1 ··· of the various types of nonuniqueness associated with condi- −∞ ˆ 2 σ1 tional distributions, see Hannig 1996.) n − /2 (x¯ − − t − b − )2 k 1 Example 11: FGPQ for the common mean problem using × 1 + k 1 k 1 ˆ 2 the general construction. For a less trivial illustration of the σk−1 general construction for FGPQ, we once again consider the − (x¯ − t)2 nk/2 1 common mean problem, the subject of Examples 8 and 9. × 1 + k . ˆ 2 The FGPQ for µ in Example 9, obtained using the two-stage σk construction method, is by no means unique. An application of Setting bi = 0fori = 1,...,k − 1, yields the required fiducial the general construction yields a different FGPQ for µ. pdf for µ. This is given by ¯ = Consider the invertible pivotal relationship given by Xi 2 n1/2 2 nk−1/2 + 2 − 2 = 2 = (x¯1 − t) (x¯k−1 − t) µ σi /niEi, (ni 1)Si σi Ek+i,fori 1,...,k, where for f (t) = C 1 + ··· 1 + i = 1,...,k, E ∼ N(0, 1), E + ∼ χ2 , and E , i = 1,...,2k σˆ 2 σˆ 2 i k i ni−1 i 1 k−1 ¯ ¯ 2 2 are jointly independent. Let S = (X ,...,X , S ,...,S ) and − 1 k 1 k (x¯ − t)2 nk/2 1 E = (E ,...,E ).Lets = (x¯ ,...,x¯ , s2,...,s2) denote a re- × 1 + k . (10) 1 2k 1 k 1 k ˆ 2 alization of S, and let e be the corresponding realization of E. σk Hannig, Iyer, and Patterson: Fiducial Generalized Confidence Intervals 265

AFGPQRµ corresponding to this fiducial density may be Fisher’s method is an exact method, but the two GCI meth- obtained using the general construction. However, it is quite ods are only asymptotically exact. Nevertheless, we see that the convenient to use the fiducial framework when deriving the ac- small-sample performance of the two GCI methods are quite tual GCI. The FGPQ in this example does not appear to satisfy satisfactory, and they appear to be more efficient than Fisher’s the conditions of Theorem 1, but we can still prove directly that method. We also see that the GCI using the general construc- confidence intervals based on Rµ are asymptotically correct. tion performs slightly better than the two-stage GCI in terms of This is the message of the next proposition, the proof of which confidence interval expected length. is given in Appendix A. 6. CONNECTION BETWEEN A FIDUCIAL Proposition 3. Let all n1,...,nk approach infinity in such GENERALIZED PIVOTAL QUANTITY FOR A a way that cj = lim nj/(n1 +···+nk) exists and 0 < cj < 1. PARAMETER AND ITS FIDUCIAL DISTRIBUTION Then the 100(1 − α)% confidence interval for µ based on the R Fraser (1961) considered fiducial inference for a normal FGPQ µ corresponding to the fiducial density in (10) has as- mean µ with unit variance and outlined a method for provid- − ymptotically 100(1 α)% frequentist coverage. ing a interpretation for the fiducial distribution of µ. The unpublished technical report of Patterson et al. (2004a) He stated the following: gives the results of a detailed simulation study comparing three Let µ designate possible values for the parameter relative to an observed x¯;.... The statistical problem admits free translation on the x¯ axis. Consider a very different procedures for constructing confidence intervals for large number of samples from normal distributions with the specifications of the common mean problem: Fisher’s method [PQ1 in (6)], this example. In each case translate the sample mean....Theparametervalues GCI using the two-stage FGPQ, and the GCI using the general will be correspondingly translated. Simple mathematics then shows that the fre- quency distribution of these translated means µ is normal with center at x¯ and FGPQ construction. Fisher’s method was chosen based on the with unit . There is thus a of parameter results of Yu et al. (1999), which indicate that Fisher’s method values µ that might have produced the observed x¯. performs as well as or better than competing procedures in most What Fraser proposed in his 1961 article could be written, in situations. the notation of GPQs, as Table 1 gives a representative subset of the results from R = R S S = ¯ − ¯  − Patterson et al. (2004a). We report the case of k = 3 (three labo- µ µ( , ,ξ) X (X µ). (11) ratories) for selected sample sizes and standard deviations. The This becomes clear by noting that when Fraser considers first row for each scenario gives the empirical coverage proba- “a very large number of samples with the specifications of this bility for the upper confidence bound corresponding to a stated example,” he is actually considering the random variable X¯ , coverage of .95. The second row gives the empirical expected which is an independent copy of X¯ .Histranslated means are value of the absolute distance of the upper bound from the true produced by translating X¯ by an amount equal to E = X¯  − µ.  parameter value for each of the three methods. Smaller values Viewed thusly, Fraser’s µ is exactly the Rµ defined in (11). of the expected absolute distance imply greater efficiency of the When he describes the distribution of µ, he is essentially de- method. In Table 1 this expected absolute distance is referred to scribing the distribution of Rµ and the frequency interpretation as “expected length.” for the fiducial distribution of a parameter is then automatic.

Table 1. Empirical Coverages and Expected Lengths for Fisher’s Exact Upper Confidence Bound and Two Generalized Upper Bounds

n1 n2 n3 σ1 σ2 σ3 Two-stage General FGPQ Fisher 7 20 12 1.00 1.00 1.00 .9375 .9390 .9510 (.2791) (.2769) (.3025) 7 20 12 1.00 .10 16.00 .9450 .9440 .9510 (.1231) (.1216) (.1546) 10 10 10 1.00 1.00 1.00 .9345 .9340 .9480 (.3203) (.3189) (.3460) 50 50 50 1.00 1.00 1.00 .9470 .9447 .9490 (.1398) (.1393) (.1473) 50 50 50 1.00 .10 10.00 .9460 .9455 .9490 (.0728) (.0724) (.0866) 40 30 50 1.00 .10 10.00 .9485 .9483 .9493 (.0930) (.0925) (.1095) 40 30 50 1.00 10.00 .10 .9490 .9485 .9517 (.0731) (.0726) (.0957) 7 20 8 1.00 .25 1.00 .9435 .9443 .9485 (.1812) (.1785) (.2048) 100 80 90 1.00 .25 .25 .9517 .9503 .9527 (.0602) (.0598) (.0643) 100 80 90 1.00 16.00 .20 .9517 .9505 .9527 (.0717) (.0712) (.0849) 100 100 100 1.00 1.00 1.00 .9530 .9527 .9525 (.0974) (.0969) (.1020)

NOTE: In each of the last three columns, two numbers are reported for each combination of sample sizes and standard deviations. The first of these numbers is the empirical coverage probability, corresponding to a claimed coverage of .95. The second number, given in parentheses, is the empirical average absolute distance from the upper confidence bound to the true mean. 266 Journal of the American Statistical Association, March 2006

In general, FGPQs allow us to associate a distribution with arise from the particular invertible pivotal representation cho- a parameter θ akin to a fiducial distribution for θ.Forex- sen or from the particular choice of an ancillary statistic or both. ample, when considering an upper confidence bound for a In this article we have demonstrated that the same nonunique- scalar parameter θ based on Rθ , we find C(s, 1 − α) such that ness issues also arise for GCIs. What is noteworthy is that our  P(Rθ (s, S ,ξ)

k √ √ because both the numerator and the denominator converge uniformly S − + S S ¯ ¯ 2 2 ∈ A A g1,j,n( ,ξ) n Sj tj(ξ) nRn( , ,ξ) for (X1,...,Xk, S1,...,Sk ) .Alsoon , the limit of the denomi- j=1 nator is uniformly bounded away from 0, and the limit of the numerator is uniformly bounded away from 0, the uniform convergence in (A.7) follows by standard arguments. Note also that in (A.7), the quantity in ≤ C (S,γ)+ o(1) H the limit is nonzero. 268 Journal of the American Statistical Association, March 2006

Similar, but somewhat more complicated calculations show that Assumption A . ∈ Rk 2 ¯ − ¯ 2 2 ¯ − ¯ 2 1. Assume that there exists t(ξ) such that (nj/Sj ) l =j nl(Xj Xl)/Sl (cj/Sj ) l =j cl(Xj Xl)/Sl → √ D k 2 2 2 k 2 2 2  −  − → =  ( i=1 ni/Si ) σj ( i=1 ci/Si ) σj n S1 t1(ξ), . . . , Sk tk(ξ) N (N1,...,Nk) , uniformly on A.Sog1,j satisfy conditions 2(a) and 2(b) of Assump- where N has a nondegenerate distribution multivariate normal tion A. Finally, routine, though tedious, calculations show that the sec- distribution. R S S 2 2 ond partial derivatives of µ( , ,µ,σ1 ,...,σk ) are bounded on 2. Assuming existence and continuity of second partial derivatives A   the neighborhood . The statement then follows from Theorem 1. with respect to s of Rθ,l(s, s ,ξ), we have the following one- Proof of Proposition 3 term Taylor expansion with a remainder term: √ Assume that ni/n → ci ∈ (0, 1),wheren = ni.Wehave n(Sn − k D   2 2 Rθ,l(s, s ,ξ)= g0,l,n(s,ξ)+ g1,l,j,n(s,ξ) s − tj(ξ) m) → DZ,wherem = (µ,...,µ,σ ,...,σ ), Z = (Z ,...,Z k) are j 1 k 1 2 = iid N(0, 1) variables and D is a diagonal matrix given by j 1 √ √ + R (s, s,ξ). (B.1) σ σ σ 2 2 σ 2 2 l,n D = diag √1 ,..., √k , √1 ,..., √k . c1 ck c1 ck Here By Skorokhod’s theorem (see Billingsley 1995), we can find a se- g (s,ξ)= R s, t(ξ), ξ and S¯ S∗ S¯ 0,l,n θ,l quence √n independent of such that n has the same distribution S S¯ − → Z as and n( n m) D almost surely. = ∂ R  g1,l,j,n(s,ξ)  θ,l(s, s ,ξ) . Close examination of the density in (10) shows that, conditionally ∂s = ¯ j s t(ξ) on Sn, ⊂ Rk k √ Suppose that A is an open set containing t(ξ) with the fol- √ j=1 Zj cj/σj 1 n R (S¯ , S,ξ)− µ → N , a.s. lowing properties: µ n k 2 k 2 ∈ j=1 cj/σj j=1 cj/σj (a) The functions g1,l,j,n(s,ξ)converge uniformly in s A to a function g (s,ξ)continuous at s = t(ξ). R S ≤ 1,l,j If C(s, n) is chosen so that limn→∞ Pξ ( µ(s, ,ξ) C(s, (b) The matrix n)) = α, then we can prove in a similar way as in the proof of Theo-   ··· rem 1, g1,1,1(s,ξ) g1,1,k(s,ξ)   √ =  . . .  k J(s) . .. . ¯ 1 j=1 Zj cj/σj zα C(Sn, n) = µ + √ + g (s,ξ) ··· g (s,ξ) n k 2 k 2 1/2 1,d,1 1,d,k j= cj/σ ( j= cj/σ ) 1 j 1 j ∈ is of rank d for all s A. √ 1 = S + o √ , (A.8) (c) For each l 1,...,d, the remainder nRl,n(s, , n Pξ ξ)→ 0 uniformly in s on the open neighborhood A of t(ξ). where zα is the α quantile of the standard . Finally, 3. We consider a collection of open regions, C(X,γ) ⊂ Rd with S¯ S because the distribution of and are the same, (A.8) implies that λ(∂C(X,γ)) = 0 (i.e., the boundary has zero Lebesgue mea- ∈ P µ

Theorem B.1. Suppose that Assumption A holds and that γn → APPENDIX B: MULTIVARIATE VERSION γ ∈ (0, 1).Then OF THEOREM 1  lim Pξ θ = Rθ (S, S,ξ)∈ Cn Rθ (S, S ,ξ),γn = γ. In what follows we need the following notation. n→∞  In particular, Cn(R (S, S ,ξ),γ)is a confidence region for θ with as- Definition B.1. Open sets An converge to an open set A (i.e., θ ◦ ymptotic coverage probability equal to γ . An → A)if(lim An) = A. Here lim An = B exists if IA → IB, IA is ◦ n the indicator function of A and B is the interior of B. Remark B.1. If d = 1, then an example of C(X,γ) = (−∞, Let us consider a parametric statistical problem where we observe q(X,γ)),whereq(X,γ) is the γ -quantile of the distribution of X. If d > 1, then we can consider many different regions, for example, a X1,...,Xn, whose joint distribution belongs to some family of contin- p cubical equal-tailed region. uous distributions parameterized by ξ ∈ ⊂ R .LetS = (S1,...,Sk) denote a statistic based on the Xi’s. In theory, we can consider an in- Remark B.2. The various conditions stated in Assumption A could    dependent copy of X1,...,Xn and denote the statistic based on Xi ’s be weakened. For example, we do not have to assume that the limit- S R S S = by . Finally, suppose that a vector-valued function θ ( , ,ξ) ing random variable H is normal. Assumption A .3 and the proof of R S S R S S ( θ,1( , ,ξ),..., θ,d( , ,ξ)) is available that is a FGPQ for a Theorem B.1 would then have to be modified accordingly. parameter θ = π(ξ) ∈ Rd, d ≤ k. In addition, assume that the following holds. [Received November 2004. Revised May 2005.] Hannig, Iyer, and Patterson: Fiducial Generalized Confidence Intervals 269

REFERENCES Iyer, H. K., and Patterson, P. L. (2002), “A Recipe for Constructing Generalized Pivotal Quantities and Generalized Confidence Intervals,” Technical Report Barnard, G. A. (1977), “Pivotal Inference and the Bayesian Controversy,” Bul- 2002/10, Colorado State University, Dept. of Statistics. letin of the International Statistical Institute, 47, 543Ð551. Iyer, H. K., Wang, C. M., and Mathew, T. (2004), “Models and Confidence (1981), “A Coherent View of for Public Science,” Intervals for True Values in Interlaboratory Trials,” Journal of the American presented at Symposium on Statistical Inference and Applications, University Statistical Association, 99, 1060Ð1071. of Waterloo, August 1981. Johnson, N. L., and Kotz, S. (1970), Continuous Univariate Distributions 2, (1982), “Discussion: The Functional-Model Basis of Fiducial Infer- Boston: Houghton Mifflin. ence,” The Annals of Statistics, 10, 1068Ð1069. Jordan, S. M., and Krishnamoorthy, K. (1996), “Exact Confidence Intervals for (1995), “Pivotal Models and the Fiducial Argument,” International the Common Mean of Several Normal Populations,” Biometrics, 52, 77Ð86. Statistical Review, 63, 309Ð323. Krishnamoorthy, K., and Lu, Y. (2003), “Inferences on the Common Mean Behrens, W.-V. (1929), “Ein Beitrag zur Fehlerberchnung bei Wenigen of Several Normal Populations Based on the Generalized Variable Method,” Beobachtungen,” Landwirtschaftliche Jahrbucher, 68, 807Ð837. Biometrics, 59, 237Ð247. Billingsley, P. (1995). Probability and Measure (3rd ed.), New York: Wiley. Krishnamoorthy, K., and Mathew, T. (2003), “Inferences on the Means of i Log- Bross, I. (1950), “Fiducial Intervals for Variance Components,” Biometrics,6, normal Distributions Using Generalized p-Values and Generalized Confi- 136Ð144. dence Intervals,” Journal of Statistical Planning and Inference, 115, 103Ð121. Burdick, R. K., Borror, C. M., and Montgomery, D. C. (2005), Design and Linnik, J. V. (1968), Statistical Problems With Nuisance Parameters, translated Analysis of Gauge R&R Studies: Making Decisions With Confidence Intervals from the 1966 Russian edition, Providence, RI: American Mathematical So- in Random and Mixed ANOVA Models, Philadelphia: ASA/SIAM. ciety. Burdick, R. K., and Park, D. J. (2003), “Performance of Confidence Inter- Mathew, T., and Krishnamoorthy, K. (2004), “One-Sided Tolerance Limits in vals in Regression Models With Unbalanced One-Fold Nested Error Struc- Balanced and Unbalanced One-Way Random Models Based on Generalized tures,” Communications in Statistics, Part B—Simulation and Computation, Confidence Limits,” Technometrics, 46, 44Ð52. 32, 717Ð732. McNally, R. J., Iyer, H. K., and Mathew, T. (2001), “Tests for Individual Burdick, R. K., Park, Y.-J.,Montgomery, D. C., and Borror, C. M. (2005), “Con- and Population Bioequivalence Based on Generalized p Values,” Statistics fidence Intervals for Misclassification Rates in a Gauge R&R Study,” Journal in Medicine, 22, 31Ð53. of Quality Technology, to appear. Patterson, P. L., Hannig, J., and Iyer, H. K. (2004a), “Fiducial Generalized Con- Casella, G., and Berger, R. L. (2002), Statistical Inference (2nd ed.), Belmont, fidence Intervals for the Common Mean of K Normal Populations,” Technical CA: Duxbury Press. Report 2004/10, Colorado State University, Dept. of Statistics. Chang, Y.-P., and Huang, W.-T. (2000), “Generalized Confidence Intervals for (2004b), “Fiducial Generalized Confidence Intervals for Proportion of the Largest Value of Some Functions of Parameters Under Normality,” Statis- Conformance,” Technical Report 2004/11, Colorado State University, Dept. tica Sinica, 10, 1369Ð1383. of Statistics. Chiang, A. K. L. (2001), “Simple General Method for Constructing Confi- Peterson, J. J., Berger, V., and Weerahandi, S. (2003), “Generalized P-Values dence Intervals for Functions of Variance Components,” Technometrics, 43, and Confidence Intervals: Their Role in Statistical Methods for Pharmaceuti- 356Ð367. cal Research and Development,” technical report. Chou, Y., and Owen, D. B (1984), “One-Sided Confidence Regions on the Up- Rao, C. R. (1973), Linear Statistical Inference and Its Applications,NewYork: per and Lower Tail Areas of Normal Distribution,” Journal of Quality Tech- Wiley. nology, 16, 150Ð158. Roy, A., and Mathew, T. (2005), “A Generalized Confidence Limit for the Re- Cochran, W. G. (1964), “Approximate Significance Levels of the BehrensÐ liability Function of a Two-Parameter Exponential Distribution,” Journal of Fisher Test,” Biometrics, 20, 191Ð195. Statistical Planning and Inference, 128, 509Ð517. Daniels, L., Burdick, R. K., and Quiroz, J. (2005), “Confidence Intervals in a Satterthwaite, F. E. (1942), “A Generalized ,” Annals of Gauge R&R Study With Fixed Operators,” Journal of Quality Technology, , 13, 34Ð41. 37, 179Ð185. (1946), “An Approximate Distribution of Estimates of Variance Com- Dawid, A. P., and Stone, M. (1982), “The Functional-Model Basis of Fiducial ponents,” Biometrics Bulletin, 2, 110Ð114. Inference,” The Annals of Statistics, 10, 1054Ð1067. Stein, C. (1959), “An Example of Wide Discrepancy Between Fiducial and Fisher, R. A. (1935), “The Fiducial Argument in Statistical Inference,” Annals Confidence Intervals,” The Annals of Mathematical Statistics, 30, 877Ð880. of Eugenics, VI, 91Ð98. Sukhatme, B. V. (1958), “Testing the Hypothesis That Two Populations Differ (1939), “Samples With Possibly Unequal Variances,” Annals of Eu- Only in Location,” The Annals of Mathematical Statistics, 29, 60Ð78. genics, IX, 174Ð180. Tsui, K. W., and Weerahandi, S. (1989), “Generalized p-Values in Significance (1961a), “ the Reference Set,” Sankhya¯, Ser. A, 23, 3Ð8. Testing of Hypotheses in the Presence of Nuisance Parameters,” Journal of (1961b), “The Weighted Mean of Two Normal Samples With Un- the American Statistical Association, 84, 602Ð607. known Variance Ratio,” Sankhya¯, Ser. A, 23, 103Ð114. Tukey, J. W. (1951), “Components in Regression,” Biometrics, 7, 33Ð69. (1970), Statisitical Methods for Research Workers (14th ed.), Oxford, U.S. Food and Drug Administration (2001), “Guidance to Industry: Statistical U.K.: Oxford University Press. Approaches to Establishing Bioequivalence,” Rockville, MD: U.S. Depart- Fraser, D. A. S. (1961), “On Fiducial Inference,” The Annals of Mathematical ment of Health and Human Services, Food and Drug Administration, Center Statistics, 32, 661Ð676. for Drug Evaluation and Research (CDER). (1966), “Structural Probability and a Generalization,” Biometrika, 53, Wang, C. M., and Lam, C. T. (1996), “Confidence Limits for Proportion of 1Ð9. Conformance,” Journal of Quality Technology, 28, 439Ð445. (1968), The Structure of Inference, New York: Krieger. Weerahandi, S. (1991), “Testing Variance Components in Mixed Models With Graybill, F. A., and Wang, C. M. (1980), “Confidence Intervals on Nonnega- Generalized p-Values,” Journal of the American Statistical Association, 86, tive Linear Combinations of Variances,” Journal of the American Statistical 151Ð153. Association, 75, 869Ð873. (1993), “Generalized Confidence Intervals,” Journal of the American Grundy, P. M. (1956), “Fiducial Distributions and Prior Distributions: An Ex- Statistical Association, 88, 899Ð905. ample in Which the Former Cannot Be Associated With the Latter,” Journal (1995), Exact Statistical Methods for Data Analysis,NewYork: of the Royal Statistical Society, Ser. B, 18, 217Ð221. Springer-Verlag. Hamada, M., and Weerahandi, S. (2000), “Measurement System Assessment (2004), Generalized Inference in Repeated Measures,NewYork:Wi- via Generalized Inference,” Journal of Quality Technology, 32, 241Ð253. ley. Hannig, J. (1996), “On Conditional Distributions as Limits of Martingales,” Welch, B. L. (1947), “The Generalization of ‘Student’s’ Problem When Several unpublished dissertation, Charles University, Prague [in Czech]. Different Population Variances Are Involved,” Biometrika, 34, 29Ð35. (2005), “On Multidimensional Fiducial Generalized Confidence Inter- Wilkinson, G. N. (1977), “On Resolving the Controversy in Statistical Infer- vals,” Technical Report 2005/1, Colorado State University, Dept. of Statistics. ence,” Journal of the Royal Statistical Society, Ser. B, 39, 119Ð171. Healy, M. J. R. (1963), “Fiducial Limits for a Variance Component,” Journal of Williams, J. S. (1962), “A Confidence Interval for Variance Components,” the Royal Statistical Society, Ser. B, 25, 128Ð130. Biometrics, 49, 278Ð281. Iyer, H. K., and Mathew, T. (2002), Comments on “Simple General Method for Yu, P. L. H., Sun, Y., and Sinha, B. K. (1999), “On Exact Confidence Intervals Constructing Confidence Interval for Functions of Variance Components” by for the Common Mean of Several Normal Populations,” Journal of Statistical A. K. L. Chiang, Technometrics, 44, 284Ð285. Planning and Inference, 81, 263Ð277.