Robust Bayes and Empirical Bayes Analysis with &Epsilon

Robust Bayes and Empirical Bayes Analysis with # -Contaminated Priors James Berger; L. Mark Berliner The Annals of Statistics, Vol. 14, No. 2. (Jun., 1986), pp. 461-486. Stable URL: http://links.jstor.org/sici?sici=0090-5364%28198606%2914%3A2%3C461%3ARBAEBA%3E2.0.CO%3B2-J The Annals of Statistics is currently published by Institute of Mathematical Statistics. Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use. Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at http://www.jstor.org/journals/ims.html. Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. JSTOR is an independent not-for-profit organization dedicated to and preserving a digital archive of scholarly journals. For more information regarding JSTOR, please contact [email protected]. http://www.jstor.org Mon May 21 10:06:28 2007 i\iiii<~Ls<~,fStnt~stirx I!JHli. VoI. 14, No. 9, 461-4Rf; ROBUST BAYES AND EMPIRICAL BAYES ANALYSIS WITH E-CONTAMINATEDPRIORS Purdue University and Ohio State University For Bayesian analysis, an attractive method of modelling uncertainty in the prior distribution is through use of E-contaminationclasses, i.e., classes of distributions which have the form li = (1 - E)$ + ~q,110 being the base elicited prior, q being a "contamination," and E reflecting the amount of error in T,, that is deemed possible. Classes of contaminations that are considered include (i) all possible contaminations, (ii) all symmetric, unimodal contaminations, and (iii) all contaminations such that li is unimodal. Two issues in robust Bayesian analysis are studied.The first is that of determining the range of posterior probabilities of a set as li ranges over the E-contamination class. The second, more extensively studied, issue is that of selecting, in a data dependent fashion, a "good" prior distribution (the Type-I1 maximum likelihood prior) from the E-contaminationclass, and using this prior in the subsequent analysis. Relationships and applications to empirical Bayes analysis are also discussed. 1. Introduction. The most frequent criticism of subjective Bayesian analysis is that it supposedly presumes an ability to completely and accurately quantify subjective information in terms of a single prior distribution. However, there has long existed [at least since Good (1950)l a robust Bayesian viewpoint which assumes only that subjective information can be quantified in terms of a class I' of possible distributions. The goal is then to make inferences or decisions which are robust over I', i.e., which are relatively insensitive (or at least are satisfactory) to deviations as the prior distribution varies over I'. We will not consider the philosophical or pragmatic reasons for adopting this viewpoint. Such a discussion, along with a review of the area, may be found in Berger (1984). (We also do not mean to imply that the single prior Bayesian approach is necessarily bad; it usually works very well.) Related to this are various forms of empirical Bayes analysis [cf. Morris (1983) for discussion and review], in which the prior distribution is also assumed to belong to some class I' of distributions. Indeed, Section 5 considers some familiar empirical Bayes problem from our perspective. Also, see Berger and Berliner (1984). Before discussing'implementation of the robust Bayesian viewpoint, some notation is helpful. Let .Xdenote the observable random variable (or vector), which will (for simplicity) be assumed to have a density f(xl0) (w.r.t. some Received November 1983; revised October 1985. ' Itesearch supported by the National Science Foundation under Grants MCS-8101670A1 and IIMS-8401996. 'Research supported by the Office of Naval Research under Contract N00014-84-K-0422. AMS 1980 subject classtfications. Primary 62A15; secondary 62F15. Key words andphrases. Robust Bayes, empirical Bayes, classes of priors, E-contamination,type I1 maximum likelihood, hierarchical priors. 462 .I. RERGEIt ANI) I,. M. HEI<I,INEI< measure), where 6' is an unknown parameter lying in a parameter space O. A prior distribution on O will be denoted by n (later, in examples, n will be used to denote either a prior or its corresponding density), and the resulting marginal density of X is given by The,posterior distribution of 6' given x (assuming it exists) will be denoted by n(. Ix) and, in nice situations, is defined by Finally, let 9 denote the space of all probability distributions on O. The class, T, of prior distributions to be considired in this paper, is the &-contaminationclass; namely, (1.1) r = {n: n = (1 - &)no+ ~q,q €21, where 0 I E I1 is given, no is a particular prior distribution, and 9 is some subset of .P. There are several reasons for consideration of this class. First, and foremost, it is a sensible class to consider in light of the prior elicitation process. The extensive and rapidly developing methodology on prior elicitation [cf. Kadane et al. (1980)l makes specification of an initial believable prior, q,,an attractive starting point. However, in determining n,, sensibly, one will make probability judgements about subsets of O, judgements which could be in error by some amount E. Stated another way, further reflection might lead to alterations of probability judgements by an amount E. Hence, possible priors involving such alterations should be included in T. Many classes of priors which have been considered are not sensible from the above viewpoint. For instance, classes of priors involving restrictions on moments force severe restrictions on the allowable prior tails. This makes little sense from the elicitation viewpoint, since the tails of a prior involve very small probabilities and are, therefore, nearly impossible to determine. Similarly, classes of conjugate priors are too limited, particularly in their inflexible tail behavior [cf. Berger (1985)l. Two other major reasons for choosing r as in (1.1) are (i) such r are (as we shall see) surprisingly easy to work with; and (ii) such r are very flexible through choice of 9. In this paper we will restrict consideration to four interesting choices of 9. First, in Section 2 the choice 9 = B (all distributions) will be considered. This choice is easy to work with and is, in some sense, conservative. In Section 3 we consider the class, 2, of all contaminations which are symmetric and unimodal. This is again very easy to work with. In Section 4 we consider the class of all contaminations such that the resulting n is unimodal (assuming that q,is unimodal). It came as a great surprise to us that such a complicated class could be worked with and provide reasonably simple answers. Finally, in Section 5 we consider 3 that are mixtures of various classes. The purpose of the section is to show how easily mixed contaminations can be dealt with and also to apply the methodology in some typical empirical Bayes situations. ROBUST BAYESIAN ANAI,YSIS 463 Other articles that have used &-contamination classes of priors include Schneeweiss (1964), Blum and Rosenblatt (1967), Huber (1973), Marazzi (1985), Bickel (1984), and Berger (1982,1984). Except for Huber (1973), these articles work within the frequentist Bayesian framework, whereas our approach will be almost entirely conditional Bayesian. Huber (1973) is discussed below and in Section 2.4. There is a substantial literature working with other types of classes of priors [cf. Leamer (1978) and DeRobertis and Hartigan (1981)], and with the very related idea of "upper" and "lower" probabilities. Berger (1984) contains considerable review and discussion of this literature. We strongly prefer the class in (1.1) for intuitive content and ease of analysis. The ideal analysis, to a robust Bayesian, is one in which it can be shown that the inference or decision to be made is essentially the same for any prior in r. [Indeed, it can be argued-see Berger (1984,1985)-that this is the only way in which a statistical conclusion can claim to be ultimately sound.] What is needed, to provide such conclusions, is essentially the ability to find minimums and maximums of criterion functions as 71 ranges over I'. We illustrate this approach in Section 2.4, where, for 2 = {all distributions), the range of posterior probabilities of a (fixed) set C is given [essentially following Huber (1973)l. This allows finding the range of posterior probabilities of confidence sets and the range of posterior probabilities of hypotheses, for such I'. Unfortunately, there are certain inadequacies in assuming that 2 = {all distributions} (see Section 2.3), and attempting the above program with more reasona- ble r (such as that in Section 4) becomes more difficult. A number of alternative approaches to the problem of dealing with classes of priors have thus been proposed, essentially leading to the choice of a single "robust" prior, decision, or inference. Berger (1984,1985) discusses various of these methods, including the appealing technique of putting a prior distribution on I?. [Such a prior is called a hyperprior or a Type I1 probability distribution by Good (1965,1980) and a second-stage prior in certain situations by Lindley and Smith (1972).] Of course, this corresponds to using a certain single prior (the "average" over r), but one would suspect that the resulting Bayes rule would be quite robust with respect to I'.

Load more