Stat 609: Mathematical Statistics Lecture 24

Stat 609: Mathematical Statistics Lecture 24

Lecture 24: Completeness Definition 6.2.16 (ancillary statistics) A statistic V (X) is ancillary iff its distribution does not depend on any unknown quantity. A statistic V (X) is first-order ancillary iff E[V (X)] does not depend on any unknown quantity. A trivial ancillary statistic is V (X) ≡ a constant. The following examples show that there exist many nontrivial ancillary statistics (non-constant ancillary statistics). Examples 6.2.18 and 6.2.19 (location-scale families) If X1;:::;Xn is a random sample from a location family with location parameter m 2 R, then, for any pair (i;j), 1 ≤ i;j ≤ n, Xi − Xj is ancillary, because Xi − Xj = (Xi − m) − (Xj − m) and the distribution of (Xi − m;Xj − m) does not depend on any unknown parameter. Similarly, X(i) − X(j) is ancillary, where X(1);:::;X(n) are the order statistics, and the sample variance S2 is ancillary. beamer-tu-logo UW-Madison (Statistics) Stat 609 Lecture 24 2015 1 / 15 Note that we do not even need to obtain the form of the distribution of Xi − Xj . If X1;:::;Xn is a random sample from a scale family with scale parameter s > 0, then by the same argument we can show that, for any pair (i;j), 1 ≤ i;j ≤ n, Xi =Xj and X(i)=X(j) are ancillary. If X1;:::;Xn is a random sample from a location-scale family with parameters m 2 R and s > 0, then, for any (i;j;k), 1 ≤ i;j;k ≤ n, (Xi − Xk )=(Xj − Xk ) and (X(i) − X(k))=(X(j) − X(k)) are ancillary. If V (X) is a non-trivial ancillary statistic, then the the set fx : V (x) = vg does not contain any information about q. If T (X) is a statistic and V (T (X)) is a non-trivial ancillary statistic, it indicates that the reduced data set by T contains a non-trivial part that does not contain any information about q and, hence, a further simplification of T may still be needed. A sufficient statistic T (X) appears to be most successful in reducing the data if no nonconstant function of T (X) is ancillary or beamer-tu-logo even first-order ancillary, which leads to the following definition. UW-Madison (Statistics) Stat 609 Lecture 24 2015 2 / 15 Definition 6.2.21 (completeness) Let X be a sample from a family indexed by q 2 Θ (the parameter space) and let Eq and Pq be the expectation and probability, respectively, calculated with respect to a q 2 Θ. A statistic T (X) is complete iff for any function g not depending on q, Eq [g(T )] = 0 for all q 2 Θ implies Pq (g(T ) = 0) = 1 for all q 2 Θ. A statistic T is boundedly complete iff the previous statement holds for any bounded g. The family of distributions corresponding to a statistic T is complete (or boundedly complete) iff T is complete (or boundedly complete). A complete statistic is boundedly complete. If T is complete (or boundedly complete) and S = y(T ) for a measurable y, then S is complete (or boundedly complete). It can be shown that a complete and sufficient statistic is minimal sufficient (Theorem 6.2.28). beamer-tu-logo A minimal sufficient statistic is not necessarily complete. UW-Madison (Statistics) Stat 609 Lecture 24 2015 3 / 15 Example 6.2.15. In this example, X1;:::;Xn is a random sample from uniform(q;q + 1), q 2 R, and we showed that T = (X(1);X(n)) is the minimal sufficient statistic for q. We now show that T is not complete. Note that V (T ) = X(n) − X(1) = (X(n) − q) − (X(1) − q) is in fact ancillary. Its distribution can be obtained using the result in Example 5.4.7, but we do not need that for arguing that T is not complete. It is easy to see that Eq (V ) exists and it does not depend on q since V is ancillary. Letting c = E(V ), we see that Eq (V − c) = 0 for all q. Thus, we have a function g(x;y) = x − y − c such that Eq [g(X(1);X(n))] = Eq (V − c) = 0 for all q but Pq (g(X(1);X(n)) = 0) = Pq (V = c) 6= 0. This shows that T is not complete. If a minimal sufficient statistic T is not complete, then there is a non-trivial first order ancillary statistic V (T ); beamer-tu-logo there does not exist any complete statistic. UW-Madison (Statistics) Stat 609 Lecture 24 2015 4 / 15 Example 6.2.22. Let T ∼ binomial(n;q), 0 < q < 1, (note that T is a sufficient statistic based on a random sample of size n from binomial(1;q)). If g is a function such that Eq [g(T )] = 0 for all q, then n n t n−t 0 = Eq [g(T )] = ∑ g(t) q (1 − q) t=0 t n n q t = (1 − q)n ∑ g(t) all q 2 (0;1) t=0 t 1 − q Since the factor (1 − q)n 6= 0, we must have n n q 0 = ∑ g(t) jt all j > 0; j = t=0 t 1 − q The last expression is a polynomial in j of degree n. For this polynomial to be 0 for all j > 0, it must be true that the t n coefficient of j , which is g(t) t , is 0 for every t. This shows that g(t) = 0 for t = 0;1;:::;n and hence Pq (g(T ) = 0)beamer-tu-logo = 1 for all q, i.e., T is complete. UW-Madison (Statistics) Stat 609 Lecture 24 2015 5 / 15 Example 6.2.23. Let X1;:::;Xn be iid from uniform(0;q), q > 0, with pdf 1 0 < x < q;i = 1;:::;n 1 0 < x < q f (x) = i = (n) q 0 otherwise 0 otherwise By the factorization theorem, X(n) is sufficient for q. Using the result for the order statistics in Chapter 5, X(n) has pdf ntn−1q −n 0 < t < q f (t) = X(n) 0 otherwise For any g, Z q −n n−1 Eq [g(X(n))] = nq g(t)t dt = 0 all q > 0 0 implies that Z q d Z q 0 = g(t)tn−1dt and 0 = g(t)tn−1dt = g(q)q n−1 all q > 0 0 dq 0 Thus, g(t) = 0 for all t > 0, which means that P (g(T ) = 0) = 1 for all q. q beamer-tu-logo Hence, X(n) is also complete. UW-Madison (Statistics) Stat 609 Lecture 24 2015 6 / 15 The relationship between an ancillary statistic and a complete and sufficient statistic is characterized in the following result. Theorem 6.2.24 (Basu’s theorem) Let V and T be two statistics of X from a population indexed by q 2 Θ. If V is ancillary and T is boundedly complete and sufficient for q, then V and T are independent with respect to Pq for any q 2 Θ. Proof. Let B be an event on the range of V and A an event on the range of T . From the 3rd definition of the independence of random variables, we only need to show that −1 −1 −1 −1 Pq (T (A) \ V (B)) = Pq (T (A))Pq (V (B)); q 2 Θ −1 Since V is ancillary, Pq (V (B)) = PB does not depend on q. As T is sufficient, Eq [IB(V )jT ] = hB(T ) is a function of T (not depending on q), where IB(V ) is the indicator function of fV 2 Bg. Since −1 beamer-tu-logo Eq [hB(T )] = Eq fE[IB(V )jT ]g = Eq fIB(V )g = Pq (V (B)) = PB q 2 Θ; UW-Madison (Statistics) Stat 609 Lecture 24 2015 7 / 15 by the bounded completeness of T , Pq (hB(T ) = PB) = 1 q 2 Θ Then the result follows from −1 −1 Pq (T (A) \ V (B)) = Eq fEq [IA(T )IB(V )jT ]g = Eq fIA(T )Eq [IB(V )jT ]g = Eq fIA(T )PBg = PBEq fIA(T )g −1 −1 = Pq (T (A))P(V (B)) q 2 Θ If a minimal sufficient statistic T is not complete, then there may be an ancillary statistic V such that V and T are not independent. An example is in Example 6.2.15, T = (X(1);X(n)) is minimal sufficient but not complete, and T and the ancillary statistic V = X(n) − X(1) is not independent. Basu’s theorem is useful in proving the independence of two statistics. beamer-tu-logo We first state without proof the following useful result. UW-Madison (Statistics) Stat 609 Lecture 24 2015 8 / 15 Theorem 6.2.25 (complete statistics in exponential families) Let X = (X1;:::;Xn) be a random sample with pdf k ! fq (x) = h(x)c(q)exp ∑ wj (q)tj (x) q 2 Θ; j=1 Then the statistic n n ! T (X) = ∑ t1(Xi );:::; ∑ tk (Xi ) i=1 i=1 is complete as long as Θ contains an open set in Rk (i.e., the family of distributions is of full rank). Note that T is also sufficient for q (without requiring any condition on Θ). Compared with the result of minimal sufficient statistics in curved exponential families, the condition on Θ in this theorem is stronger. We illustrate the application of Basu’s theorem and Theorem 6.2.25beamer-tu-logo in the normal distribution family.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    15 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us