9 U-STATISTICS
9 U-STATISTICS
Suppose X 1, X 2, . . . , X n are P i.i.d. with CDF F . Our goal is to estimate the expectation 2P t (P)=Eh(X 1, X 2, . . . , X m ). Note that this expectation requires more than one X in contrast to 2 EX , EX , or Eh(X ). One example is E X 2 X 1 or P((X 1, X 2) S). | | 2 For Eh(X 1), the empirical estimator is asymptotically optimal. Now, U-statistics general- ize the idea. The advantage of using U-statistics is unbiasedness.
Notation. Let t (P)= h(x 1, . . . , x m )d F (x 1) d F (x m ) where h is a known function; h is called kernel. Assum···e that, in the following···, without loss of generality, h is symmet- ric, i.e. h x , x h xR , x R. If h is not symmetric, we can replace it with h x , . . . , x ( 1 2)= ( 2 1) ⇤( 1 m )= 1 m ! h x 1 , . . . , x m where ⇡ : and is set of all possible permutations of ( ) ⇡ ⇧ ( ⇡( ) ⇡( )) N N ⇧ 1, . . . , m .2Note that h is also unbiased, 7! ⇤ { P} 1 i.i.d 1 Eh⇤ =(m !) Eh(X⇡(1), . . . , X⇡(m )) =(m !) Eh(X 1, . . . , X m )=t (P). ⇡ ⇧ ⇡ ⇧ X2 X2 Definition 9.1. Suppose h : Rm R is symmetric in its arguments. The U-statistic for esti- 7! mating t (P)=Eh(X 1, . . . , X m ) is a symmetric average 1 u = u (X 1, . . . , X m )= n h(X i 1 , . . . , X i m ). m 1 i 1