arXiv:0705.3496v3 [math.PR] 12 Jan 2010 tetnsteoeprmtrfml fdsrbtoskonas known distributions of family one-parameter tributions the extends It eateto ahmtc,Sg nvriy aa840-8502 Saga University, E-mail: Saga Mathematics, of Department HANDA KENJI process point Poisson–Dirichlet two-parameter The DOI: Bernoulli 1350-7265 [ Yor and PD( Pitman denoted information. distribution, Poisson–Dirichlet bibliographic parameter and topics related func of view Keywords: of point the distribution. Markov–Kr from a Poisson–Dirichlet The parameter discussed are case. we is Yor, one-parameter process and the Dirichlet Pitman for to known due previously approach Poisson–Diri transform two-parameter Laplace t the the in the of points apply we structure as this, Using mathematical masses functions. the correlation the regarding of associat terms by the obtained consider of process hence characterization point and A 1 distribution. sum discrete with sequences pro decreasing a positive is distribution Poisson–Dirichlet two-parameter The ie 0 Given dis probability of family two-parameter a simplex with infinite-dimensional concerned is paper This Introduction 1. distribution hsi neetoi ern fteoiia ril publis 1082–1116 detail. article 4, typographic original No. the 15, of Vol. reprint 2009, electronic an is This 10.3150/08-BEJ180 [email protected] ≤ hc eeitoue yKnmn[ Kingman by introduced were which , 15 orlto ucin akvKeniett;pitproce point identity; Markov–Krein function; correlation
< α c 4,20,1082–1116 2009, (4), 09ISI/BS 2009 V and 1 1 ∇ = ∞ Y = 1 > θ , ( − v i α ( = ) V en eune( sequence a define , ∇ i hsrpitdffr rmteoiia npgnto and pagination in original the from differs reprint This . (1 = ∞ v 1 fnnngtv eraigsqecswt u 1, sum with sequences decreasing non-negative of v , − 2 . . . , Y 1 ) : ) v (1 1 ≥ − v Y V 2 i i 32 ≥ ≥ − frno aibe by variables random of ) 1 dpitpoes(hti,terandom the is, (that process point ed ;se o xml,[ example, for see, ]; ) e ythe by hed aiiydsrbto ntettlt of totality the on distribution bability Y hoyo on rcse orva the reveal to processes point of theory i he itiuin lo developing Also, distribution. chlet ,θ α, Japan. , inlaayi ae ntetwo- the on based analysis tional dt oenmse farandom a of masses govern to ed 0 epstv elln)i ie in given is line) real positive he i dniyfrtegeneralized the for identity ein , ,i h olwn manner. following the in ), ( i l oetn eea results several extend to ble 2 = s Poisson–Dirichlet ss; v ISI/BS i oso–iihe dis- Poisson–Dirichlet , 1 = 45 3 . . . , endtetwo- the defined ] in . rbtoso the on tributions ) , Bernoulli 1 , 44 , 50 , (1.1) for ] The two-parameter Poisson–Dirichlet point process 1083 where Y , Y ,... are independent and each Y is Beta(1 α, θ + iα)-distributed. Let (V ) 1 2 i − i be the decreasing order statistics of (V ), namely V V are the ranked values of i 1 ≥ 2 ≥ (Vi), and define PD(α, θ) to be the law of (Vi) on . There is some background to the study of these distributions, as explained in [45] and∇∞ [44]. In particular, a number of results concerning PD(α, θ) were obtained in [45]. The proofs there use auxiliary random variables and related processes (such as stable subordinators and gamma processes) and require deep insight into them. As for the original Poisson–Dirichlet distributions, which form a one-parameter family PD(0,θ): θ> 0 and correspond to the gamma processes, certain independence property{ often makes the} analysis relatively easier. See [50] and [51] for extensive discussions. One purpose of this article is to provide another approach to study the two-parameter family PD(α, θ):0 α< 1,θ> α , based mainly on conventional arguments in the theory{ of point processes;≤ see, for− example,} [7] for general accounts of the theory. This means that a random element (Vi) governed by PD(α, θ) is studied through the ran- dom point process ξ := δVi , which we call the (two-parameter) Poisson–Dirichlet point process with parameters (α, θ), or simply the PD(α, θ) process. Note that although the above ξ is a ‘non-labeled’ object, it is sufficient to recover the law of the ranked sequence (V ). For example, we have ξ([t, )) = 0 precisely when V
2. The correlation function of the PD(α, θ) process
The main subject of this section is the correlation function. We begin with some defi- nitions and concepts from the theory of point processes [7]. (See also Section 2 in [27] for a comprehensive exposition of this material.) Let ξ be a point process taking values in R. For simplicity, suppose that ξ is expressed as ξ = δXi for some random vari- ables X1,X2,.... Throughout what follows, we assume that ξ is simple, in the sense that Xi = Xj (i = j) a.s. For any positive integer n, the nth correlation measure (also called the nth factorial moment measure) of ξ, if it exists, is defined to be a σ-finite measure The two-parameter Poisson–Dirichlet point process 1085
n µn such that for non-negative measurable functions f on R ,
E f(Xi1 ,...,Xin ) = f(x1,...,xn)µn(dx1 dxn), (2.1) Rn i1,...,in(=) where the sum is taken over n-tuples of distinct indices. In particular, µ1 is the mean measure of ξ and it follows that µ (dx dx ) is necessarily symmetric in x ,...,x . n 1 n 1 n If µn has a density with respect to the n-dimensional Lebesgue measure, the density is called the nth correlation function of ξ. In what follows, α and θ are such that 0 α< 1 and θ> α, unless otherwise men- ≤ − tioned. Notice that the PD(α, θ) process ξ = δ = δVi , with (Vi) and (Vi) defined Vei as in the Introduction, is simple. Denote by cn,α,θ the product n n θ , α = 0, Γ(θ +1+(i 1)α) n 1 − = Γ(θ + 1)Γ(θ/α + n)α − Γ(1 α)Γ(θ + iα) , 0 <α< 1. i=1 − Γ(θ + αn)Γ(θ/α + 1)Γ(1 α)n − In view of the left-hand side, it is clear that
c = c c (m,n 0, 1, 2,... ) (2.2) m+n,α,θ m,α,θ n,α,θ+αm ∈{ } with the convention that c0,α,θ = 1. We also need the following notation for the n- dimensional unit simplex:
∆ = (v ,...,v ): v 0,...,v 0, v + + v 1 . n { 1 n 1 ≥ n ≥ 1 n ≤ }
In general, the indicator function of a set (or an event) E is denoted by 1E . The main result of this section generalizes Watterson’s formula [54] ((18), page 644) for the corre- lation functions of the PD(0,θ) process to the PD(α, θ) process.
Theorem 2.1. For each n =1, 2,..., the nth correlation function of the PD(α, θ) process is given by
θ+αn 1 n n − (α+1) q (v ,...,v ) := c v− 1 v 1 (v ,...,v ). (2.3) n,α,θ 1 n n,α,θ i − j ∆n 1 n i=1 j=1 We will give a proof based on a known property of the size-biased permutation for PD(α, θ). For the one-parameter family PD(0,θ): θ> 0 , such an idea to derive (2.3) is suggested in [39], Corollary 7.4. From the{ definition of size-biased} permutation (see [45] or Section 4 of [3]), we can make the following observation. If a sequence (Vi) of random variables such that Vi > 0 (i =1, 2,...) and Vi = 1 is given, then, for each n =1, 2,..., the nth correlation measure µn of δV exists and i n i 1 − 1 ♯ µ (dv dv )= v− 1 v 1 (v ,...,v )µ (dv dv ), (2.4) n 1 n i − j ∆n 1 n n 1 n i=1 j=1 1086 K. Handa
♯ ♯ ♯ where µn is the joint law of (V1 ,...,Vn ), the first n components of the size-biased permu- tation (V ♯) of(V ). In (2.4) and throughout, we adopt the convention that 0 = 0. i i j=1 Proof of Theorem 2.1. Let (Vi) have PD(α, θ) distribution. A remarkable result ob- ♯ tained in [37, 41, 43] then tells us that the law of (Vi ) coincides with the law of (Vi) in (1.1). Hence for any non-negative measurable function f on Rn, the expectation ♯ ♯ E[f(V1 ,...,Vn )] is given by
n α θ+iα 1 dy1 dyn f(v1,...,vn)cn,α,θ yi− (1 yi) − , (2.5) n { − } [0,1] i=1 i 1 0 where v = v (y ,...,y )= y − (1 y ), with the convention that =1. Note i i 1 n i j=1 − j j=1 that the mapping (y ,...,y ) =: y (v ,...,v ) =: v from [0, 1]n to ∆ has the inverse 1 n → 1 n n
i 1 1 − − y = y (v)= v 1 v (i =1,...,n). (2.6) i i i − j j=1 Therefore, (2.5) becomes
n ∂y α θ+iα 1 c dv dv f(v) y− (1 y ) − . (2.7) n,α,θ 1 n ∂v { i − i } ∆n i=1 Observing from (2.6) that
θ+αn 1 n n n − α θ+iα 1 α y− (1 y ) − = v− 1 v , (2.8) { i − i } i − j i=1 i=1 j=1 we have, by (2.5) and (2.7),
n i 1 1 − − ♯ ♯ E[f(V1 ,...,Vn )] = dv1 dvn f(v)qn,α,θ(v) vi 1 vj , (2.9) Rn − i=1 j=1 where qn,α,θ is defined by (2.3). With the help of (2.4), this proves Theorem 2.1.
It is known that correlation functions appear in the expansion of the ‘probability generating function’ of a random point process δXi ; see Section 5 in [7] for general accounts. This functional is defined to be the expectation of an infinite product of the form g(Xi). For the sake of clarity, we provide the following definition of infinite The two-parameter Poisson–Dirichlet point process 1087 products. Given a sequence t of complex numbers, define { i} n lim (1 + ti) [1, ], if ti 0 (i =1, 2,...), n ∞ →∞ ∈ ∞ ≥ (1 + t )= i=1 i n i=1 lim (1 + ti) C, if the limit exists. n ∈ →∞ i=1 In general, it holds that
∞ ∞ ∞ 1 (1 + t )=1+ t t =1+ t t . (2.10) | i| | i1 | | in | n! | i1 | | in | i=1 n=1 i1<