Scale Mixtures and Slash Distributions 1 Introduction

Scale Mixtures and Slash Distributions Miguel Martins Felgueiras CEAUL e ESTG do Instituto Polit¶ecnicode Leiria Abstract Pareto scale mixtures are very e®ective for modeling heavy tailed data. A new class of models is described, generalizing commonly used slash distributions. Mixture properties and possible applications are discussed. keywords: Pareto distributions, scale mixtures, slash distributions. AMS: 60E05 1 Introduction Classical models assume a ¯xed scale parameter. However, in many situations it is advisable to randomize the scale parameter, with increased variability (Johnson et al., 1992) | for instance, in biostatistical studies the negative binomial model is sometimes referred to as a \more ﬂexible Poisson" since it is the result of modeling the number of eggs laid by females of certain species, the individual being P oisson(¸), but considering that the ¸'s are values from a Gamma(®; ±) random variable. This procedure leads to a hierarchical model randomizing the former one, and hence more ﬂexible. In many applications the Gamma(®; ±) is considered a suitable scale mixing model, because its natural connection with the Laplace transforms brings in a useful toolbox of ready-to-use formulas, and in many cases the resulting mixture is reasonably tractable. But any positive random variables 1 can be used to randomize a scale parameter, although in most cases the resulting mixture is di±cult to work with, since usually the corresponding density functions are not expressable in a close form. The family of Pareto distributions emerges as interesting randomization candidate, for two main reasons. First, it has a simple analytical form, leading to easy mixture densities computation. Second, Pareto's fat tail implies that the resulting densities will have higher kurtosis, useful in heavy tailed data modeling. The mixture can be de¯ned (following Kelker's (1971) notation) as Y = £X (1) where £;X are independent random variables with X absolutely continuous and £ s P areto (®) ; ¡®¡1 f£ (θ) = ®θ ; θ ¸ 1; ® > 0: The fact that we use Pareto with left-endpoint ®£ = 1 is in a sense a severe restriction, since it implies that P[jY j > jXj] = 1. Pareto random variables £e = £¡1 with support θ ¸ 0 could also be considered, covering all positive values. However, explicit density functions and interesting mixture distributions were not found in that more general setting. On the other hand, as θ > 1, the above mentioned expansion has important consequences tied to stochastic ordering. 2 Mixture densities and other properties The probability density function of the mixture Y = £X can be written as Z 1 ¡®¡2 ¡ y ¢ fY (y) = ®θ fX θ dθ; (2) 1 2 originating for some usual X distributions the incomplete gamma and beta based densities (Felgueiras, 2008) presented in table 1. Since the support of £ is S£ = [1; 1[; multiplying X by £ implies expansion of the X values. Clearly, the absolute values of the existing moments of such mixtures are always greater than the corresponding X moments. Further, P (Y > t) > P (X > t) () F Y (t) > F X (t) ; t > 0; i.e. Y stochastic dominates X, a potentially important fact in reliability modeling and in premium computing policies in actuarial applications (Centeno and Andrade e Silva, 2001). When ® increases, 8 < ¡® 0; θ > 1 lim F £® (θ) = lim θ = ®!+1 ®!+1 : 1; θ = 1 and £® converges to the degenerate random variable at 1. Convergence in distribution to a constant implies convergence in probability, and by convergence in probability properties, when ® ! +1 then d Y = £®X ¡! X: (3) ®!1 Thus, the mixture model can be near the original, for large values of ®; or more far apart when ® is small, leading to a wide range of solutions. 3 Mixture and slash distribution extensions The mixture can also be regarded as a random variable quotient, X Y = £X = ; (4) £¡1 3 Table 1: Some Pareto scale mixtures densities Distribution Density Mixture density 2 ³ ´ x y2 1 ¡ ®20:5®¡1γ ®+1 ; X » N (0; 1) f (x) = p e 2 2 2 X fY (y) = p ; y 6= 0 2¼ ¼ jyj®+1 0 1 h i 2 3+¯ 2 ¯+1 ¡ 1+¯ @ 1+¯ A 2 2 exp ¡0:5 jxj ®(1+¯)γ 2 (®+1);0:5jyj ³ ´ ; ¡ 1 < ¯ · 1 3+¯ fY (y) = ¯+1 µ ¶ ; y 6= 0 ¡® 3+¯ ¡ 2 2 ®+1 2 4¡ 2 jyj ¡®¡1 ® 1 1 ®y R y z X » Cauchy(0; 1) fX (x) = f (y) = dz; y 6= 0 ¼ 1 + x2 Y ¼ 0 1 + z2 1 ®y¡®¡1 X » Gama(¯; 1) f (x) = x¯¡1e¡x f (y) = γ (® + ¯; y) ; y > 0 X ¡(¯) Y ¡(¯) 8 > ®B (p + ®; q; y) > ; 0 < y < 1 > y®+1B(p; q) (1 ¡ x)q¡1 < X » Beta(p; q) fX (x) = fY (y) = x1¡pB(p; q) > > ®B (p + ®; q) :> ; y ¸ 1 y®+1B(p; q) ¡ ¢ ¡1 ¯ ¯¡1 ¡x¯ ®γ ®¯ + 1; y X » W eibull (¯; 1) fX (x) = ¯x e f (y) = ; y > 0 Y y®+1 8 > 2 ¡®¡1 > ® y ln y; ® = ¯; y > 0 <> X » P areto (¯) f (x) = ¯x¡¯¡1 f (y) = X Y > ¡ ¢ > ®¯ y¡®¡1 ¡ y¡¯¡1 :> ; ® 6= ¯; y > 0 ¯ ¡ ® 4 where ¡ ¡1¢ ¡2 ®¡1 f£¡1 (θ) = f£ θ θ = ®θ ; 0 < θ · 1; ® > 0; and so £¡1 s Beta(®; 1): (5) When ® = 1; the expressions above simplify, and since £¡1 s U (0; 1) we obtain slash distribution family, often used in reliability and robustness studies (G¶omez et al, 2007; Johnson et al., 1994). In this context, it is obvious that Pareto scale mixtures generalize the class of slash distributions, and therefore share their wide range of applications, namely in situations where symmetrical distributions with fat tails are appropriated. For 0 < ® < 1, Pareto scale mixtures have heavier tailweight than the slash distributions, and for ® > 1 we have the reverse situation. As a side result, we prove that slash distributions do not have mean value. Theorem 1. Let Y = £X, where £;X are independent random variables, X is absolutely continuous and £ s P areto (1) : Then Y does not have mean value. Proof. When E (X) = C 6= 0; then if Y mean exists E (Y ) = E (£) E (X) = cE (£) : Since E (£) does not exists for £ s P areto (1) ; then it is obvious that also Y mean does not exists. For E (X) = 0; note that Z +1 ³ ´ Z +1 ³ ´ ¡3 y fX (x) y fY (y) = θ fX dθ = f£ dx = 1 θ ¡1 jxj x Z Z +1 ³ ´¡2 +1 fX (x) y 1 y = dx = 2 jxjfX (x) dx; > 1 ¡1 jxj x y ¡1 x 5 leading to 8 R <> 1 y y2 0 xfX (x) dx; y > x > 0; y > 0 fY (y) = R : :> 1 0 y2 y ¡xfX (x) dx; y < x < 0; y < 0 The expectation of Y exists if and only if Z · Z ¸ Z · Z ¸ 0 1 0 +1 1 y E (jY j) = jyj 2 ¡xfX (x) dx dy + jyj 2 xfX (x) dx dy ¡1 y y 0 y 0 is convergent. In what concerns the second integral in the right hand side of that expression Z · Z ¸ Z ·Z ¸ +1 1 y +1 1 y jyj 2 xfX (x) dx dy = xfX (x) dx dy; 0 y 0 0 y 0 and using straightforward inequalities, Z ·Z ¸ Z ·Z ¸ +1 1 y +1 1 y xfX (x) dx dy > xfX (x) dx dy > 0 y 0 1 y 1 Z ·Z ¸ +1 1 y > fX (x) dx dy = 1 y 1 Z +1 1 = [FX (y) ¡ FX (1)] dy; 1 y 1 as lim y £ [FX (y) ¡ FX (1)] = 1 ¡ FX (1) = C > 0 we conclude that y!+1 y Z +1 1 [FX (y) ¡ FX (1)] dy 1 y is divergent and hence the expectation of Y doesn't exist. 4 Examples 4.1 Pareto mixtures of normal random variables Pareto mixtures of normals show the important features of Pareto mixtures of a symmetrical population, and are potentially the more widely useful. In 6 fact, when X » N (0; 1) we obtain an in¯nitely divisible mixture (Kelker, 1971) with density µ ¶ ® + 1 y2 f (y) = ®20:5®¡1 jyj¡®¡1 ¼¡0:5γ ; ; y 6= 0; (6) Y 2 2 where Z y γ (a; y) = ta¡1e¡tdt: (7) 0 For instance, for ® = 1 y2 ¡ 1 ¡ e 2 fY (y) = p ; y 6= 0; (8) 2¼y2 and for ® = 3 ³ ´ 3 2 ¡ (2 + y2) e¡y2=2 fY (y) = p ; y 6= 0: (9) 2¼y4 d As previously stated, £®X ¡! X: This can be seen in the graphical repre- ®!1 sentation below Figure 1: Some non convex gaussian mixtures densities 0.4 0.3 0.2 0.1 -4 -2 0 2 4 The thick line represents N(0; 1) and the other lines the mixture for ® = 1; :::; 5; 20; 30. 7 Note that the ® parameter works in a rather similar way as the n parameter in t-Student distributions. However, in this situation, the Y distribution as heavier tails (for small values of ®) and the rate of convergence towards the gaussian limit is slower than in the t family. Another symmetrical mixture with even heavier tails can be generated for X » Cauchy(0; 1) and ® = 1; originating the slash Cauchy density ln (y2 + 1) f (y) = ; y 6= 0: (10) Y 2¼y2 In the next table, we can observe that Cauchy and slash gaussian quantiles are not far apart, but the slash Cauchy has impressive larger quantiles, and therefore can be useful in modeling very extreme situations. Table 2: Probability quantiles for the Cauchy, the slash gaussian and the slash Cauchy ® 0.5 0.75 0.90 0.95 0.99 0.999 q® Cauchy 0 1.00 3.08 6.31 31.82 318.31 q® slash gaussian 0 1.47 3.99 7.98 39.89 398.94 q® slash Cauchy 0 2.45 10.75 27.46 200.57 2850.55 4.2 Pareto mixtures of positive random variables To exemplify Pareto mixtures of positive random variables we choose expo- nential parent, since it exhibits the more important features of mixtures of a positive support population, and it is the more readily useful in applications.

Scale Mixtures and Slash Distributions 1 Introduction

Details

Download

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

Support