mathematics
Article Box-Cox Gamma-G Family of Distributions: Theory and Applications
Abdulhakim A. Al-Babtain 1, Ibrahim Elbatal 2, Christophe Chesneau 3,* and Farrukh Jamal 4 1 Department of Statistics and Operations Research, King Saud University, Riyadh 11362, Saudi Arabia; [email protected] 2 Department of Mathematics and Statistics, College of Science, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11432, Saudi Arabia; [email protected] 3 Department of Mathematics, Université de Caen, LMNO, Campus II, Science 3, 14032 Caen, France 4 Department of Statistics, The Islamia University of Bahawalpur, Punjab 63100, Pakistan; [email protected] * Correspondence: [email protected]; Tel.: +33-02-3156-7424
Received: 14 September 2020; Accepted: 12 October 2020; Published: 16 October 2020
Abstract: This paper is devoted to a new class of distributions called the Box-Cox gamma-G family. It is a natural generalization of the useful Risti´c–Balakrishnan-G family of distributions, containing a wide variety of power gamma-G distributions, including the odd gamma-G distributions. The key tool for this generalization is the use of the Box-Cox transformation involving a tuning power parameter. Diverse mathematical properties of interest are derived. Then a specific member with three parameters based on the half-Cauchy distribution is studied and considered as a statistical model. The method of maximum likelihood is used to estimate the related parameters, along with a simulation study illustrating the theoretical convergence of the estimators. Finally, two different real datasets are analyzed to show the fitting power of the new model compared to other appropriate models.
Keywords: generalized distribution; Box-Cox transformation; mathematical properties; maximum likelihood estimation; half-Cauchy distribution
MSC: 60E05; 62E15; 62F10
1. Introduction Due to a lack of flexibility, common (probability) distributions do not allow the construction of convincing statistical models for a large panel of datasets. This drawback has been the driving force behind numerous studies introducing new families’ flexible distributions. One of the most popular approaches to define such families is by using a so-called generator. In this regard, we refer the reader to the Marshall-Olkin-G family by [1], the exp-G family by [2], the beta-G family by [3], the gamma-G family by [4], the Kumaraswamy-G family by [5], the Risti´c–Balakrishnan(RB)-G family (also called gamma-G type 2) by [6], the exponentiated generalized-G family by [7], the logistic-G family by [8], the transformerX (TX)-G family by [9], the Weibull-G family by [10], the exponentiated half-logistic-G family by [11], the odd generalized exponential-G family by [12], the odd Burr III-G family by [13], the cosine-sine-G family by [14], the generalized odd gamma-G family by [15], the extended odd-G family by [16], the type II general inverse exponential family by [17], the truncated Cauchy power-G family by [18], the exponentiated power generalized Weibull power series-G family by [19], the exponentiated truncated inverse Weibull-G family by [20], the ratio exponentiated general-G family by [21] and the Topp-Leone odd Fréchet-G family by [22]. In this article, we propose a new family of distributions based on a novel and motivated generator. As a first comment, we can say that it generalizes, in a certain sense, the RB-G family by [6] thanks to
Mathematics 2020, 8, 1801; doi:10.3390/math8101801 www.mdpi.com/journal/mathematics Mathematics 2020, 8, 1801 2 of 24 the use of the Box-Cox transformation. In order to substantiate this claim, the RB-G family is briefly presented below. Let G(x; Ψ) be a cumulative distribution function (cdf) of a baseline continuous distribution, where Ψ denotes one or more parameters. Then, the RB-G family is governed by the following cdf:
1 − F(x; δ, Ψ) = 1 − γ δ, log[G(x; Ψ) 1] , x ∈ R, (1) Γ(δ)
R u δ−1 −t where δ > 0, γ(δ, u) denotes the lower incomplete gamma function, that is, γ(δ, u) = 0 t e dt with u > 0, and Γ(δ) = limu→+∞ γ(δ, u). When δ is an integer, the cdf in (1) corresponds to the one of the δ-th lower record value statistic with underlying cdf G(x; Ψ). Beyond this characterization, the RB-G family has been used successfully in many applied situations, providing efficient solutions for modeling different types of phenomena through observed data. In this regard, we refer to the extensive survey of [23], and the references therein. The impact of the RB-G family in the statistical society motivates the study of natural extensions, widening its field of application in a certain sense. The idea of this article is to extend the RB-G family by replacing the logarithm transformation of G(x; Ψ)−1 in (1) by the Box-Cox transformation of G(x; Ψ)−1 depending on a tuning depth parameter −1 −1 λ > 0. More precisely, we replace log[G(x; Ψ) ] by bλ G(x; Ψ) , where bλ(y) denotes the Box-Cox λ transformation specified by bλ(y) = (y − 1)/λ, with y ≥ 1. We thus consider the cdf given by
1 G(x; Ψ)−λ − 1 F(x; λ, δ, Ψ) = 1 − γ δ, , x ∈ R. (2) Γ(δ) λ
The motivations and interests behind this idea are as follows: −1 −1 (i) When λ → 0, we have bλ G(x; Ψ) = log[G(x; Ψ) ] and the cdf F(x; λ, δ, Ψ) given by (2) becomes (1). −1 (ii) When λ = 1, we have bλ G(x; Ψ) = (1 − G(x; Ψ))/G(x; Ψ), which corresponds to (the inverse of) the odd transformation of G(x; Ψ), widely used this last decades in distributions theory. In the context of the gamma-G scheme, we may refer to [15,24,25]). (iii) In full generality, the consideration of power transform of G(x; Ψ) increases its statistical possibilities, and those of the related family as well. See, for instance, Ref. [2] for the former exp-G family. (iv) Recently, the Box-Cox transformation was implicitly used in the construction of the Muth-G family proposed by [26], with a total success in applications. In a sense, this study gives the green light for more in the application of the Box-Cox transformation for defining original families of distributions. For the purposes of this study, the family characterized by the cdf (2) is called the Box-Cox gamma-G, abbreviated as BCG-G for the sake of conciseness. Our aim is to study the BCG-G family in detail, examining both theoretical and practical aspects, with discussions. The theory includes the asymptotic behavior of the fundamental functions with discussion on their curvatures, some results in distribution, the analytical definition of the quantile function (qf), two results on stochastic ordering, manageable series expansions of the crucial functions, moments (raw, central and incomplete), mean deviations, mean residual life function, moment generating function, Rényi entropy, various distributional results on the order statistics and generalities on the maximum likelihood approach. In the practical work, the half-Cauchy cdf is considered for G(x; Ψ), offering a new solution for modelling data presenting a highly skewed distribution to the right. The maximum likelihood approach is successfully employed to estimate the model parameters. Based on this approach, we show that our model outperforms the fitting behavior of well established competitors. All of these facts highlight the importance of the new family. This article is made up of the following sections. In Section2, we present the probability density and the hazard rate functions. Next, some special members of the BCG-G family are listed. In Section3, Mathematics 2020, 8, 1801 3 of 24 we derive some mathematical properties of the BCG-G family. Section4 is devoted to a member of the BCG-G family defined with the half-Cauchy distribution as the baseline. Section5 concerns the statistical inference of this new distribution through the maximum likelihood method. Analysis of two practical datasets is also performed. The article ends in Section6.
2. Presentation of the BCG-G Family First, we recall that the BCG-G family is specified by the cdf F(x; λ, δ, Ψ) in (2). When differentiating, its probability density function (pdf) is obtained as
− −λ −λ δ 1 − G(x;Ψ) −1 1 − − G(x; Ψ) − 1 λ f (x; λ, δ, Ψ) = g(x; Ψ)G(x; Ψ) λ 1 e , x ∈ R. (3) Γ(δ) λ
When the baseline distribution is a lifetime distribution, another fundamental function is the hazard rate function (hrf) defined by
f (x; λ, δ, Ψ) h(x; λ, δ, Ψ) = 1 − F(x; λ, δ, Ψ) − −λ −λ δ 1 − G(x;Ψ) −1 1 − − G(x; Ψ) − 1 λ = g(x; Ψ)G(x; Ψ) λ 1 e , x ∈ R. (4) G(x;Ψ)−λ−1 λ γ δ, λ
Here, 1 − F(x; λ, δ, Ψ) corresponds to the survival function of the BCG-G family. One can also expressed the reversed and cumulative hrfs defined by }(x; λ, δ, Ψ) = f (x; λ, δ, Ψ)/F(x; λ, δ, Ψ) and Ω(x; λ, δ, Ψ) = − log[1 − F(x; λ, δ, Ψ)], respectively. Some special members of the BCG-G family are presented in Table1, taking classical baseline distributions with various supports and number of parameters. To our knowledge, none of them has ever been evoked in the literature.
Table 1. Some representatives of the Box-Cox gamma-G (BCG-G) family.
Baseline Distribution Support New Cdf Parameters −λ ( ) − 1 (x/θ) −1 ( ) Uniform 0, θ 1 Γ(δ) γ δ, λ λ, δ, θ −λ ( ) − 1 (log x/ log θ) −1 ( ) Benford 1, θ 1 Γ(δ) γ δ, λ λ, δ, θ −θx −λ ( + ) − 1 (1−e ) −1 ( ) Exponential 0, ∞ 1 Γ(δ) γ δ, λ λ, δ, θ −θxα −λ ( + ) − 1 (1−e ) −1 ( ) Weibull 0, ∞ 1 Γ(δ) γ δ, λ λ, δ, θ, α k −λ ( + ) − 1 [1−(θ/x) ] −1 ( ) Pareto θ, ∞ 1 Γ(δ) γ δ, λ λ, δ, θ, k λθx−α ( + ) − 1 e −1 ( ) Fréchet 0, ∞ 1 Γ(δ) γ δ, λ λ, δ, θ, α c −k −λ ( + ) − 1 {1−[1+(x/s) ] } −1 ( ) Burr XII 0, ∞ 1 Γ(δ) γ δ, λ λ, δ, s, k, c −λ ( + ) − 1 [γ(θ,x)/Γ(θ)] −1 ( ) Gamma 0, ∞ 1 Γ(δ) γ δ, λ λ, δ, θ −λ ( + ) − 1 [(2/π) arctan(x/θ)] −1 ( ) half-Cauchy 0, ∞ 1 Γ(δ) γ δ, λ λ, δ, θ −(x−µ)/s λ − 1 [1+e ] −1 ( ) Logistic R 1 Γ(δ) γ δ, λ λ, δ, µ, s −λ − 1 (Φ((x−µ)/σ)) −1 ( ) Normal R 1 Γ(δ) γ δ, λ λ, δ, µ, σ −(x−µ)/σ − 1 exp(λe )−1 ( ) Gumbel R 1 Γ(δ) γ δ, λ λ, δ, µ, σ −λ − 1 [(1/π) arctan((x−µ)/θ)+1/2] −1 ( ) Cauchy R 1 Γ(δ) γ δ, λ λ, δ, µ, θ
Let us mention that the BCG-G member defined with the half-Cauchy cdf as the baseline will be in the center of our practical investigations in Section4 (for reasons which will be explained later). Mathematics 2020, 8, 1801 4 of 24
3. General Mathematical Properties This section deals with some mathematical properties of the BCG-G family.
3.1. Asymptotic Behavior We now study the asymptotic behavior of the BCG-G cdf F(x; λ, δ, Ψ), pdf f (x; λ, δ, Ψ) and hrf h(x; λ, δ, Ψ) given by (2), (3) and (4), respectively. Among other things, this allows us to understand the impact of the parameters λ and δ, as well as the baseline distribution, on the tails of the corresponding distributions. R +∞ δ−1 −t δ−1 −u Thus, in the case G(x; Ψ) → 0, using the following equivalence result: u t e dt ∼ u e for u → ∞, the following equivalences hold:
( )−λ− 1 − G x;Ψ 1 F(x; λ, δ, Ψ) ∼ G(x; Ψ)−λ(δ−1)e λ , Γ(δ)λδ−1 and ( )−λ− 1 − G x;Ψ 1 f (x; λ, δ, Ψ) ∼ h(x; λ, δ, Ψ) ∼ g(x; Ψ)G(x; Ψ)−λδ−1e λ . Γ(δ)λδ−1
In the case G(x; Ψ) → 1, the following equivalence results: γ(δ, u) ∼ uδ/δ and (1 − u)−λ ∼ 1 + λu for u → 0, give
δ 1 G(x; Ψ)−λ − 1 1 F(x; λ, δ, Ψ) ∼ 1 − ∼ 1 − (1 − G(x; Ψ))δ, Γ(δ)δ λ Γ(δ)δ
δ−1 1 G(x; Ψ)−λ − 1 1 f (x; λ, δ, Ψ) ∼ g(x; Ψ) ∼ g(x; Ψ)(1 − G(x; Ψ))δ−1 Γ(δ) λ Γ(δ) and
g(x; Ψ) h(x; λ, δ, Ψ) ∼ δ . 1 − G(x; Ψ)
Thus, the asymptotic behavior of h(x; λ, δ, Ψ) is proportional to the baseline hrf, with δ as the coefficient of proportionality. Moreover, we can notice that, for the three functions, the impact of λ is strong when G(x; Ψ) → 0, while it is nonexistent when G(x; Ψ) → 1. Thus, it plays an important role in modulating the degree of asymmetry of the corresponding distribution.
3.2. Shapes of the BCG-G Pdf and the Hrf In the following, we analytically describe the shapes of the BCG-G pdf and hrf. A critical point of the BCG-G pdf is obtained as the root of the following equation: ∂ log[ f (x; λ, δ, Ψ)]/∂x = 0, where
∂ ∂g(x; Ψ)/∂x g(x; Ψ) G(x; Ψ)−λ−1g(x; Ψ) log[ f (x; λ, δ, Ψ)] = − (λ + 1) − λ(δ − 1) ∂x g(x; Ψ) G(x; Ψ) G(x; Ψ)−λ − 1 + G(x; Ψ)−λ−1g(x; Ψ). Mathematics 2020, 8, 1801 5 of 24
In view of these equations, we have no guarantee for the uniqueness of a critical point for any G(x; Ψ); more than one root can exist. Now, let ξ(x) = ∂2 log[ f (x; λ, δ, Ψ)]/∂x2. Then, we can express it as
(∂2g(x; Ψ)/∂x2)g(x; Ψ) − (∂g(x; Ψ)/∂x)2 (∂g(x; Ψ)/∂x)G(x; Ψ) − g(x; Ψ)2 ξ(x) = − (λ + 1) g(x; Ψ)2 G(x; Ψ)2 (∂g(x; Ψ)/∂x)G(x; Ψ)(1 − G(x; Ψ)λ) − g(x; Ψ)2(1 − (λ + 1)G(x; Ψ)λ) − λ(δ − 1) G(x; Ψ)2(1 − G(x; Ψ)λ)2 − (λ + 1)G(x; Ψ)−λ−2g(x; Ψ)2 + G(x; Ψ)−λ−1∂g(x; Ψ)/∂x.
If x = x0 denotes a critical point, then it corresponds to a “local maximum” if ξ(x0) < 0, a “local minimum” if ξ(x0) > 0 and a “point of inflection” if ξ(x0) = 0. Likewise, a critical point of the BCG-G hrf is obtained as the root of the following equation: ∂ log[h(x; λ, δ, Ψ)]/∂x = 0, where
∂ log[h(x; λ, δ, Ψ)] = ∂x ∂g(x; Ψ)/∂x g(x; Ψ) G(x; Ψ)−λ−1g(x; Ψ) − (λ + 1) − λ(δ − 1) + G(x; Ψ)−λ−1g(x; Ψ) g(x; Ψ) G(x; Ψ) G(x; Ψ)−λ − 1 δ−1 ( )−λ− 1 G(x; Ψ)−λ − 1 − G x;Ψ 1 + g(x; Ψ)G(x; Ψ)−λ−1 e λ . G(x;Ψ)−λ−1 λ γ δ, λ
Again, more than one root can exist. Let ζ(x) = ∂2 log[h(x; λ, δ, Ψ)]/∂x2. Then, after some development, we get
(∂2g(x; Ψ)/∂x2)g(x; Ψ) − (∂g(x; Ψ)/∂x)2 (∂g(x; Ψ)/∂x)G(x; Ψ) − g(x; Ψ)2 ζ(x) = − (λ + 1) g(x; Ψ)2 G(x; Ψ)2 (∂g(x; Ψ)/∂x)G(x; Ψ)(1 − G(x; Ψ)λ) − g(x; Ψ)2(1 − (λ + 1)G(x; Ψ)λ) − λ(δ − 1) G(x; Ψ)2(1 − G(x; Ψ)λ)2 − (λ + 1)G(x; Ψ)−λ−2g(x; Ψ)2 + G(x; Ψ)−λ−1∂g(x; Ψ)/∂ x δ−1 1 G(x; Ψ)−λ − 1 + (∂g(x; Ψ)/∂x)G(x; Ψ)−λ−1 G(x;Ψ)−λ−1 λ γ δ, λ δ−1 G(x; Ψ)−λ − 1 − (λ + 1)g(x; Ψ)2G(x; Ψ)−λ−2 λ δ−2 G(x; Ψ)−λ − 1 − (δ − 1)g(x; Ψ)2G(x; Ψ)−2(λ+1) λ δ−1 ( )−λ− G(x; Ψ)−λ − 1 − G x;Ψ 1 + g(x; Ψ)2G(x; Ψ)−2(λ+1) e λ λ 2(δ−1) ( )−λ− 1 G(x; Ψ)−λ − 1 −2 G x;Ψ 1 + g(x; Ψ)2G(x; Ψ)−2(λ+1) e λ . G(x;Ψ)−λ−1 2 λ γ δ, λ
The If x = x0 denotes such a critical point, then its nature depends on the sign of ζ(x0); it is a local maximum if ζ(x0) < 0, a local minimum if ζ(x0) > 0 and a point of inflection if ζ(x0) = 0. Finally, let us mention that the above critical points can be determined using any mathematical software (R, Python, Mathematica, Maple. . . ). Mathematics 2020, 8, 1801 6 of 24
3.3. Basic Distributional Results Some distributional results involving the BCG-G family are now presented. Let Y be a random variable (rv) with the gamma distribution with parameters 1 and δ, that is with pdf given as r(x) = δ−1 −x (1/Γ(δ))x e , x > 0 and QG(y; Ψ) be the qf associated to G(x; Ψ), thus satisfying the following nonlinear equation: G[QG(y; Ψ); Ψ] = QG[G(y; Ψ); Ψ] = y with y ∈ (0, 1). Then, the rv X defined by −1/λ X = QG [1 + λY] ; Ψ has the BCG-G pdf given by (3). Moreover, if X is a rv has the BCG-G pdf, then the following rv: G(X; Ψ)−λ − 1 Y = b [G(X; Ψ)−1] = λ λ follows the gamma distribution with the parameters 1 and δ.
3.4. Quantile Function with Applications to Asymmetry and Kurtosis Many practical purposes require an explicit form for the qf. Here, the qf of the BCG family is
−1/λ h −1 i Q(y; λ, δ, Ψ) = QG 1 + λγ (δ, (1 − y)Γ(δ)) ; Ψ , y ∈ (0, 1), (5) where γ−1(δ, u) is the inverse function of γ(δ, u) with respect to u, as described in [27]. Thus, the first quartile is Q(1/4; λ, δ, Ψ), the median is M = Q(1/2; λ, δ, Ψ), the third quartile is Q(3/4; λ, δ, Ψ) and the H-spread is H = Q(3/4; λ, δ, Ψ) − Q(1/4; λ, δ, Ψ). From (5), one can define diverse robust measures of skewness and kurtosis as the Galton skewness S proposed by [28] and the Moors kurtosis K introduced by [29]. They are defined as
S = [Q(3/4; λ, δ, Ψ) − 2Q(1/2; λ, δ, Ψ) + Q(1/4; λ, δ, Ψ)] H−1 (6) and
K = [Q(7/8; λ, δ, Ψ) − Q(5/8; λ, δ, Ψ) + Q(3/8; λ, δ, Ψ) − Q(1/8; λ, δ, Ψ)] H−1, (7) respectively. Then, the considered BCG-G distribution is left skewed, symmetric or right skewed according to S < 0, S = 0 or S > 0, respectively. As for K, it measures the degree of tail heaviness. The main advantages of S and K are to (i) be robust to eventual outliers and (ii) always exist, whatever the existence or not of moments.
3.5. Stochastic Ordering We now describe two stochastic ordering results on the BCG-G family, one is related to the parameter λ and the other to the parameter δ. First, for any λ > 0, based on (1) and (2), the following first-order stochastic dominance holds:
F(x; λ, δ, Ψ) ≤ F(x; δ, Ψ).
x −1 −1 Indeed, we have e ≥ 1 + x for all x ∈ R, implying that bλ[G(x; Ψ) ] ≥ log[G(x; Ψ) ], and the function γ(δ, u) is increasing with respect to u, implying the desired result. Therefore, the BCG-G family first-order stochastically dominates the RB-G family. We now reveal how two different members of the BCG-G family with the same baseline but with different parameters can be compared. Let δ1, δ2 > 0, and X1 and X2 be two rvs such that X1 has the BCG-G pdf f (x; λ, δ1, Ψ) and X2 has the BCG-G pdf f (x; λ, δ2, Ψ). Then, if δ2 ≤ δ1, the ratio function U(x; λ, δ1, δ2, Ψ) = f (x; λ, δ1, Ψ)/ f (x; λ, δ2, Ψ) is decreasing with respect to x since
−λ δ1−δ2−1 ∂ Γ(δ2) G(x; Ψ) − 1 −λ−1 U(x; λ, δ1, δ2, Ψ) = (δ2 − δ1) G(x; Ψ) g(x; Ψ) ≤ 0. ∂x Γ(δ1) λ Mathematics 2020, 8, 1801 7 of 24
Therefore, X2 is stochastically greater than X1 with respect to the likelihood ratio order. Others stochastic ordering information can be derived, as those listed in [30].
3.6. Manageable Series Expansions This part is devoted to manageable series expansions of the BCG-G cdf and pdf, allowing a vast exploration of their mathematical properties. First, let us determine a series expansion for the BCG-G cdf. By virtue of the classical exponential series expansion, we get
Z u +∞ (−1)k Z u +∞ (−1)k γ(δ, u) = tδ−1e−tdt = tk+δ−1dt = uk+δ. ∑ ∑ ( + ) 0 k=0 k! 0 k=0 k! k δ
Therefore,
k+δ 1 +∞ (−1)k G(x; Ψ)−λ − 1 F(x; λ, δ, Ψ) = 1 − ( ) ∑ ( + ) Γ δ k=0 k! k δ λ 1 +∞ (−1)k k+δ = 1 − 1 − G(x; Ψ)λ G(x; Ψ)−λ(k+δ). ( ) δ ∑ ( + ) k Γ δ λ k=0 k! k δ λ
Since G(x; Ψ) ∈ (0, 1), with exclusion of the limit cases, the generalized binomial theorem gives
k+δ +∞ k + δ 1 − G(x; Ψ)λ = (−1)`G(x; Ψ)λ`. ∑ ` `=0
On the other side, by applying the generalized binomial theorem two times in a row, we obtain
+∞ −λ(k + δ − `) G(x; Ψ)−λ(k+δ−`) = ∑ (−1)m(1 − G(x; Ψ))m m=0 m +∞ m −λ(k + δ − `)m = ∑ ∑ (−1)m+uG(x; Ψ)u m=0 u=0 m u +∞ +∞ −λ(k + δ − `)m = ∑ ∑ (−1)m+uG(x; Ψ)u. u=0 m=u m u
Hence, we can write +∞ F(x; λ, δ, Ψ) = 1 − ∑ cuWu(x; Ψ), u=0 where (−1)u +∞ +∞ +∞ (−1)k+`+m k + δ−λ(k + δ − `)m c = u ( ) δ ∑ ∑ ∑ ( + ) k ` Γ δ λ k=0 `=0 m=u k! k δ λ m u u and Wu(x; Ψ) = G(x; Ψ) . One can remark that, for u ≥ 1, Wu(x; Ψ) is the well-known exp-G cdf with power parameter u, as presented in [2]. An alternative series expression is
+∞ F(x; λ, δ, Ψ) = ∑ duWu(x; Ψ), (8) u=0 where d0 = 1 − c0 and, for u ≥ 1, du = −cu. That is, the BCG-G cdf can be expressed as an infinite linear combination of exp-G cdfs. Mathematics 2020, 8, 1801 8 of 24
For further purposes, we now introduce the pdf of the exp-G distribution with power parameter u u + 1; it is defined by wu+1(x; Ψ) = (u + 1)G(x; Ψ) g(x; Ψ). Then, upon differentiation of (8), the following series expansion for f (x; λ, δ, Ψ) is derived:
+∞ f (x; λ, δ, Ψ) = ∑ du+1wu+1(x; Ψ). (9) u=0
For numerical purposes, in many cases, the infinite bound can be substituted by any large integer number with an acceptable loss of precision. The series expansion (9) is manageable, and can be used to determine important features of the BCG-G family, as those related to the moments. Some of them are presented below.
3.7. Moments Hereafter, X denotes a rv having the BCG-G pdf f (x; λ, δ, θ) as defined in (3) and, for any integer u u, Yu denotes a rv having the following exp-G pdf: wu+1(x; Ψ) = (u + 1)G(x; Ψ) g(x; Ψ). Moreover, when a quantity is introduced, it is assumed that it mathematically exists; it is not necessarily the case, depending on the definition for G(x; Ψ) and the values of the parameters. The moments of a distribution are fundamental to define crucial measures, as central and dispersion parameters, as well as skewness and kurtosis measures. Here, we determine the (raw and central) moments for the BCG-G family. The r-th (raw) moment of X is given by
Z +∞ 0 r r µr = E(X ) = x f (x; λ, δ, Ψ)dx, −∞ where E denotes the expectation operator. Applying the expansion given by (9), we can write
+∞ 0 r µr = ∑ du+1E(Yu), (10) u=0
( r) ( r) = R +∞ r ( ) = ( + ) where E Yu is the r-th moments of Yu, that is E Yu −∞ x wu+1 x; Ψ dx u 1 R 1 r u 0 0 [QG(y; Ψ)] y dy. An acceptable numerical evaluation of µr is possible by replacing the infinite bound(s) by any large integer number(s). 0 0 2 0 0 2 The mean and variance of X are given by E(X) = µ1 and V(X) = E((X − µ1) ) = µ2 − (µ1) , 0 0 respectively. Moreover, the r-th central moment of X can be deduced from µ1, ... , µr as follows: r 0 r r k 0 k 0 µr = E (X − µ1) = ∑ (k)(−1) (µ1) µr−k. Important related measures are described below. k=0
3.8. Cumulants, Skewness And Kurtosis
The r-th cumulant of X, say κr, is derived from the following recursion formula:
r−1 r − 1 κ = µ0 − µ0 κ , r r ∑ − r−k k k=1 k 1
0 with κ1 = µ1. The moments skewness and kurtosis of X are defined by
κ3 κ4 γ1 = , γ2 = , (11) 3/2 κ2 κ2 2 respectively. Both can be calculated numerically for a given cdf G(x; Ψ). The related BCG-G distribution is left skewed, symmetric or right skewed according to γ1 < 0, γ1 = 0 or γ1 > 0, respectively. Moreover, the kurtosis γ2 measures its flatness. Thus, they have the same roles to S and K, respectively, but without guarantee of existence. Mathematics 2020, 8, 1801 9 of 24
3.9. Incomplete Moments The r-th incomplete moment of X is defined by
Z t ∗ r r µr (t) = E(X 1{X≤t}) = x f (x; λ, δ, Ψ)dx, t > 0, −∞ where 1A denotes the indicator function over the event A. Using the series expansion given by (9), we also have
+∞ ∗ r µ (t) = d (Y 1 r ) r ∑ u+1E u {Yu≤t} , u=0
r R t r R G(y;Ψ) r u (Y 1 r ) = x w (x )dx = (u + ) [Q (y )] y dy where E u {Yu≤t} −∞ u+1 ; Ψ 1 0 G ; Ψ . From the incomplete moments, several important mathematical quantities related to the BCG-G family can be expressed. Some of them are presented below. First, the mean deviation of X about the mean has the following expression:
0 0 0 ∗ 0 δ 0 = (|X − µ |) = 2µ F(µ ; λ, δ, Ψ) − 2µ (µ ). µ1 E 1 1 1 1 1
Also, the mean deviation of X about the median is given by
0 ∗ δM = E(|X − M|) = µ1 − 2µ1(M).
These two mean deviations can be used as measurements of the degree of dispersion of X. The Bonferroni and Lorenz curves are, respectively, given by
∗ ∗ µ1 [Q(x; λ, δ, Ψ)] µ1 [Q(x; λ, δ, Ψ)] B(x) = 0 , L(x) = 0 , x ∈ (0, 1). µ1Q(x; λ, δ, Ψ) µ1
They have numerous applications in various areas such as econometrics, finance, medicine, demography and insurance. We may refer the reader to [31]. The r-th moment of the residual life for X is given as
Z +∞ r 1 r Ur(t) = E((X − t) | X > t) = (x − t) f (x; λ, δ, Ψ)dx 1 − F(t; λ, δ, Ψ) t and the r-th moment of the reversed residual life for X is defined by
Z t r 1 r Vr(t) = E((t − X) | X ≤ t) = (t − x) f (x; λ, δ, Ψ)dx. F(t; λ, δ, Ψ) −∞
Using the classical binomial formula, they can be expressed in terms of incomplete moments as
1 r r U (t) = (−1)r−ktr−k [µ∗ − µ∗(t)] . r − ( ) ∑ k k 1 F t; λ, δ, Ψ k=0 k and 1 r r V (t) = (−1)ktr−kµ∗(t), r ( ) ∑ k F t; λ, δ, Ψ k=0 k respectively. The mean residual life and mean reversed residual life of X follow by taking r = 1. Further details and applications on the moments of the residual life and reversed residual life of a random variable can be found in [32]. Mathematics 2020, 8, 1801 10 of 24
3.10. Moment Generating Function More general to the moments, the moment generating function of X is given by
Z +∞ M(t; λ, δ, Ψ) = E(etX) = etx f (x; λ, δ, Ψ)dx, t ∈ R. −∞
Using the series expansion given by (9), we also have
+∞ tYu M(t; λ, δ, Ψ) = ∑ du+1E(e ), u=0
+ tYu R ∞ tx R 1 tQG(y;Ψ) u tYu where E(e ) = −∞ e wu+1(x; Ψ)dx = (u + 1) 0 e y dy or, eventually, E(e ) = +∞ v v 0 r r ∑v=0(t /v!)E(Yu ). The r-th moment of X follows from the formula: µr = ∂ M(t; λ, δ, Ψ)/∂t |t=0.
3.11. Rényi Entropy The Rényi entropy introduced by [33] is a measure of variation of the uncertainty used in many areas as engineering, biometrics, quantum information and ecology. Here, we discuss the Rényi entropy of the BCG-G family. Let υ > 0 with υ 6= 1. Then, the Réyni entropy of the BCG-G family is specified by 1 Z +∞ I(υ) = log f (x; λ, δ, Ψ)υdx . (12) 1 − υ −∞ Now, let us notice that
υ(δ−1) ( )−λ− 1 G(x; Ψ)−λ − 1 −υ G x;Ψ 1 f (x; λ, δ, Ψ)υ = g(x; Ψ)υG(x; Ψ)−υ(λ+1) e λ . Γ(δ)υ λ
By virtue of the exponential series expansion, we get
1 +∞ (−1)k υk k+υ(δ−1) f (x; λ, δ, Ψ)υ = g(x; Ψ)υG(x; Ψ)−υ(λ+1) G(x; Ψ)−λ − 1 υ υ(δ−1) ∑ k Γ(δ) λ k=0 k! λ 1 +∞ (−1)k υk k+υ(δ−1) = g(x; Ψ)υ 1 − G(x; Ψ)λ G(x; Ψ)−λk−υ(λδ+1). υ υ(δ−1) ∑ k Γ(δ) λ k=0 k! λ
The generalized binomial theorem yields
1 +∞ +∞ f (x; λ, δ, Ψ)υ = e G(x; Ψ)−λ(k−`)−υ(λδ+1)g(x; Ψ)υ, υ υ(δ−1) ∑ ∑ k,` Γ(δ) λ k=0 `=0 where (−1)k+` υk k + υ(δ − 1) e ` = . k, k! λk ` By putting this expansion into (12), we can express I(υ) as
1 I(υ) = − υ log[Γ(δ)] − υ(δ − 1) log(λ) 1 − υ +∞ +∞ Z +∞ −λ(k−`)−υ(λδ+1) υ + log ∑ ∑ ek,` G(x; Ψ) g(x; Ψ) dx . k=0 `=0 −∞
We can express the last integral term as
Z +∞ Z 1 −λ(k−`)−υ(λδ+1) υ −λ(k−`)−υ(λδ+1) υ−1 G(x; Ψ) g(x; Ψ) dx = y g(QG(y; Ψ); Ψ) dy. −∞ 0 Mathematics 2020, 8, 1801 11 of 24
Numerical evaluation of this integral is feasible.
3.12. Order Statistics The order statistics are classical rvs modelling a wide variety of real-life phenomena. Here, we derive tractable expressions for their pdfs as well as their moments in the context of the BCG-G family. Let X1, ... , Xn be n rvs having the BCG-G pdf. Then, the pdf of the i-th order statistic of X1,..., Xn is given by
n! n−i n − i f (x; λ, δ, Ψ) = (−1)jF(x; λ, δ, Ψ)j+i−1 f (x; λ, δ, Ψ). i:n ( − ) ( − ) ∑ i 1 ! n i ! j=0 j
It follows from the series expansions for F(x; λ, δ, Ψ) and f (x; λ, δ, Ψ) given by (8) and (9), respectively, that
" #j+i−1 n! n−i n − i +∞ +∞ f (x; λ, δ, Ψ) = (−1)j d W (x; Ψ) d w (x; Ψ). (13) i:n ( − ) ( − ) ∑ ∑ u u ∑ v+1 v+1 i 1 ! n i ! j=0 j u=0 v=0
u Since Wu(x; Ψ) = G(x; Ψ) , owing to a result by [34], the following equality holds:
j+i− "+∞ # 1 +∞ ∑ duWu(x; Ψ) = ∑ ξkWk(x; Ψ), u=0 k=0
j+i−1 k with ξk defined by the following recursive formula: ξ0 = d0 and, for k ≥ 1, ξk = (1/(kd0)) ∑ [`(j + `=1 i) − k]d`ξk−`. Now, (13) becomes
n! n−i n − i +∞ +∞ f (x; λ, δ, Ψ) = (−1)j ξ d W (x; Ψ)w (x; Ψ). (14) i:n ( − ) ( − ) ∑ ∑ ∑ k v+1 k v+1 i 1 ! n i ! j=0 j k=0 v=0
Let us note by Xi:n the i-th order statistic of X1, ... , Xn. Then, by using (14), the r-th moment of Xi:n is given by
Z +∞ r r E(Xi:n) = x fi:n(x; λ, δ, Ψ)dx −∞ n! n−i n − i +∞ +∞ Z +∞ = (−1)j ξ d xrW (x; Ψ)w (x; Ψ)dx, ( − ) ( − ) ∑ ∑ ∑ k v+1 k v+1 i 1 ! n i ! j=0 j k=0 v=0 −∞ with
Z +∞ Z +∞ r r k+v x Wk(x; Ψ)wv+1(x; Ψ)dx = (v + 1) x G(x; Ψ) g(x; Ψ)dx −∞ −∞ Z 1 r k+v = (v + 1) [QG(y; Ψ)] y dy. 0
The last integral term can be calculated numerically for most of the considered cdf G(x; Ψ). Proceeding as above, one can derive various mathematical quantities, such as the incomplete moments, mean deviations and the moment generating function of Xi:n. Mathematics 2020, 8, 1801 12 of 24
3.13. Maximum Likelihood: General Formula In this section, we provide the main ingredient for estimating the parameters of the BCG-G model by the maximum approach. Let x1, ... , xn be the observations of n independent and identically distributed rvs having the BCG-G pdf. Then, the log-likelihood function can be expressed as
n `(λ, δ, Ψ) = ∑ log [ f (xi, λ, δ, Ψ)] i=1 n n = −n log [Γ(δ)] + ∑ log [g(xi; Ψ)] − (λ + 1) ∑ log [G(xi; Ψ)] i=1 i=1 n n −λ h −λ i G(xi; Ψ) n + (δ − 1) ∑ log G(xi; Ψ) − 1 − n(δ − 1) log(λ) − ∑ + . i=1 i=1 λ λ
The maximum likelihood estimates (MLEs) for λ, δ and Ψ, say λˆ , δˆ and Ψˆ , respectively, are defined such that `(λˆ , δˆ, Ψˆ ) is maximal. If the first partial derivatives of `(λ, δ, Ψ) with respect to all the parameters exist, the MLEs are the simultaneous solutions of the following equations: ∂`(λ, δ, Ψ)/∂λ = 0, ∂`(λ, δ, Ψ)/∂δ = 0 and ∂`(λ, δ, Ψ)/∂Ψ = 0, with
∂ n n log [G(x ; Ψ)] 1 `(λ, δ, Ψ) = − log [G(x ; Ψ)] − (δ − 1) i − n(δ − 1) ∑ i ∑ − ( )λ ∂λ i=1 i=1 1 G xi; Ψ λ n G(x ; Ψ)−λ log [G(x ; Ψ)] n G(x ; Ψ)−λ n + i i + i − ∑ ∑ 2 2 , (15) i=1 λ i=1 λ λ
∂ Γ0(δ) n h i `(λ, δ, Ψ) = −n + log G(x ; Ψ)−λ − 1 − n log(λ) (16) ( ) ∑ i ∂δ Γ δ i=1 and
∂ n ∂g(x ; Ψ)/∂Ψ n ∂G(x ; Ψ)/∂Ψ `(λ, δ, Ψ) = i − (λ + 1) i ∑ ( ) ∑ ( ) ∂Ψ i=1 g xi; Ψ i=1 G xi; Ψ n (∂G(x ; Ψ)/∂Ψ)G(x ; Ψ)−λ−1 n − λ(δ − 1) i i + (∂G(x ; Ψ)/∂Ψ)G(x ; Ψ)−λ−1. (17) ∑ ( )−λ − ∑ i i i=1 G xi; Ψ 1 i=1
Clearly, closed-form expressions for λˆ , δˆ and Ψˆ do not exist, but they can be determined numerically using iterative methods such as Newton-Raphson algorithms, via any statistical software. For the (asymptotic) confidence interval estimation and statistical tests on the model parameters, the corresponding observed information matrix is required. Hereafter, r denotes the number of components of Ψ, and we set Ψ = (Ψ1, ... , Ψr) and Ψˆ = (Ψˆ 1, ... , Ψˆ r) . Then, the observed information matrix at (λ, δ, Ψ) is obtained as
∂2 ∂2 ∂2 ∂2 2 `(λ, δ, Ψ) `(λ, δ, Ψ) `(λ, δ, Ψ) ... `(λ, δ, Ψ) ∂λ ∂λ∂δ ∂λ∂Ψ1 ∂λ∂Ψr ∂2 ∂2 ∂2 . 2 `(λ, δ, Ψ) `(λ, δ, Ψ) ... `(λ, δ, Ψ) ∂δ ∂δ∂Ψ1 ∂δ∂Ψr ∂2 ∂2 .. 2 `(λ, δ, Ψ) ... `(λ, δ, Ψ) ∂Ψ ∂Ψ1∂Ψr J(λ, δ, Ψ) = − 1 . ...... ...... ∂2 ...... 2 `(λ, δ, Ψ) ∂Ψr (r+2)×(r+2)
For the sake of space, the expressions of the components of this matrix are omitted. Then, under some technical condition of regularity type, when n is large, the distribution of the random estimators behind (λˆ , δˆ, Ψˆ ) can be approximated by the multivariate normal distribution Mathematics 2020, 8, 1801 13 of 24