On Generalized Gamma Distribution and Its Application to Survival Data

Total Page:16

File Type:pdf, Size:1020Kb

On Generalized Gamma Distribution and Its Application to Survival Data International Journal of Statistics and Probability; Vol. 8, No. 5; September 2019 ISSN 1927-7032 E-ISSN 1927-7040 Published by Canadian Center of Science and Education On Generalized Gamma Distribution and Its Application to Survival Data Kiche J1, Oscar Ngesa2 & George Orwa3 1 Mathematics Department, Pan African University, Institute of Basic Sciences, Technology, and Innovation, Kenya 2 Mathematics and Informatics Department, Taita-Taveta University, Kenya 3 Statistics and Actuarial Sciences Department, Jomo Kenyatta University of Agriculture and Technology, Kenya Correspondence: Kiche J, Mathematics Department, Pan African University, Institute of Basic Sciences, Technology, and Innovation, Kenya. E-mail: [email protected] Received: July 7, 2019 Accepted: August 27, 2019 Online Published: August 30, 2019 doi:10.5539/ijsp.v8n5p85 URL: https://doi.org/10.5539/ijsp.v8n5p85 Abstract The generalized gamma distribution is a continuous probability distribution with three parameters. It is a generalization of the two-parameter gamma distribution. Since many distributions commonly used for parametric models in survival analysis (such as the Exponential distribution , the Weibull distribution and the Gamma distribution) are special cases of the generalized gamma, it is sometimes used to determine which parametric model is appropriate for a given set of data. Generalized gamma distribution is one of the distributions used in frailty modeling. In this study , it is shown that generalized gamma distribution has three sub-families and its application to the analysis of a survival data has also been explored. The parametric modeling approach has been carried out to find the expected results. Keywords: generalized gamma, sub-families, Kullback-Leibler divergence 1. Introduction The early generalization of gamma distribution can be traced back to Amoroso(1925)who discussed a generalized gam- ma distribution and applied it to fit income rates. Johnson et al. gave a four parameter generalized gamma distribution which reduces to the generalized gamma distribution defined by Stacy (1962) when the location parameter is set to ze- ro.The generalized gamma defined by Stacy (1962) is a three-parameter exponentiated gamma distribution. Mudholkar and Srivastava (1993) introduced the exponentiated method to derive a distribution. Agarwal and Al-Saleh (2001) applied generalized gamma to study hazard rates. Balakrishnan and Peng (2006) applied this distribution to develop general- ized gamma frailty model.Nadarajah and Gupta (2007) proposed another type of generalized gamma distribution with application to fitting drought data. Cordeiro et al. (2012) derived another generalization of Stacy’s generalized gamma distribution using exponentiated method, and applied it to life time and survival analysis. The generalized gamma distribution presents a flexible family in the varieties of shapes and hazard functions for modeling duration. Distributions that are used in duration analysis in economics include exponential, log-normal, gamma, and Weibull . The generalized gamma family, which encompasses exponential, gamma, and Weibull as subfamilies, and log- normal as a limiting distribution, has been used in economics by Jaggia, Yamaguchi, and Allenby et al . Some authors have argued that the flexibility of generalized gamma makes it suitable for duration analysis, while others have advocated use of simpler models because of estimation difficulties caused by the complexity of generalized gamma parameter structure. Obviously, there would be no need to endure the costs associated with the application of a complex generalized gamma model if the data do not discriminate between the generalized gamma and members of its subfamilies, or if the fit of a simpler model to the data is as good as that for the complex generalized gamma. Hager and Bain inhibited applications of the generalized gamma model. Prentice resolved the convergence problem using a nonlinear transformation of generalized gamma model. Maximum- likelihood estimation of the parameters and quasi maximum likelihood estimators for its subfamily (two-parameter gamma distribution) can be found. Hwang, T. et al introduced a new moment estimation of parameters of the generalized gamma distribution using it’s characterization. In information theory, thus far a maximum entropy (ME) derivation of generalized gamma is found in Kapur, where it is referred to as generalized Weibull distribution, and the entropy of GG has appeared in the context of flexible families of distributions. Some concepts of this family in information theory has introduced by 85 http://ijsp.ccsenet.org International Journal of Statistics and Probability Vol. 8, No. 5; 2019 Dadpay et al . 2. Parametric Distributions and Their Properties 2.1 Log-Normal Distribution A log-normal distribution is a continuous probability distribution of a random variable whose logarithm is normally distributed. If the random variable X is log-normally distributed, then Y = ln(X) has a normal distribution. Likewise, if Y has a normal distribution, then the exponential function of Y, X = exp(Y), has a log-normal distribution. A random variable which is log-normally distributed takes only positive real values. A log-normal process is the statistical realization of the multiplicative product of many independent random variables, each of which is positive. This is justified by considering the central limit theorem in the log domain. The log-normal distribution is the maximum entropy probability distribution for a random variateX for which the mean and variance of ln(X) are specified. A positive random variable X is log-normally distributed if the logarithm of X is normally distributed, ln(X) ∼ N(µ, σ2) The probability density function of the log-normal distribution is given by: 1 1 (lnx−µ)2 fX(x) = p exp(− ) (1) x σ 2π 2σ2 The cumulative distribution is given as − µ = Φ (lnx) FX(x) ( σ ) (2) which may also be expressed as : 1 lnx − µ [1 + er f ( p )] (3) 2 σ 2 where erf is the error function. Approximating formular for the characteristic function '(t) can be given as −!2(−itσ2eµ)+2!(−itσ2eµ) exp( σ2 '(t) ≈ p 2 (4) 1 + !(−itσ2eµ) where ! is the Lambert W function. For a log-normal random variable, the partial expectation is given by 2 µ+ 1 σ2 µ + σ − lnk = 2 Φ g(k) e ( σ ) (5) The quantile for the log-normal distribution is given by p exp(µ + 2σ2er f −1(2F − 1)) (6) while the variance is given as [exp(σ2) − 1]exp(2µ + σ2) (7) The skewness is given by the formular p 2 (eσ + 2) eσ2 − 1 (8) The Ex.kurtosis is then denoted by exp(4σ2) + 2exp(3σ2) + 3exp(2σ2) − 6 (9) 2 ; +1 µ + σ2 µ − σ2 The Supportp is over x (0 ) while the mean is exp( 2 ), the mode is exp( ) and the Entropy is given as µ+ 1 log2(σe 2 2π) 86 http://ijsp.ccsenet.org International Journal of Statistics and Probability Vol. 8, No. 5; 2019 2.2 Weibull Distribution The Weibull distribution is a continuous probability distribution. It is named after Swedish mathematician Waloddi Weibull, who described it in detail in 1951, although it was first identified by Frechet (1927) and first applied by Rosin and Rammler (1933) to describe a particle size distribution. The Weibull distribution is one of the most widely used lifetime distributions in reliability engineering. It is a versatile distribution that can take on the characteristics of other types of distributions, based on the value of the shape parameter. The probability density function of a Weibull random variable is ν ; ξ; ν = x ν−1 −(x/ξ)ν ; ≥ f (x ) ξ ( ξ ) e x 0 (10) where ν > 0 is the shape parameter and ξ > 0 is the scale parameter of the distribution. The cumulative distribution function for the Weibull distribution is ν F(x; ξ; ν) = 1 − e−(x/ξ) (11) The failure rate h (or hazard function) is given by ν ; ξ; ν = x ν−1 h(x ) ξ ( ξ ) (12) The quantile (inverse cumulative distribution) function for the Weibull distribution is Q(p; ξ; ν) = ξ(−ln(1 − p))1/ν (13) The moment generating function of the logarithm of a Weibull distributed random variable is given by tlogX = ξtΓ t + E[e ] (ν 1) (14) The mean and variance of a Weibull random variable can be expressed respectively as = ξΓ + 1 E(X) (1 ν ) (15) and = ξ2 Γ + 2 − Γ + 1 2 Var(X) [ (1 ν ) ( (1 ν )) ] (16) The skewness is given by 3 Γ + ξ3 − µσ2 − µ3 (1 ν ) 3 γ = (17) 1 σ3 where the mean is denoted by µ and the standard deviation is denoted by σ. The excess kurtosis is given by 4 2 2 −6Γ + 12Γ Γ2 − 3Γ − 4Γ1Γ3 + Γ4 γ = 1 1 2 (18) 2 Γ − Γ2 2 [ 2 1] The information entropy is given by ξ ξ; ν = γ − 1 + + H( ) (1 ν ) ln(ν) 1 (19) where γ is the EulerMascheroni constant. The Kullback-Leibler divergence for Weibull distribution is given by ν ν γ ξ ν 1 2 1 ν2 2 DKL(Weib1 k Weib2) = log ν − log ν + (ν1 − ν2)[logξ1 − ] + ( ) Γ( + 1) − 1 (20) ξ 1 ξ 2 ν ξ ν 1 2 1 2 1 The variances and covariances ofν ˆ and ξˆ are estimated from the inverse local Fisher matrix as 0 1− B @2Λ @2Λ C 1 0 1 B − − C B ˆ ν ˆ ν; ξˆ C B ∂ν2 ∂ν∂ξ C B Var(ˆ) Cov(ˆ )C B C B : C = B : C @B AC B C (21) ˆ ν; ξˆ ˆ ξˆ B @2Λ @2Λ C Cov(ˆ ) Var( ) @B− − AC ∂ν∂ξ ∂ξ2 ν=ν,ξˆ =ξˆ 87 http://ijsp.ccsenet.org International Journal of Statistics and Probability Vol. 8, No. 5; 2019 2.3 Exponential Distribution The exponential model, with only one unknown parameter, is the simplest of all life distribution models. The exponential distribution is one of the widely used continuous distributions.
Recommended publications
  • A Random Variable X with Pdf G(X) = Λα Γ(Α) X ≥ 0 Has Gamma
    DISTRIBUTIONS DERIVED FROM THE NORMAL DISTRIBUTION Definition: A random variable X with pdf λα g(x) = xα−1e−λx x ≥ 0 Γ(α) has gamma distribution with parameters α > 0 and λ > 0. The gamma function Γ(x), is defined as Z ∞ Γ(x) = ux−1e−udu. 0 Properties of the Gamma Function: (i) Γ(x + 1) = xΓ(x) (ii) Γ(n + 1) = n! √ (iii) Γ(1/2) = π. Remarks: 1. Notice that an exponential rv with parameter 1/θ = λ is a special case of a gamma rv. with parameters α = 1 and λ. 2. The sum of n independent identically distributed (iid) exponential rv, with parameter λ has a gamma distribution, with parameters n and λ. 3. The sum of n iid gamma rv with parameters α and λ has gamma distribution with parameters nα and λ. Definition: If Z is a standard normal rv, the distribution of U = Z2 called the chi-square distribution with 1 degree of freedom. 2 The density function of U ∼ χ1 is −1/2 x −x/2 fU (x) = √ e , x > 0. 2π 2 Remark: A χ1 random variable has the same density as a random variable with gamma distribution, with parameters α = 1/2 and λ = 1/2. Definition: If U1,U2,...,Uk are independent chi-square rv-s with 1 degree of freedom, the distribution of V = U1 + U2 + ... + Uk is called the chi-square distribution with k degrees of freedom. 2 Using Remark 3. and the above remark, a χk rv. follows gamma distribution with parameters 2 α = k/2 and λ = 1/2.
    [Show full text]
  • Stat 5101 Notes: Brand Name Distributions
    Stat 5101 Notes: Brand Name Distributions Charles J. Geyer February 14, 2003 1 Discrete Uniform Distribution Symbol DiscreteUniform(n). Type Discrete. Rationale Equally likely outcomes. Sample Space The interval 1, 2, ..., n of the integers. Probability Function 1 f(x) = , x = 1, 2, . , n n Moments n + 1 E(X) = 2 n2 − 1 var(X) = 12 2 Uniform Distribution Symbol Uniform(a, b). Type Continuous. Rationale Continuous analog of the discrete uniform distribution. Parameters Real numbers a and b with a < b. Sample Space The interval (a, b) of the real numbers. 1 Probability Density Function 1 f(x) = , a < x < b b − a Moments a + b E(X) = 2 (b − a)2 var(X) = 12 Relation to Other Distributions Beta(1, 1) = Uniform(0, 1). 3 Bernoulli Distribution Symbol Bernoulli(p). Type Discrete. Rationale Any zero-or-one-valued random variable. Parameter Real number 0 ≤ p ≤ 1. Sample Space The two-element set {0, 1}. Probability Function ( p, x = 1 f(x) = 1 − p, x = 0 Moments E(X) = p var(X) = p(1 − p) Addition Rule If X1, ..., Xk are i. i. d. Bernoulli(p) random variables, then X1 + ··· + Xk is a Binomial(k, p) random variable. Relation to Other Distributions Bernoulli(p) = Binomial(1, p). 4 Binomial Distribution Symbol Binomial(n, p). 2 Type Discrete. Rationale Sum of i. i. d. Bernoulli random variables. Parameters Real number 0 ≤ p ≤ 1. Integer n ≥ 1. Sample Space The interval 0, 1, ..., n of the integers. Probability Function n f(x) = px(1 − p)n−x, x = 0, 1, . , n x Moments E(X) = np var(X) = np(1 − p) Addition Rule If X1, ..., Xk are independent random variables, Xi being Binomial(ni, p) distributed, then X1 + ··· + Xk is a Binomial(n1 + ··· + nk, p) random variable.
    [Show full text]
  • On a Problem Connected with Beta and Gamma Distributions by R
    ON A PROBLEM CONNECTED WITH BETA AND GAMMA DISTRIBUTIONS BY R. G. LAHA(i) 1. Introduction. The random variable X is said to have a Gamma distribution G(x;0,a)if du for x > 0, (1.1) P(X = x) = G(x;0,a) = JoT(a)" 0 for x ^ 0, where 0 > 0, a > 0. Let X and Y be two independently and identically distributed random variables each having a Gamma distribution of the form (1.1). Then it is well known [1, pp. 243-244], that the random variable W = X¡iX + Y) has a Beta distribution Biw ; a, a) given by 0 for w = 0, (1.2) PiW^w) = Biw;x,x)=\ ) u"-1il-u)'-1du for0<w<l, Ío T(a)r(a) 1 for w > 1. Now we can state the converse problem as follows : Let X and Y be two independently and identically distributed random variables having a common distribution function Fix). Suppose that W = Xj{X + Y) has a Beta distribution of the form (1.2). Then the question is whether £(x) is necessarily a Gamma distribution of the form (1.1). This problem was posed by Mauldon in [9]. He also showed that the converse problem is not true in general and constructed an example of a non-Gamma distribution with this property using the solution of an integral equation which was studied by Goodspeed in [2]. In the present paper we carry out a systematic investigation of this problem. In §2, we derive some general properties possessed by this class of distribution laws Fix).
    [Show full text]
  • A Form of Multivariate Gamma Distribution
    Ann. Inst. Statist. Math. Vol. 44, No. 1, 97-106 (1992) A FORM OF MULTIVARIATE GAMMA DISTRIBUTION A. M. MATHAI 1 AND P. G. MOSCHOPOULOS2 1 Department of Mathematics and Statistics, McOill University, Montreal, Canada H3A 2K6 2Department of Mathematical Sciences, The University of Texas at El Paso, El Paso, TX 79968-0514, U.S.A. (Received July 30, 1990; revised February 14, 1991) Abstract. Let V,, i = 1,...,k, be independent gamma random variables with shape ai, scale /3, and location parameter %, and consider the partial sums Z1 = V1, Z2 = 171 + V2, . • •, Zk = 171 +. • • + Vk. When the scale parameters are all equal, each partial sum is again distributed as gamma, and hence the joint distribution of the partial sums may be called a multivariate gamma. This distribution, whose marginals are positively correlated has several interesting properties and has potential applications in stochastic processes and reliability. In this paper we study this distribution as a multivariate extension of the three-parameter gamma and give several properties that relate to ratios and conditional distributions of partial sums. The general density, as well as special cases are considered. Key words and phrases: Multivariate gamma model, cumulative sums, mo- ments, cumulants, multiple correlation, exact density, conditional density. 1. Introduction The three-parameter gamma with the density (x _ V)~_I exp (x-7) (1.1) f(x; a, /3, 7) = ~ , x>'7, c~>O, /3>0 stands central in the multivariate gamma distribution of this paper. Multivariate extensions of gamma distributions such that all the marginals are again gamma are the most common in the literature.
    [Show full text]
  • A Note on the Existence of the Multivariate Gamma Distribution 1
    - 1 - A Note on the Existence of the Multivariate Gamma Distribution Thomas Royen Fachhochschule Bingen, University of Applied Sciences e-mail: [email protected] Abstract. The p - variate gamma distribution in the sense of Krishnamoorthy and Parthasarathy exists for all positive integer degrees of freedom and at least for all real values pp 2, 2. For special structures of the “associated“ covariance matrix it also exists for all positive . In this paper a relation between central and non-central multivariate gamma distributions is shown, which implies the existence of the p - variate gamma distribution at least for all non-integer greater than the integer part of (p 1) / 2 without any additional assumptions for the associated covariance matrix. 1. Introduction The p-variate chi-square distribution (or more precisely: “Wishart-chi-square distribution“) with degrees of 2 freedom and the “associated“ covariance matrix (the p (,) - distribution) is defined as the joint distribu- tion of the diagonal elements of a Wp (,) - Wishart matrix. Its probability density (pdf) has the Laplace trans- form (Lt) /2 |ITp 2 | , (1.1) with the ()pp - identity matrix I p , , T diag( t1 ,..., tpj ), t 0, and the associated covariance matrix , which is assumed to be non-singular throughout this paper. The p - variate gamma distribution in the sense of Krishnamoorthy and Parthasarathy [4] with the associated covariance matrix and the “degree of freedom” 2 (the p (,) - distribution) can be defined by the Lt ||ITp (1.2) of its pdf g ( x1 ,..., xp ; , ). (1.3) 2 For values 2 this distribution differs from the p (,) - distribution only by a scale factor 2, but in this paper we are only interested in positive non-integer values 2 for which ||ITp is the Lt of a pdf and not of a function also assuming any negative values.
    [Show full text]
  • Negative Binomial Regression Models and Estimation Methods
    Appendix D: Negative Binomial Regression Models and Estimation Methods By Dominique Lord Texas A&M University Byung-Jung Park Korea Transport Institute This appendix presents the characteristics of Negative Binomial regression models and discusses their estimating methods. Probability Density and Likelihood Functions The properties of the negative binomial models with and without spatial intersection are described in the next two sections. Poisson-Gamma Model The Poisson-Gamma model has properties that are very similar to the Poisson model discussed in Appendix C, in which the dependent variable yi is modeled as a Poisson variable with a mean i where the model error is assumed to follow a Gamma distribution. As it names implies, the Poisson-Gamma is a mixture of two distributions and was first derived by Greenwood and Yule (1920). This mixture distribution was developed to account for over-dispersion that is commonly observed in discrete or count data (Lord et al., 2005). It became very popular because the conjugate distribution (same family of functions) has a closed form and leads to the negative binomial distribution. As discussed by Cook (2009), “the name of this distribution comes from applying the binomial theorem with a negative exponent.” There are two major parameterizations that have been proposed and they are known as the NB1 and NB2, the latter one being the most commonly known and utilized. NB2 is therefore described first. Other parameterizations exist, but are not discussed here (see Maher and Summersgill, 1996; Hilbe, 2007). NB2 Model Suppose that we have a series of random counts that follows the Poisson distribution: i e i gyii; (D-1) yi ! where yi is the observed number of counts for i 1, 2, n ; and i is the mean of the Poisson distribution.
    [Show full text]
  • Modeling Overdispersion with the Normalized Tempered Stable Distribution
    Modeling overdispersion with the normalized tempered stable distribution M. Kolossiatisa, J.E. Griffinb, M. F. J. Steel∗,c aDepartment of Hotel and Tourism Management, Cyprus University of Technology, P.O. Box 50329, 3603 Limassol, Cyprus. bInstitute of Mathematics, Statistics and Actuarial Science, University of Kent, Canterbury, CT2 7NF, U.K. cDepartment of Statistics, University of Warwick, Coventry, CV4 7AL, U.K. Abstract A multivariate distribution which generalizes the Dirichlet distribution is intro- duced and its use for modeling overdispersion in count data is discussed. The distribution is constructed by normalizing a vector of independent tempered sta- ble random variables. General formulae for all moments and cross-moments of the distribution are derived and they are found to have similar forms to those for the Dirichlet distribution. The univariate version of the distribution can be used as a mixing distribution for the success probability of a binomial distribution to define an alternative to the well-studied beta-binomial distribution. Examples of fitting this model to simulated and real data are presented. Keywords: Distribution on the unit simplex; Mice fetal mortality; Mixed bino- mial; Normalized random measure; Overdispersion 1. Introduction In many experiments we observe data as the number of observations in a sam- ple with some property. The binomial distribution is a natural model for this type of data. However, the data are often found to be overdispersed relative to that ∗Corresponding author. Tel.: +44-2476-523369; fax: +44-2476-524532 Email addresses: [email protected] (M. Kolossiatis), [email protected] (J.E. Griffin), [email protected] (M.
    [Show full text]
  • Supplementary Information on the Negative Binomial Distribution
    Supplementary information on the negative binomial distribution November 25, 2013 1 The Poisson distribution Consider a segment of DNA as a collection of a large number, say N, of individual intervals over which a break can take place. Furthermore, suppose the probability of a break in each interval is the same and is very small, say p(N), and breaks occur in the intervals independently of one another. Here, we need p to be a function of N because if there are more locations (i.e. N is increased) we need to make p smaller if we want to have the same model (since there are more opportunities for breaks). Since the number of breaks is a cumulative sum of iid Bernoulli variables, the distribution of breaks, y, is binomially distributed with parameters (N; p(N)), that is N P (y = k) = p(N)k(1 − p(N))N−k: k If we suppose p(N) = λ/N, for some λ, and we let N become arbitrarily large, then we find N P (y = k) = lim p(N)k(1 − p(N))N−k N!1 k N ··· (N − k + 1) = lim (λ/N)k(1 − λ/N)N−k N!1 k! λk N ··· (N − k + 1) = lim (1 − λ/N)N (1 − λ/N)−k N!1 k! N k = λke−λ=k! which is the Poisson distribution. It has a mean and variance of λ. The mean of a discrete random P 2 variable, y, is EY = k kP (Y = k) and the variance is E(Y − EY ) .
    [Show full text]
  • 5 the Poisson Process for X ∈ [0, ∞)
    Gamma(λ, r). There are a couple different ways that gamma distributions are parametrized|either in terms of λ and r as done here, or in terms of α and β. The connection is α = r, and β = 1/λ, A short introduction to Bayesian which is the expected time to the first event in a statistics, part III Poisson process. The probability density function Math 217 Probability and Statistics for a gamma distribution is Prof. D. Joyce, Fall 2014 xα−1e−x/β λrxr−1e−λx f(x) = = βαΓ(α) Γ(r) 5 The Poisson process for x 2 [0; 1). The mean of a gamma distribution 2 2 A Poisson process is the continuous version of a is αβ = r/λ while its variance is αβ = r/λ . Bernoulli process. In a Bernoulli process, time is Our job is to get information about this param- discrete, and at each time unit there is a certain eter λ. Using the Bayesian approach, we have a probability p that success occurs, the same proba- prior density function f(λ) on λ. Suppose over a bility at any given time, and the events at one time time interval of length t we observe k events. The instant are independent of the events at other time posterior density function is proportional to a con- instants. ditional probability times the prior density function In a Poisson process, time is continuous, and f(λ j k) / P (k j λ) f(λ): there is a certain rate λ of events occurring per unit time that is the same for any time interval, and Now, k and t are constants, so events occur independently of each other.
    [Show full text]
  • A New Family of Generalized Gamma Distribution and Its Application
    Journal of Mathematics and Statistics 10 (2): 211-220, 2014 ISSN: 1549-3644 © 2014 Science Publications doi:10.3844/jmssp.2014.211.220 Published Online 10 (2) 2014 (http://www.thescipub.com/jmss.toc) A NEW FAMILY OF GENERALIZED GAMMA DISTRIBUTION AND ITS APPLICATION Satsayamon Suksaengrakcharoen and Winai Bodhisuwan Department of Statistics, Faculty of Science, Kasetsart University, Bangkok, 10900, Thailand Received 2014-03-06; Revised 2014-03-24; Accepted 2014-04-09 ABSTRACT The mixture distribution is defined as one of the most important ways to obtain new probability distributions in applied probability and several research areas. According to the previous reason, we have been looking for more flexible alternative to the lifetime data. Therefore, we introduced a new mixed distribution, namely the Mixture Generalized Gamma (MGG) distribution, which is obtained by mixing between generalized gamma distribution and length biased generalized gamma distribution is introduced. The MGG distribution is capable of modeling bathtub-shaped hazard rate, which contains special sub- models, namely, the exponential, length biased exponential, generalized gamma, length biased gamma and length biased generalized gamma distributions. We present some useful properties of the MGG distribution such as mean, variance, skewness, kurtosis and hazard rate. Parameter estimations are also implemented using maximum likelihood method. The application of the MGG distribution is illustrated by real data set. The results demonstrate that MGG distribution can provide the fitted values more consistent and flexible framework than a number of distribution include important lifetime data; the generalized gamma, length biased generalized gamma and the three parameters Weibull distributions. Keywords: Generalized Gamma Distribution, Length Biased Generalized Gamma Distribution, Mixture Distribution, Hazard Rate, Life Time Data Analysis 1.
    [Show full text]
  • Poisson Processes
    Chapter 10 Poisson processes 10.1 Overview The Binomial distribution and the geometric distribution describe the be- havior of two random variables derived from the random mechanism that I have called coin tossing. The name coin tossing describes the whole mecha- nism; the names Binomial and geometric refer to particular aspects of that mechanism. If we increase the tossing rate to n tosses per second and de- crease the probability of heads to a small p, while keeping the expected number of heads per second fixed at λ = np, the number of heads in a t second interval will have approximately a Bin(nt; p) distribution, which is close to the Poisson(λt). Also, the numbers of heads tossed during disjoint time intervals will still be independent random variables. In the limit, as n ! 1, we get an idealization called a Poisson process. Remark. The double use of the name Poisson is unfortunate. Much confusion would be avoided if we all agreed to refer to the mechanism as \idealized-very-fast-coin-tossing", or some such. Then the Poisson distribution would have the same relationship to idealized-very- fast-coin-tossing as the Binomial distribution has to coin-tossing. Conversely, I could create more confusion by renaming coin tossing as \the binomial process". Neither suggestion is likely to be adopted, so you should just get used to having two closely related objects with the name Poisson. Definition. A Poisson process with rate λ on [0; 1) is a random mechanism that generates \points" strung out along [0; 1) in such a way that (i) the number of points landing in any subinterval of length t is a random variable with a Poisson(λt) distribution Statistics 241/541 fall 2014 c David Pollard, 2Nov2014 1 10.
    [Show full text]
  • Handbook on Probability Distributions
    R powered R-forge project Handbook on probability distributions R-forge distributions Core Team University Year 2009-2010 LATEXpowered Mac OS' TeXShop edited Contents Introduction 4 I Discrete distributions 6 1 Classic discrete distribution 7 2 Not so-common discrete distribution 27 II Continuous distributions 34 3 Finite support distribution 35 4 The Gaussian family 47 5 Exponential distribution and its extensions 56 6 Chi-squared's ditribution and related extensions 75 7 Student and related distributions 84 8 Pareto family 88 9 Logistic distribution and related extensions 108 10 Extrem Value Theory distributions 111 3 4 CONTENTS III Multivariate and generalized distributions 116 11 Generalization of common distributions 117 12 Multivariate distributions 133 13 Misc 135 Conclusion 137 Bibliography 137 A Mathematical tools 141 Introduction This guide is intended to provide a quite exhaustive (at least as I can) view on probability distri- butions. It is constructed in chapters of distribution family with a section for each distribution. Each section focuses on the tryptic: definition - estimation - application. Ultimate bibles for probability distributions are Wimmer & Altmann (1999) which lists 750 univariate discrete distributions and Johnson et al. (1994) which details continuous distributions. In the appendix, we recall the basics of probability distributions as well as \common" mathe- matical functions, cf. section A.2. And for all distribution, we use the following notations • X a random variable following a given distribution, • x a realization of this random variable, • f the density function (if it exists), • F the (cumulative) distribution function, • P (X = k) the mass probability function in k, • M the moment generating function (if it exists), • G the probability generating function (if it exists), • φ the characteristic function (if it exists), Finally all graphics are done the open source statistical software R and its numerous packages available on the Comprehensive R Archive Network (CRAN∗).
    [Show full text]