Size-Biased Discrete-Lindley Distribution and Its Applications to Model Distribution of Freely-Forming Small Group Size

Total Page:16

File Type:pdf, Size:1020Kb

Size-Biased Discrete-Lindley Distribution and Its Applications to Model Distribution of Freely-Forming Small Group Size Biometrics & Biostatistics International Journal Research Article Open Access Size-biased discrete-lindley distribution and its applications to model distribution of freely-forming small group size Abstract Volume 7 Issue 2 - 2018 A size-biased discrete-Lindley distribution (SBDLD) has been proposed by size- biasing the discrete Lindley distribution (DLD).The moments about origin and Simon Sium, Rama Shanker moments about mean have been obtained and hence expressions for coefficient of Department of Statistics, College of Science, Eritrea Institute of variation (C.V.), skewness, kurtosis and index of dispersion have been given. The Technology, Asmara, Eritrea estimate of the parameter of SBDLD by both the method of moment and the method of maximum likelihood are the same. Applications of SBDLD have been discussed with Correspondence: Rama Shanker, Department of Statistics, four examples of observed real datasets relating to freely-forming small group size College of Science, Eritrea Institute of Technology, Asmara, at public places. The goodness of fit of SBDLD shows quite satisfactory fit over size Eritrea, Email [email protected] biased Poisson and size-biased Poisson-Lindley Distributions. Received: March 03, 2018 | Published: April 03, 2018 Keywords: Size-biased distribution, Discrete-Lindley distribution, Moments and moments based measures, Estimation of parameter, Goodness of fit Introduction Lindley7 introduced one parameter Lindley distribution having probability density function (pdf) and cumulative distribution function Let a random variable X has probability distribution (cdf). θ 2 Px0 ( ;θθ) ; x= 0,1, 2,..., > 0 . If sample units are weighted or −θ x α f( x;θθ) =(1 +xe) ; x >> 0, 0 (1.2) selected with probability proportional to x , then the corresponding θ + 1 size-biased distribution of order α is given by its probability mass function (pmf) θ x −θ x α Fx( ;θθ )=−+ 1 1 e, x> 0, > 0 (1.3) x⋅ Px0 ( ;θ ) θ = θ + 1 Px1 ( ; ) (1.1) µ′ α Ghitany et al.8 have detailed study on various statistical and ∞ αα mathematical properties, estimation of parameter and application of µθ′ = = ∑ α = Where α E( X) xP0 ( x; ) . When 1, the x=0 Lindley distribution and it has benn showed that Lindley distribution distribution is known as simple size-biased distribution and is gives better fit over exponential distribution to model waiting time applicable for size-biased sampling and forα = 2 , the distribution is data in a bank. Shanker et al.9 have detailed comparative study on known as area-biased distribution and is applicable for area-biased modeling of lifetimes data from engineering and medical science using sampling. In many statistical sampling situations care must be taken both Lindley and exponential distributions and showed that both are so that one does not inadvertently sample from size-biased distribution competing and in majority of datasets exponential distribution gives in place of the one intended. Size-biased distributions are a particular better fit over Lindley distribution. case of weighted distributions which arise naturally in practice Recently, Berhane & Shanker10 introduced discrete-Lindley when observations from a sample are recorded with probability distribution (DLD), a discrete version of Lindley distribution using proportional to some measure of unit size. In field applications, size- infinite series approach, having pmf. biased distributions can arise either because individuals are sampled θ 2 with unequal probability by design or because of unequal detection − (e 1) −θ probability. Size-biased distributions come into play when organisms θθ= +=x > P0 ( x; ) 2θ (1xe) ; x 0,1, 2, 3,...., 0 (1.4) occur in groups, and group size influences the probability of detection. e Fisher1 firstly introduced these distributions to model ascertainment Various statistical properties of DLD, estimation of parameter and biases which were later formalized by Rao2 in a unifying theory applications to model count data have been studied by Berhane & for problems where the observations fall in non-experimental, non- Shanker10 and it has been observed that it gives better fit than both replicated and non-random categories. Size-biased distributions Poisson distribution and Poisson-Lindley distribution, a Poisson have applications in environmental science, econometrics, social mixture of Lindley7 distribution and introduced by Sankaran.11 The science, biomedical science, human demography, ecology, geology, first four moments about origin and the variance of DLD obtained by forestry etc. Further, size-biasing occurs in many unexpected context Berhane & Shanker10 are given by such as statistical estimation, renewal theory, infinite divisibility of 2 µ ′ = distributions and number theory. Many researchers have done work 1 θ , on size-biased distributions including Patil & ord,3 Patil & Rao,4,5 (e − 1) Patil,6 are some among others. Submit Manuscript | http://medcraveonline.com Biom Biostat Int J. 2018;7(2):131–136. 131 © 2018 Sium et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and build upon your work non-commercially. Copyright: Size-biased discrete-lindley distribution and its applications to model distribution of freely-forming small 132 group size ©2018 Sium et al. θ parameterθ has been drawn in Figure 1. It would be recalled that 22(e + ) ′ the pmf of size-biased Poisson-Lindley distribution (SBPLD) having µ2 = , θ 2 parameter θ > 0 given by (e − 1) θ 3 xx( ++θ 2) 2θθ Px;θθ= ;x= 1, 2, 3,..., ;> 0 (2.5) 2(ee++ 74) 3 ( ) x+2 ′ θ + 2 θ + 1 µ3 = ( ) θ 3 (e − 1) 13 has been introduced by Ghitany & Mutairi, which is a size- 32θ θθ biased version of Poisson-Lindley distribution (PLD) introduced 2(eee+ 18 ++ 33 8) by Sankaran.11 Ghitany & Mutairi13 have discussed its various ′ µ4 = , θ 4 mathematical and statistical properties, estimation of the parameter (e − 1) using maximum likelihood estimation and the method of moments, and goodness of fit. Shanker et al.14 has critical study on the θ 2 2e applications of SBPLD for modeling data on thunderstorms and found µσ2 = = , that SBPLD is a better model for thunderstorms than size-biased θ 2 (e − 1) Poisson distribution (SBPD). θθ 21ee( + ) µ3 = , θ 3 (e − 1) θθ2 θ 2ee( ++ 10 e 1) µ4 = θ 4 (e − 1) In this paper, size- biased discrete Lindley distribution has been Figure 1 Behavior of pmf of SBDLD for varying values of the parameterθ . proposed and its moments about origin and moments about mean have been obtained. Behaviors of coefficient of variation, Skewness, Moments kurtosis, and index of dispersion have been discussed graphically for varying values of parameter. The method of moment and the method of The probability generating function (G(t)) and the moment maximum likelihood give the same estimate of the parameter. Finally generating function (M(t)) of SBDLD can be obtained as applications of SBDLD have been discussed with four examples of θ 3 observed real datasets relating to distribution of freely-forming small te(− 1) θ Gt( ) = for t≠ e, (3.1) group size at various public places and the fit by SBDLD has been θ 3 (et− ) observed to be quite satisfactory. and Size-biased discrete-lindley distribution θθ3 2(−t ) (ee− 1) Using (1.1) and (1.4) and the expression for the mean of DLD, = ≠ θ Mt( ) 3 for t . (3.2) θ −t a size-biased discrete-Lindley distribution (SBDLD) with parameter (e − 1) θ > 0 can be defined by its pmf. It can be easily verified that the function in (3.2) is infinitely θ 3 differentiable with respect to t , since it involves exponential terms xP⋅ 0 ( x;θ ) (e − 1) 2 −θ x Px;θ = =+=(xxe ) ; x 1, 2, 3,.., (2.1) ′ 2 ( ) 2θ of its argument. This means that all moments about origin µr ,1r ≥ µ′ 2e th 1 of SBDLD can be obtained. The r moment about origin of SBDLD It can be easily verified that SBDLD is unimodal and have (2.1) can be obtained as θ 3 increasing failure rate. Since e − 1 ∞ '2rr( ) −θ x µ =EX() = ∑ x() x+ x e Px2 ( + 1; θ ) 12 r 2θ = + x=1 θ 1 2e Px( ;θ ) e x 2 θ 3 − ∞∞ (e 1) +−θθ+ − = ∑∑rx12+ r x 2θ ((xe xe 2e xx=11= is a deceasing function of x , Px2 ( ;θ ) is log-concave. Therefore, SBDLD is unimodal, has an increasing failure rate (IFR), and hence Taking r = 1, 2, 3 and 4 and simplifying the complicated and increasing failure rate average (IFRA). It is new better than used in tedious algebraic expression, the first four raw moments (moments expectation (NBUE) and has decreasing mean residual life (DMRL). about the origin) of the SBDLD (2.1) can be obtained as The definitions, concepts and interrelationship between these aging θ 12 e + 2 concepts have been discussed in Barlow & Proschan. µ ' = 1 θ (e − 1) Behavior of the pmf of SBDLD (2.1) for varying values of the Citation: Sium S, Shanker R. Size-biased discrete-lindley distribution and its applications to model distribution of freely-forming small group size. Biom Biostat Int J. 2018;7(2):131–136. DOI: 10.15406/bbij.2018.07.00200 Size-biased discrete-lindley distribution and its applications to model distribution of freely-forming small Copyright: 133 group size ©2018 Sium et al. 2θθ θθ ' ee++74 µ 3ee (+ 1) µ = β =3 = 2 θ 2 1 32 32 (e − 1) µ θ 2 (3e ) 32θ θθ eee+18 ++ 33 8 µ ' = 2θθ 3 θ 3 µ (ee++13 1) (e − 1) β =4 = 2 2 θ θθ θθ 43 2 µ2 3e ' ee++41 171 e ++ 131 e 16 µ = 2 θ 4 θ 4 σ 3e (e − 1) γ = = θθ µ ′ −+ Now, using the relationship between central moments (moments 1 (ee12)( ) about mean) and the raw moments, the central moments of the It can be easily verified that SBDLD is over-dispersed (µσ< 2 ) SBDLD (2.1) can be obtained as θ 3e 2 2 µσ=2 = equi-dispersed (µσ= ) and under-dispersed (µσ> ) for 2 θ 2 (e − 1) ∗ θθ>=<( ) = 1.00505 .
Recommended publications
  • A Two Parameter Discrete Lindley Distribution
    Revista Colombiana de Estadística January 2016, Volume 39, Issue 1, pp. 45 to 61 DOI: http://dx.doi.org/10.15446/rce.v39n1.55138 A Two Parameter Discrete Lindley Distribution Distribución Lindley de dos parámetros Tassaddaq Hussain1;a, Muhammad Aslam2;b, Munir Ahmad3;c 1Department of Statistics, Government Postgraduate College, Rawalakot, Pakistan 2Department of Statistics, Faculty of Sciences, King Abdulaziz University, Jeddah, Saudi Arabia 3National College of Business Administration and Economics, Lahore, Pakistan Abstract In this article we have proposed and discussed a two parameter discrete Lindley distribution. The derivation of this new model is based on a two step methodology i.e. mixing then discretizing, and can be viewed as a new generalization of geometric distribution. The proposed model has proved itself as the least loss of information model when applied to a number of data sets (in an over and under dispersed structure). The competing mod- els such as Poisson, Negative binomial, Generalized Poisson and discrete gamma distributions are the well known standard discrete distributions. Its Lifetime classification, kurtosis, skewness, ascending and descending factorial moments as well as its recurrence relations, negative moments, parameters estimation via maximum likelihood method, characterization and discretized bi-variate case are presented. Key words: Characterization, Discretized version, Estimation, Geometric distribution, Mean residual life, Mixture, Negative moments. Resumen En este artículo propusimos y discutimos la distribución Lindley de dos parámetros. La obtención de este Nuevo modelo está basada en una metodo- logía en dos etapas: mezclar y luego discretizar, y puede ser vista como una generalización de una distribución geométrica. El modelo propuesto de- mostró tener la menor pérdida de información al ser aplicado a un cierto número de bases de datos (con estructuras de supra y sobredispersión).
    [Show full text]
  • Descriptive Statistics
    IMGD 2905 Descriptive Statistics Chapter 3 Summarizing Data Q: how to summarize numbers? • With lots of playtesting, there is a lot of data (good!) • But raw data is often just a pile of numbers • Rarely of interest, or even sensible Summarizing Data Measures of central tendency Groupwork 4 3 7 8 3 4 22 3 5 3 2 3 • Indicate central tendency with one number? • What are pros and cons of each? Measure of Central Tendency: Mean • Aka: “arithmetic mean” or “average” =AVERAGE(range) =AVERAGEIF() – averages if numbers meet certain condition Measure of Central Tendency: Median • Sort values low to high and take middle value =MEDIAN(range) Measure of Central Tendency: Mode • Number which occurs most frequently • Not too useful in many cases Best use for categorical data • e.g., most popular Hero group in http://pad3.whstatic.com/images/thumb/c/cd/Find-the-Mode-of-a-Set-of-Numbers- Heroes of theStep-7.jpg/aid130521-v4-728px-Find-the-Mode-of-a-Set-of-Numbers-Step-7.jpg Storm =MODE() Mean, Median, Mode? frequency Mean, Median, Mode? frequency Mean, Median, Mode? frequency Mean, Median, Mode? frequency Mean, Median, Mode? frequency Mean, Median, Mode? mean median modes mode mean median frequency no mode mean (a) median (b) frequency mode mode median median (c) mean mean frequency frequency frequency (d) (e) Which to Use: Mean, Median, Mode? Which to Use: Mean, Median, Mode? • Mean many statistical tests that use sample ‒ Estimator of population mean ‒ Uses all data Which to Use: Mean, Median, Mode? • Median is useful for skewed data ‒ e.g., income data (US Census) or housing prices (Zillo) ‒ e.g., Overwatch team (6 players): 5 people level 5, 1 person level 275 ₊ Mean is 50 - not so useful since no one at this level ₊ Median is 5 – perhaps more representative ‒ Does not use all data.
    [Show full text]
  • The Need to Measure Variance in Experiential Learning and a New Statistic to Do So
    Developments in Business Simulation & Experiential Learning, Volume 26, 1999 THE NEED TO MEASURE VARIANCE IN EXPERIENTIAL LEARNING AND A NEW STATISTIC TO DO SO John R. Dickinson, University of Windsor James W. Gentry, University of Nebraska-Lincoln ABSTRACT concern should be given to the learning process rather than performance outcomes. The measurement of variation of qualitative or categorical variables serves a variety of essential We contend that insight into process can be gained purposes in simulation gaming: de- via examination of within-student and across-student scribing/summarizing participants’ decisions for variance in decision-making. For example, if the evaluation and feedback, describing/summarizing role of an experiential exercise is to foster trial-and- simulation results toward the same ends, and error learning, the variance in a student’s decisions explaining variation for basic research purposes. will be greatest when the trial-and-error stage is at Extant measures of dispersion are not well known. its peak. To the extent that the exercise works as Too, simulation gaming situations are peculiar in intended, greater variances would be expected early that the total number of observations available-- in the game play as opposed to later, barring the usually the number of decision periods or the game world encountering a discontinuity. number of participants--is often small. Each extant measure of dispersion has its own limitations. On the other hand, it is possible that decisions may Common limitations are that, under ordinary be more consistent early and then more variable later circumstances and especially with small numbers if the student is trying to recover from early of observations, a measure may not be defined and missteps.
    [Show full text]
  • Standard Deviation - Wikipedia Visited on 9/25/2018
    Standard deviation - Wikipedia Visited on 9/25/2018 Not logged in Talk Contributions Create account Log in Article Talk Read Edit View history Wiki Loves Monuments: The world's largest photography competition is now open! Photograph a historic site, learn more about our history, and win prizes. Main page Contents Featured content Standard deviation Current events Random article From Wikipedia, the free encyclopedia Donate to Wikipedia For other uses, see Standard deviation (disambiguation). Wikipedia store In statistics , the standard deviation (SD, also represented by Interaction the Greek letter sigma σ or the Latin letter s) is a measure that is Help used to quantify the amount of variation or dispersion of a set About Wikipedia of data values.[1] A low standard deviation indicates that the data Community portal Recent changes points tend to be close to the mean (also called the expected Contact page value) of the set, while a high standard deviation indicates that A plot of normal distribution (or bell- the data points are spread out over a wider range of values. shaped curve) where each band has a Tools The standard deviation of a random variable , statistical width of 1 standard deviation – See What links here also: 68–95–99.7 rule population, data set , or probability distribution is the square root Related changes Upload file of its variance . It is algebraically simpler, though in practice [2][3] Special pages less robust , than the average absolute deviation . A useful Permanent link property of the standard deviation is that, unlike the variance, it Page information is expressed in the same units as the data.
    [Show full text]
  • Population Dispersion
    AccessScience from McGraw-Hill Education Page 1 of 7 www.accessscience.com Population dispersion Contributed by: Francis C. Evans Publication year: 2014 The spatial distribution at any particular moment of the individuals of a species of plant or animal. Under natural conditions organisms are distributed either by active movements, or migrations, or by passive transport by wind, water, or other organisms. The act or process of dissemination is usually termed dispersal, while the resulting pattern of distribution is best referred to as dispersion. Dispersion is a basic characteristic of populations, controlling various features of their structure and organization. It determines population density, that is, the number of individuals per unit of area, or volume, and its reciprocal relationship, mean area, or the average area per individual. It also determines the frequency, or chance of encountering one or more individuals of the population in a particular sample unit of area, or volume. The ecologist therefore studies not only the fluctuations in numbers of individuals in a population but also the changes in their distribution in space. See also: POPULATION DISPERSAL. Principal types of dispersion The dispersion pattern of individuals in a population may conform to any one of several broad types, such as random, uniform, or contagious (clumped). Any pattern is relative to the space being examined; a population may appear clumped when a large area is considered, but may prove to be distributed at random with respect to a much smaller area. Random or haphazard. This implies that the individuals have been distributed by chance. In such a distribution, the probability of finding an individual at any point in the area is the same for all points (Fig.
    [Show full text]
  • Summarizing Measured Data
    SummarizingSummarizing MeasuredMeasured DataData Raj Jain Washington University in Saint Louis Saint Louis, MO 63130 [email protected] These slides are available on-line at: http://www.cse.wustl.edu/~jain/cse567-08/ Washington University in St. Louis CSE567M ©2008 Raj Jain 12-1 OverviewOverview ! Basic Probability and Statistics Concepts: CDF, PDF, PMF, Mean, Variance, CoV, Normal Distribution ! Summarizing Data by a Single Number: Mean, Median, and Mode, Arithmetic, Geometric, Harmonic Means ! Mean of A Ratio ! Summarizing Variability: Range, Variance, percentiles, Quartiles ! Determining Distribution of Data: Quantile-Quantile plots Washington University in St. Louis CSE567M ©2008 Raj Jain 12-2 PartPart III:III: ProbabilityProbability TheoryTheory andand StatisticsStatistics 1. How to report the performance as a single number? Is specifying the mean the correct way? 2. How to report the variability of measured quantities? What are the alternatives to variance and when are they appropriate? 3. How to interpret the variability? How much confidence can you put on data with a large variability? 4. How many measurements are required to get a desired level of statistical confidence? 5. How to summarize the results of several different workloads on a single computer system? 6. How to compare two or more computer systems using several different workloads? Is comparing the mean sufficient? 7. What model best describes the relationship between two variables? Also, how good is the model? Washington University in St. Louis CSE567M ©2008 Raj Jain 12-3 BasicBasic ProbabilityProbability andand StatisticsStatistics ConceptsConcepts ! Independent Events: Two events are called independent if the occurrence of one event does not in any way affect the probability of the other event.
    [Show full text]
  • Swedish Translation for the ISI Multilingual Glossary of Statistical Terms, Prepared by Jan Enger, Bernhard Huitfeldt, Ulf Jorner, and Jan Wretman
    Swedish translation for the ISI Multilingual Glossary of Statistical Terms, prepared by Jan Enger, Bernhard Huitfeldt, Ulf Jorner, and Jan Wretman. Finally revised version, January 2008. For principles, see the appendix.
    [Show full text]
  • A Modified Conway-Maxwell-Poisson Type Binomial Distribution and Its
    A modified Conway–Maxwell–Poisson type binomial distribution and its applications T. Imotoa, C. M. Ngb, S. H. Ongb and S. Chakrabortyc a School of Management and Information, University of Shizuoka, 51-1 Yada, Suruga-ku, Shuzuoka 422-8526, Japan b Institute of Mathematical Sciences, Faculty of Science, University of Malaya, 50603 Kuala Lumpur, Malaysia c Department of Statistics, Dibrugarh University, Dibrugarh-786004, Assam, India Corresponding author E-mail: [email protected] Key Words: dispersion; exponential family; kurtosis; modality; queueing process; skewness. Abstract This paper proposes a generalized binomial distribution with four parameters, which is derived from the finite capacity queueing system with state-dependent service and arrival rates. This distribution is also generated from the conditional Conway–Maxwell– Poisson distribution given a sum of two Conway–Maxwell–Poisson variables. In this paper, we consider the properties about the probability mass function, index of dispersion, skew- ness and kurtosis and give applications of the proposed distribution from its geneses. The estimation method and simulation study are also considered. 1. Introduction The binomial distribution is classically utilized as a model for analyzing count data with finite support since it has a simple genesis arising from Bernoulli trials and belongs to the exponential family that can model under-dispersion, in which the variance is smaller than the mean. However, its reliance on the simple genesis limits its flexibility in many applications. arXiv:1610.05232v1 [math.ST] 17 Oct 2016 For examples, the binomial distribution cannot model over-dispersion, lepto-kurtosis when the mode is around n/2 and platy-kurtosis when the mode is around 0 or n, where n is an integer parameter of the binomial distribution.
    [Show full text]
  • Exponential Dispersion Models for Overdispersed Zero-Inflated Count
    Exponential Dispersion Models for Overdispersed Zero-Inflated Count Data Shaul K. Bar-Leva Ad Ridderb aFaculty of Tech. Man., Holon Institute of Technology, Holon, Israel, [email protected] bDepartment of EOR, VU University, Amsterdam, Netherlands, [email protected] April 1, 2020 Abstract We consider three new classes of exponential dispersion models of discrete probability dis- tributions which are defined by specifying their variance functions in their mean value parame- terization. In a previous paper (Bar-Lev and Ridder, 2020a), we have developed the framework of these classes and proved that they have some desirable properties. Each of these classes was shown to be overdispersed and zero inflated in ascending order, making them as competitive statistical models for those in use in statistical modeling. In this paper we elaborate on the computational aspects of their probability mass functions. Furthermore, we apply these classes for fitting real data sets having overdispersed and zero-inflated statistics. Classic models based on Poisson or negative binomial distributions show poor fits, and therefore many alternatives have already proposed in recent years. We execute an extensive comparison with these other proposals, from which we may conclude that our framework is a flexible tool that gives excellent results in all cases. Moreover, in most cases our model gives the best fit. Keywords: Count distributions, Exponential dispersion models, Overdispersion, Zero-inflated models, Fit models 1 Introduction In many scientific fields, one deals with systems, experiments, or phenomena that have random arXiv:2003.13854v1 [stat.ME] 30 Mar 2020 discrete outcomes. These outcomes are revealed typically through a set of data obtained by ob- serving the system and counting the occurrences.
    [Show full text]
  • Nematic Van Der Waals Free Energy
    Available Online JOURNAL OF SCIENTIFIC RESEARCH Publications J. Sci. Res. 11 (1), 1-13 (2019) www.banglajol.info/index.php/JSR On Poisson-Weighted Lindley Distribution and Its Applications R. Shanker, K. K. Shukla* Department of Statistics, College of Science, Eritrea Institute of Technology, Asmara, Eritrea Received 26 May 2018, accepted in final revised form 18 July 2018 Abstract In this paper the nature and behavior of its coefficient of variation, skewness, kurtosis and index of dispersion of Poisson- weighted Lindley distribution (P-WLD), a Poisson mixture of weighted Lindley distribution, have been proposed and the nature and behavior have been explained graphically. Maximum likelihood estimation has been discussed to estimate its parameters. Applications of the proposed distribution have been discussed and its goodness of fit has been compared with Poisson distribution (PD), Poisson-Lindley distribution (PLD), negative binomial distribution (NBD) and generalized Poisson-Lindley distribution (GPLD). Keywords: Poisson lindley distribution; Weighted lindley distribution, Compounding; Skewness; Kurtosis; Maximum likelihood estimation. © 2019 JSR Publications. ISSN: 2070-0237 (Print); 2070-0245 (Online). All rights reserved. doi: http://dx.doi.org/10.3329/jsr.v11i1.35745 J. Sci. Res. 11 (1), 1-13 (2019) 1. Introduction M. Shankaran [1] proposed the Poisson-Lindley distribution (PLD) to model count data defined by its probability mass function (pmf) 2 x 2 (1.1) P1 x; ; x 0,1,2,..., 0 1x3 Shanker and Hagos [2] proposed a simple method of finding moments of PLD and discussed the applications of PLD to model count data from biological sciences. Shanker and Shukla [3] proposed a generalized size-biased Lindley distribution which includes size-biased Poisson-Lindley distribution introduced by Ghitany et al.
    [Show full text]
  • On the Distribution Theory of Over-Dispersion Evdokia Xekalaki
    Xekalaki Journal of Statistical Distributions and Applications 2014, 1:19 http://www.jsdajournal.com/content/1/1/19 REVIEW Open Access On the distribution theory of over-dispersion Evdokia Xekalaki Correspondence: [email protected] Department of Statistics, Athens Abstract University of Economics and Business, 76 Patision St., Athens, An overview of the evolution of probability models for over-dispersion is given Greece looking at their origins, motivation, first main contributions, important milestones and applications. A specific class of models called the Waring and generalized Waring models will be a focal point. Their advantages relative to other classes of models and how they can be adapted to handle multivariate data and temporally evolving data will be highlighted. Keywords: Heterogeneity; Contagion; Clustering; Spells model; Accident proneness; Mixtures; Zero-adjusted models; Biased sampling; Generalized Waring distribution; Generalized Waring process 1. Introduction Data analysts have often to deal with data that exhibit a variability that differs from what they expect on the basis of the hypothesized model. The phenomenon is known as overdispersion if the observed variability exceeds the expected variability or under- dispersion if it is lower than expected. Such differences between observed and nominal variances can be interpreted as brought about by failures of some of the basic assumptions of the model. These can be classified by the mechanism leading to them. As summarized by Xekalaki (2006), in traditional experimental contexts, they may be caused by deviations from the hypothe- sized structure of the population, due to lack of independence between individual item responses, contagion, clustering, and heterogeneity. In observational study contexts, on the other hand, they are the result of the method of ascertainment, which can lead to partial distortion of the observations.
    [Show full text]
  • Akash Distribution, Compounding, Moments, Skewness, Kurtosis, Estimation of Parameter, Statistical Properties, Goodness of Fit
    International Journal of Probability and Statistics 2017, 6(1): 1-10 DOI: 10.5923/j.ijps.20170601.01 The Discrete Poisson-Akash Distribution Rama Shanker Department of Statistics, Eritrea Institute of Technology, Asmara, Eritrea Abstract A Poisson-Akash distribution has been obtained by compounding Poisson distribution with Akash distribution introduced by Shanker (2015). A general expression for the r th factorial moment has been derived and hence the first four moments about origin and the moments about mean has been obtained. The expressions for its coefficient of variation, skewness and kurtosis have been obtained. Its statistical properties including generating function, increasing hazard rate and unimodality and over-dispersion have been discussed. The maximum likelihood estimation and the method of moments for estimating its parameter have been discussed. The goodness of fit of the proposed distribution using maximum likelihood estimation has been given for some count data-sets and the fit is compared with that obtained by other distributions. Keywords Akash distribution, Compounding, Moments, Skewness, Kurtosis, Estimation of parameter, Statistical properties, Goodness of fit lifetime data from engineering and medical science and 1. Introduction concluded that Akash distribution has some advantage over Lindley and exponential distributions, Lindley distribution A lifetime distribution named, “Akash distribution” has some advantage over Akash and exponential having probability density function distributions and exponential distribution
    [Show full text]