Relationships Among Some Univariate Distributions

Total Page:16

File Type:pdf, Size:1020Kb

Relationships Among Some Univariate Distributions IIE Transactions (2005) 37, 651–656 Copyright C “IIE” ISSN: 0740-817X print / 1545-8830 online DOI: 10.1080/07408170590948512 Relationships among some univariate distributions WHEYMING TINA SONG Department of Industrial Engineering and Engineering Management, National Tsing Hua University, Hsinchu, Taiwan, Republic of China E-mail: [email protected], wheyming [email protected] Received March 2004 and accepted August 2004 The purpose of this paper is to graphically illustrate the parametric relationships between pairs of 35 univariate distribution families. The families are organized into a seven-by-five matrix and the relationships are illustrated by connecting related families with arrows. A simplified matrix, showing only 25 families, is designed for student use. These relationships provide rapid access to information that must otherwise be found from a time-consuming search of a large number of sources. Students, teachers, and practitioners who model random processes will find the relationships in this article useful and insightful. 1. Introduction 2. A seven-by-five matrix An understanding of probability concepts is necessary Figure 1 illustrates 35 univariate distributions in 35 if one is to gain insights into systems that can be rectangle-like entries. The row and column numbers are modeled as random processes. From an applications labeled on the left and top of Fig. 1, respectively. There are point of view, univariate probability distributions pro- 10 discrete distributions, shown in the first two rows, and vide an important foundation in probability theory since 25 continuous distributions. Five commonly used sampling they are the underpinnings of the most-used models in distributions are listed in the third row. For example, the practice. Normal distribution is indexed as R3C1, which indicates Univariate distributions are taught in most probability Row3and Column 1. The range, probability mass (density) and statistics courses in schools of business, engineering, function, mean, and variance for discrete and continuous and science. A figure illustrating the relationships among distributions are summarized in Tables 1 and 2, respectively. univariate distributions is useful to indicate how distributi- Distributions are ordered alphabetically in Tables 1 and 2. ons correspond to one another. Nakagawa and Yoda According to the index number shown in the second col- Downloaded By: [CDL Journals Account] At: 21:26 29 October 2008 (1977), Leemis (1986), and Kotz and van Dorp (2004) of- umn of Tables 1 and 2, users can easily find the distribution fer diagrams of relations among univariate distributions, required in Fig. 1. but their diagrams are not formatted in a matrix form, In Fig. 1, the distribution name, parameters and range so it is difficult for the instructor to refer to any partic- are shown in each entry.The parameters adopted satisfy the ular distribution from the diagram, and it might be dif- following conventions: n and k are integers; 0 ≤ p ≤ 1; a is ficult for users to find the required distribution quickly. the minimum; b is the maximum; µ is the expected value; Twoversions of the figure presented in this article over- m is either the median or mode; σ is the standard devi- come this shortcoming. Figure 1 is the complete figure with ation; θ is the location parameter; β and γ are scale pa- 35 univariate distributions, whereas Fig. 2 is a simpler ver- rameters; α, α1,α2 are shape parameters; and υ,υ1,υ2 are sion with 25 distributions designed for more elementary degrees of freedom. We do not always use α to denote the needs. shape parameter. For example, the parameter m (denoted Some useful references for studying univariate distribu- as the mode) in the Triangular (a, m, b); the parameter k tions can be found in Patil, Boswell, Joshi, and Ratnaparkhi (denoted as an integer) in the Erlang (k,β); υ (denoted as (1985), Patil, Boswell, and Ratnaparkhi (1985), Stigler the degrees of freedom) in the Chi-Squared (υ); and υ1,υ2 (1986), Hald (1990), Johnson et al. (1993, 1994, 1995), (both denoted as the degrees of freedom) in the F(υ1,υ2) Stuart and Ord (1994), Wimmer and Altmann (1999), are shape parameters. Evans, Hastings and Peacock (2000), and Balakrishnan Relations between entries are indicated with a dashed and Nevzorov (2003). These books individually do not arrow or solid arrow. A dashed arrow shows asymptotic re- provide all of the simple relationships shown in this lations, and a solid arrow shows transformations or special article. cases. The random variable X is used for all distributions. 0740-817X C 2005 “IIE” 652 Song Downloaded By: [CDL Journals Account] At: 21:26 29 October 2008 Fig. 1. Relationships among 35 distributions. (An arrow beginning and ending at the same rectangle indicates that it remains in the same distribution family, but the parameter values might change.) Forexample, the arrow from the Normal (R3C1) to the In Fig. 1, if more than one random variable is involved Standard Normal (R3C2) indicates that subtracting the to form a transformation, the relationship between these mean from a Normal and dividing by its standard devi- random variables is denoted by “iid” (independent and ation yields a Standard Normal distribution. identically distributed), “indep” (independent), or “k-dim Relationships among some univariate distributions 653 Table 1. Discrete distributions Distribution Index Probability Expected name no. Range mass function value Variance Bernoulli (p) R1C3 x = 0, 1 px(1 − p)1−x pp(1 − p) , = , ,..., n x − n−x − Binomial (n p) R2C3 x 0 1 nCxp (1 p) np np(1 p) = , ,..., / n / n 2/ − n / 2 Discrete R1C5 x x1 x2 xn 1 n i=1 xi n i=1 xi n ( i=1 xi n) Uniform (x1,...,xn) Equal-Spaced R2C5 x = a, a + c, 1/n (a + b)/2 c2(n2 − 1)/12 , , + ,..., = + b−a Uniform (a b c) a 2c b where n 1 c Geometric (trials) (p) R2C1 x = 1, 2,... p(1 − p)x−1 1/p (1 − p)/p2 , , = { − + , } K N−K / N − N−n Hyper-Geometric (N K n) R1C2 x max n N K 0 , Cx Cn−x Cn np np(1 p)( N−1 ) ...,min{K, n} where p = K/N , = , + ,... x−1 k − x−k / − / 2 Negative Binomial (k p) R2C2 x k k 1 Ck−1 p (1 p) k pk(1 p) p CK CN−K + + − Negative R1C1 x = k,..., k−1 x−k K−k+1 k(N 1) k(N 1)(N K) (K + 1 − k) CN N−x+1 K+1 (K+1)2(K+2) Hyper-Geometric (N, K, k) N − K + k x−1 Poisson (µ) R2C4 x = 0, 1, 2,... µx exp(−µ)/x! µµ − − + β Polya (n, p,β) R1C4 x = 0, 1, 2,...,nCn x 1(p + jβ) np np(1 p)(1 n ) x j=0 1+β · n−x−1 − + β k=0 (1 p k ) · n−1 + β −1 [ i=0 (1 i )] n = / − Note: Cm n! [m!(n m)!]. Downloaded By: [CDL Journals Account] At: 21:26 29 October 2008 Fig. 2. Relationships among 25 distributions. 654 Song Table 2. Continuous distributions Index Probability Expected Distribution name no. Range density function value Variance Arc-Sine R4C5 [0,1] [π x(1 − x)]−1 1/21/8 α − α − α +α 1 1 − 2 1 α α α α ,α ( 1 2)x (1 x) 1 1 2 Beta ( 1 2) R5C4 [0,1] 2 (α1)(α2) α1+α2 (α1+α2) (α1+α2+1) θ,β R {βπ + x−θ 2 }−1 Cauchy ( ) R4C4 [1 ( β ) ] NA NA υ −1 − / υ R+ x 2 e x 2 υ υ Chi-Squared ( ) R3C5 (υ/2)2υ/2 2 − ,β R+ xk 1 exp(−x/β) β β2 Erlang (k ) R6C3 βk (k−1)! k k Exponential (β) R6C2 R+ β−1 exp(−x/β) ββ2 Extreme Value (θ,γ) R4C1 R γ −1 exp[−(x − θ)/γ ] θ + 0.57722γγ2π 2/6 · exp{− exp[−(x − θ)/γ ]} υ +υ 1 2 υ /2 υ / − 2 ( )(υ /υ ) 1 ( 1 2) 1 υ 2υ (υ +υ −2) υ ,υ R+ 2 1 2 x 2 ,v > 2 1 2 ,v > F( 1 2) R3C4 υ / υ / −1 (υ +υ )/2 υ − 2 2 υ υ − 2 υ − 2 4 ( 1 2) ( 2 2) (1+υ1υ x) 1 2 2 2 1( 2 2) ( 2 4) α− 2 α, β R+ (x/β) 1exp(−x/β) αβ αβ 2 Gamma ( ) R5C3 β(α) β 1 +α 2 +α 2 1 +α + α α α − α ( α 1) ( α 1) ( α 1) R 2 x 1 2 1 − x 2 2 β2{ 2 − 2 } Generalized Gamma R5C2 ( ) exp[ ( ) ] 2 (α1)β β β (α1) (α1) (α1) (α1,α2,β) Laplace (µ, β) R7C1 R (2β)−1 exp(−|x − µ|/β) µ 2β2 µ, β R exp[−(x−µ)/β] µπ2β2/ Logistic ( ) R7C2 β{1+exp[−(x−µ)/β]}2 3 −α −1 + α2 exp(−α1)x 2 Loglogistic (α ,α ) R7C4 R −α δ exp(−η)csc(δ) δ[exp(−2η)] 1 2 [1+exp(−α )x 2 ]2 1 · [tan(δ) − δ]csc2(δ) π α δ = ,η = 1 where α α √ 2 2 ,α R+ α π −1 − 1 ln(x/m) 2 α2/ 2 α2 Lognormal (m ) R4C2 (x 2 ) exp[ 2 ( α ) ] m exp( 2) m exp( ) · [exp(α2) − 1] − / x−µ 2 Normal (µ, σ 2) R3C1 R exp[( √1 2)( σ ) ] µσ2 2πσ θ,α θ,∞ αθα/ α+1 αθ ,α > αθ2 ,α > Pareto ( ) R7C5 [ ) x α−1 1 (α−1)2(α−2) 2 √ √ β R+ /β2 − /β 2 β π/ β2 − β π 2 Rayleigh ( ) R6C1 (2x )exp[ (x ) ] 2 ( 2 ) Rectangular (a, b) R6C5 [a, b](b − a)−1 (a + b)/2(b − a)2/12 Standard Cauchy R4C3 R [π(1 + x2)]−1 NA NA (θ = 0,β = 1) R exp(−x) π 2/ Standard Logistic R7C3 [1+exp(−x)]2 0 3 (µ = 0,β = 1) Downloaded By: [CDL Journals Account] At: 21:26 29 October 2008 Standard Normal R3C2 R √1 exp(−x2/2) 0 1 2π (µ = 0,σ = 1) Standard Uniform R6C4 [0,1] 1 1/21/12 (a = 0, b = 1) υ+ / 2 T(υ) R3C3 R [( 1)√ 2] (1 + x )−(υ+1)/2 0, υ>1 υ , υ>2 (υ/2) πυ υ υ−2 2(x−a) , ≤ ≤ − − a x m Triangular (a, m, b) R5C5 [a, b] (m a)(b a) 1 (a + m + b) 1 {a2 + m2 + b2 2(b−x) , < ≤ 3 18 (b−a)(b−m) m x b − (am + ab + mb)} α, β R+ αβ −α α−1 − /β α β + 1 β2 + 2 Weibull ( ) R5C1 x exp[ (x ) ] (1 α ) (1 α ) − β + 1 2 [ (1 α )] Note: R+ denotes [0, ∞) and R denotes (−∞, ∞).
Recommended publications
  • Field Guide to Continuous Probability Distributions
    Field Guide to Continuous Probability Distributions Gavin E. Crooks v 1.0.0 2019 G. E. Crooks – Field Guide to Probability Distributions v 1.0.0 Copyright © 2010-2019 Gavin E. Crooks ISBN: 978-1-7339381-0-5 http://threeplusone.com/fieldguide Berkeley Institute for Theoretical Sciences (BITS) typeset on 2019-04-10 with XeTeX version 0.99999 fonts: Trump Mediaeval (text), Euler (math) 271828182845904 2 G. E. Crooks – Field Guide to Probability Distributions Preface: The search for GUD A common problem is that of describing the probability distribution of a single, continuous variable. A few distributions, such as the normal and exponential, were discovered in the 1800’s or earlier. But about a century ago the great statistician, Karl Pearson, realized that the known probabil- ity distributions were not sufficient to handle all of the phenomena then under investigation, and set out to create new distributions with useful properties. During the 20th century this process continued with abandon and a vast menagerie of distinct mathematical forms were discovered and invented, investigated, analyzed, rediscovered and renamed, all for the purpose of de- scribing the probability of some interesting variable. There are hundreds of named distributions and synonyms in current usage. The apparent diver- sity is unending and disorienting. Fortunately, the situation is less confused than it might at first appear. Most common, continuous, univariate, unimodal distributions can be orga- nized into a small number of distinct families, which are all special cases of a single Grand Unified Distribution. This compendium details these hun- dred or so simple distributions, their properties and their interrelations.
    [Show full text]
  • Probability Distributions Used in Reliability Engineering
    Probability Distributions Used in Reliability Engineering Probability Distributions Used in Reliability Engineering Andrew N. O’Connor Mohammad Modarres Ali Mosleh Center for Risk and Reliability 0151 Glenn L Martin Hall University of Maryland College Park, Maryland Published by the Center for Risk and Reliability International Standard Book Number (ISBN): 978-0-9966468-1-9 Copyright © 2016 by the Center for Reliability Engineering University of Maryland, College Park, Maryland, USA All rights reserved. No part of this book may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or by any information storage and retrieval system, without permission in writing from The Center for Reliability Engineering, Reliability Engineering Program. The Center for Risk and Reliability University of Maryland College Park, Maryland 20742-7531 In memory of Willie Mae Webb This book is dedicated to the memory of Miss Willie Webb who passed away on April 10 2007 while working at the Center for Risk and Reliability at the University of Maryland (UMD). She initiated the concept of this book, as an aid for students conducting studies in Reliability Engineering at the University of Maryland. Upon passing, Willie bequeathed her belongings to fund a scholarship providing financial support to Reliability Engineering students at UMD. Preface Reliability Engineers are required to combine a practical understanding of science and engineering with statistics. The reliability engineer’s understanding of statistics is focused on the practical application of a wide variety of accepted statistical methods. Most reliability texts provide only a basic introduction to probability distributions or only provide a detailed reference to few distributions.
    [Show full text]
  • Some Statistical Aspects of Spatial Distribution Models for Plants and Trees
    STUDIA FORESTALIA SUECICA Some Statistical Aspects of Spatial Distribution Models for Plants and Trees PETER DIGGLE Department of Operational Efficiency Swedish University of Agricultural Sciences S-770 73 Garpenberg. Sweden SWEDISH UNIVERSITY OF AGRICULTURAL SCIENCES COLLEGE OF FORESTRY UPPSALA SWEDEN Abstract ODC 523.31:562.6/46 The paper is an account of some stcctistical ur1alysrs carried out in coniur~tionwith a .silvicultural research project in the department of Oi_leruriotzal Efficiency. College of Forestry, Gcwpenberg. Sweden (Erikssorl, L & Eriksson. 0 in prepuratiotz 1981). 1~ Section 2, u statbtic due ro Morurz (19501 is irsed to detect spafiul intercccriorl amongst coutlts of the ~u~rnbersof yo14119 trees in stimple~~lo~slaid out in a systelnntic grid arrangement. Section 3 discusses the corlstruction of hivuriate disrrtbutions for the numbers of~:ild and plunted trees in a sample plot. Sectiorc 4 comiders the relutionship between the distributions of the number oj trees in a plot arld their rota1 busal area. Secrion 5 is a review of statistical nlerhodc for xte in cot~tlectiorz w~thpreliminai~ surveys of large areas of forest. In purticular, Secrion 5 dzscusses tests or spurrnl randotnness for a sillgle species, tests of independence betwren two species, and estimation of the number of trees per unit urea. Manuscr~ptrecelved 1981-10-15 LFIALLF 203 82 003 ISBN 9 1-38-07077-4 ISSN 0039-3150 Berllngs, Arlov 1982 Preface The origin of this report dates back to 1978. At this time Dr Peter Diggle was employed as a guest-researcher at the College. The aim of Dr Diggle's work at the College was to introduce advanced mathematical and statistical models in forestry research.
    [Show full text]
  • Multivariate Distributions
    IEOR E4602: Quantitative Risk Management Spring 2016 c 2016 by Martin Haugh Multivariate Distributions We will study multivariate distributions in these notes, focusing1 in particular on multivariate normal, normal-mixture, spherical and elliptical distributions. In addition to studying their properties, we will also discuss techniques for simulating and, very briefly, estimating these distributions. Familiarity with these important classes of multivariate distributions is important for many aspects of risk management. We will defer the study of copulas until later in the course. 1 Preliminary Definitions Let X = (X1;:::Xn) be an n-dimensional vector of random variables. We have the following definitions and statements. > n Definition 1 (Joint CDF) For all x = (x1; : : : ; xn) 2 R , the joint cumulative distribution function (CDF) of X satisfies FX(x) = FX(x1; : : : ; xn) = P (X1 ≤ x1;:::;Xn ≤ xn): Definition 2 (Marginal CDF) For a fixed i, the marginal CDF of Xi satisfies FXi (xi) = FX(1;:::; 1; xi; 1;::: 1): It is straightforward to generalize the previous definition to joint marginal distributions. For example, the joint marginal distribution of Xi and Xj satisfies Fij(xi; xj) = FX(1;:::; 1; xi; 1;:::; 1; xj; 1;::: 1). If the joint CDF is absolutely continuous, then it has an associated probability density function (PDF) so that Z x1 Z xn FX(x1; : : : ; xn) = ··· f(u1; : : : ; un) du1 : : : dun: −∞ −∞ Similar statements also apply to the marginal CDF's. A collection of random variables is independent if the joint CDF (or PDF if it exists) can be factored into the product of the marginal CDFs (or PDFs). If > > X1 = (X1;:::;Xk) and X2 = (Xk+1;:::;Xn) is a partition of X then the conditional CDF satisfies FX2jX1 (x2jx1) = P (X2 ≤ x2jX1 = x1): If X has a PDF, f(·), then it satisfies Z xk+1 Z xn f(x1; : : : ; xk; uk+1; : : : ; un) FX2jX1 (x2jx1) = ··· duk+1 : : : dun −∞ −∞ fX1 (x1) where fX1 (·) is the joint marginal PDF of X1.
    [Show full text]
  • Univariate Probability Distributions
    Journal of Statistics Education ISSN: (Print) 1069-1898 (Online) Journal homepage: http://www.tandfonline.com/loi/ujse20 Univariate Probability Distributions Lawrence M. Leemis, Daniel J. Luckett, Austin G. Powell & Peter E. Vermeer To cite this article: Lawrence M. Leemis, Daniel J. Luckett, Austin G. Powell & Peter E. Vermeer (2012) Univariate Probability Distributions, Journal of Statistics Education, 20:3, To link to this article: http://dx.doi.org/10.1080/10691898.2012.11889648 Copyright 2012 Lawrence M. Leemis, Daniel J. Luckett, Austin G. Powell, and Peter E. Vermeer Published online: 29 Aug 2017. Submit your article to this journal View related articles Full Terms & Conditions of access and use can be found at http://www.tandfonline.com/action/journalInformation?journalCode=ujse20 Download by: [College of William & Mary] Date: 06 September 2017, At: 10:32 Journal of Statistics Education, Volume 20, Number 3 (2012) Univariate Probability Distributions Lawrence M. Leemis Daniel J. Luckett Austin G. Powell Peter E. Vermeer The College of William & Mary Journal of Statistics Education Volume 20, Number 3 (2012), http://www.amstat.org/publications/jse/v20n3/leemis.pdf Copyright c 2012 by Lawrence M. Leemis, Daniel J. Luckett, Austin G. Powell, and Peter E. Vermeer all rights reserved. This text may be freely shared among individuals, but it may not be republished in any medium without express written consent from the authors and advance notification of the editor. Key Words: Continuous distributions; Discrete distributions; Distribution properties; Lim- iting distributions; Special Cases; Transformations; Univariate distributions. Abstract Downloaded by [College of William & Mary] at 10:32 06 September 2017 We describe a web-based interactive graphic that can be used as a resource in introductory classes in mathematical statistics.
    [Show full text]
  • Package 'Distr' Standardmethods Utility to Automatically Generate Accessor and Replacement Functions
    Package ‘distr’ March 11, 2019 Version 2.8.0 Date 2019-03-11 Title Object Oriented Implementation of Distributions Description S4-classes and methods for distributions. Depends R(>= 2.14.0), methods, graphics, startupmsg, sfsmisc Suggests distrEx, svUnit (>= 0.7-11), knitr Imports stats, grDevices, utils, MASS VignetteBuilder knitr ByteCompile yes Encoding latin1 License LGPL-3 URL http://distr.r-forge.r-project.org/ LastChangedDate {$LastChangedDate: 2019-03-11 16:33:22 +0100 (Mo, 11 Mrz 2019) $} LastChangedRevision {$LastChangedRevision: 1315 $} VCS/SVNRevision 1314 NeedsCompilation yes Author Florian Camphausen [ctb] (contributed as student in the initial phase --2005), Matthias Kohl [aut, cph], Peter Ruckdeschel [cre, cph], Thomas Stabla [ctb] (contributed as student in the initial phase --2005), R Core Team [ctb, cph] (for source file ks.c/ routines 'pKS2' and 'pKolmogorov2x') Maintainer Peter Ruckdeschel <[email protected]> Repository CRAN Date/Publication 2019-03-11 20:32:54 UTC 1 2 R topics documented: R topics documented: distr-package . .5 AbscontDistribution . 12 AbscontDistribution-class . 14 Arcsine-class . 18 Beta-class . 19 BetaParameter-class . 21 Binom-class . 22 BinomParameter-class . 24 Cauchy-class . 25 CauchyParameter-class . 27 Chisq-class . 28 ChisqParameter-class . 30 CompoundDistribution . 31 CompoundDistribution-class . 32 convpow-methods . 34 d-methods . 36 decomposePM-methods . 36 DExp-class . 37 df-methods . 39 df1-methods . 39 df2-methods . 40 dim-methods . 40 dimension-methods . 40 Dirac-class . 41 DiracParameter-class . 42 DiscreteDistribution . 44 DiscreteDistribution-class . 46 distr-defunct . 49 distrARITH . 50 Distribution-class . 51 DistributionSymmetry-class . 52 DistrList . 53 DistrList-class . 54 distrMASK . 55 distroptions . 56 DistrSymmList . 58 DistrSymmList-class . 59 EllipticalSymmetry .
    [Show full text]
  • Bivariate Frequency of Meteorological Drought in the Upper Minjiang River Based on Copula Function
    water Article Bivariate Frequency of Meteorological Drought in the Upper Minjiang River Based on Copula Function Fangling Qin 1, Tianqi Ao 1,2,* and Ting Chen 1,3,* 1 College of Water Resources & Hydropower, Sichuan University, Chengdu 610065, China; [email protected] 2 State Key Laboratory of Hydraulic and Mountain River Engineering, Sichuan University, Chengdu 610065, China 3 Heavy Rain and Drought-Flood Disasters in Plateau and Basin Key Laboratory of Sichuan Province, Chengdu 610072, China * Correspondence: [email protected] (T.A.); [email protected] (T.C.); Tel.: +86-028-8540-1149 (T.A.); +86-028-6028-0028 (T.C.) Abstract: Based on the Standardized Precipitation Index (SPI) and copula function, this study analyzed the meteorological drought in the upper Minjiang River basin. The Tyson polygon method is used to divide the research area into four regions based on four meteorological stations. The monthly precipitation data of four meteorological stations from 1966 to 2016 were used for the calculation of SPI. The change trend of SPI1, SPI3 and SPI12 showed the historical dry-wet evolution phenomenon of short-term humidification and long-term aridification in the study area. The major drought events in each region are counted based on SPI3. The results show that the drought lasted the longest in Maoxian region, the occurrence of minor drought events was more frequent than the other regions. Nine distribution functions are used to fit the marginal distribution of drought duration (D), severity (S) and peak (P) estimated based on SPI3, the best marginal distribution is Citation: Qin, F.; Ao, T.; Chen, T.
    [Show full text]
  • Goodness of Fit
    GOODNESS OF FIT INTRODUCTION Goodness of fit tests are used to determine how well the shape of a sample of data obtained from an experiment matches a conjectured or hypothesized distribution shape for the population from which the data was collected. The idea behind a goodness-of-fit test is to see if the sample comes from a population with the claimed distribution. Another way of looking at that is to ask if the frequency distribution fits a specific pattern, or even more to the point, how do the actual observed frequencies in each class interval of a histogram compare to the frequencies that theoretically would be expected to occur if the data exactly followed the hypothesized probability distribution. This is relevant to cost risk analysis because we often want to apply a distribution to an element of cost based on observed sample data. A goodness of fit test is a statistical hypothesis test: Set up the null and alternative hypothesis; determine alpha; calculate a test statistic; look-up a critical value statistic; draw a conclusion. In this course, we will discuss three different methods or tests that are commonly used to perform Goodness-of-Fit analyses: the Chi-Square (χ2) test, the Kolmogorov-Smirnov One Sample Test, and the Anderson-Darling test. The Kolmogorov-Smirnov and Anderson-Darling tests are restricted to continuous distributions while the χ2 test can be applied to both discrete and continuous distributions. GOODNESS OF FIT TESTS CHI SQUARE TEST The Chi-Square test is used to test if a sample of data came from a population with a specific distribution.
    [Show full text]
  • Joint Mixability of Elliptical Distributions and Related Families
    Joint Mixability of Elliptical Distributions and Related Families Chuancun Yin ∗ School of Statistics, Qufu Normal University Shandong 273165, China e-mail: [email protected] Dan Zhu School of Statistics, Qufu Normal University Shandong 273165, China September 13, 2018 Abstract In this paper, we further develop the theory of complete mixability and joint mixability for some distribution families. We generalize a result of R¨uschendorf and Uckelmann (2002) related to complete mixability of continuous distribution function having a symmetric and unimodal density. Two different proofs to a result of Wang and Wang (2016) which related to the joint mixability of elliptical distributions with the same characteristic generator are present. We solve the Open Problem 7 in Wang (2015) by arXiv:1706.05499v3 [math.ST] 12 Sep 2018 constructing a bimodal-symmetric distribution. The joint mixability of slash-elliptical distributions and skew-elliptical distributions is studied and the extension to multivariate distributions is also investigated. Keywords: Complete mixability; joint mixability; multivariate dependence; slash/skew- elliptical distributions; elliptical distributions AMS subject classifications: 60E05 60E10 60E15 ∗Corresponding author. 1 1 Introduction In recent years, the problem of studying complete mixability and joint mixability of distributions has received considerable attention. Complete mixability and joint mixabil- ity describe whether it is possible to generate random variables (or vectors) from given distributions with constant sum. The formally definition of complete mixability for a distribution was first introduced in Wang and Wang (2011) and then extended to an arbi- trary set of distributions in Wang, Peng and Yang (2013), although the concept has been used in variance reduction problems earlier (see Gaffke and R¨uschendorf (1981), Knott and Smith (2006), R¨uschendorf and Uckelmann (2002)).
    [Show full text]
  • Theorizing Probability Distribution in Applied Statistics
    Journal of Population and Development, June 2020 | Indra Malakar/Theorizing Probability Distribution in ... Received Date: January 2020 Revised: March 2020 Accepted: June 2020 Theorizing Probability Distribution in Applied Statistics Indra Malakar (M. Phil.)* Abstract This paper investigates into theoretical knowledge on probability distribution and the application of binomial, poisson and normal distribution. Binomial distribution is widely used discrete random variable when the trails are repeated under identical condition for fixed number of times and when there are only two possible outcomes whereas poisson distribution is for discrete random variable for which the probability of occurrence of an event is small and the total number of possible cases is very large and normal distribution is limiting form of binomial distribution and used when the number of cases is infinitely large and probabilities of success and failure is almost equal. Key words: Application, binomial, discrete, normal, occurrence & poisson. Introduction In statistics, probability theory and probability distribution are mathematical functions that provide the probabilities of occurrence of different possible outcomes in an experiment (Ash, 2008). In more technical terms, the probability distribution is a description of a random phenomenon in terms of the probabilities of events (Michael, 2010). A probability distribution is specified in terms of an underlying sample space, which is the set of all possible outcomes of the random phenomenon being observed. The sample space may be the set of real numbers or a set of vectors, or it may be a list of non-numerical values; for example, the sample space of a coin flip would be (heads & tails). Probability distributions are generally categorized into two classes.
    [Show full text]
  • Mahalanobis' Distance Beyond Normal Distributions
    MAHALANOBIS' DISTANCE BEYOND NORMAL DISTRIBUTIONS JOAKIM EKSTROM¨ Abstract. Based on the reasoning expressed by Mahalanobis in his original article, the present article extends the Mahalanobis distance beyond the set of normal distributions. Sufficient conditions for existence and uniqueness are studied, and some properties de- rived. Since many statistical methods use the Mahalanobis distance as e vehicle, e.g. the method of least squares and the chi-square hypothesis test, extending the Mahalanobis distance beyond normal distributions yields a high ratio of output to input, because those methods are then instantly generalized beyond the normal distributions as well. Maha- lanobis' idea also has a certain conceptual beauty, mapping random variables into a frame of reference which ensures that apples are compared to apples. 1 2 1. Introduction Distances have been used in statistics for centuries. Carl Friedrich Gauss (1809) pro- posed using sums of squares, a squared Euclidean distance, for the purpose of fitting Kepler orbits to observations of heavenly bodies. Karl Pearson (1900) proposed a weighted Eu- clidean distance, which he denoted χ, for the construction of a hypothesis test. Both Gauss' and Pearson's methods presume statistically independent and normally distributed observational errors, and therefore their distances are nowadays recognized as special cases of the Mahalanobis distance. Proposed by Mahalanobis (1936), the Mahalanobis distance is a distance that accounts for probability distribution and equals the Euclidean distance under the standard normal distribution. The rationale for the latter fact is largely historical, but both Gauss (1809) and Pearson (1900) discussed the fact that the density function of a standard normally distributed random variable equals (a normalizing constant times) the composition g ◦jj·jj, where jj · jj is the Euclidean norm and g the Gaussian function e−x2=2.
    [Show full text]
  • Random Variables
    Random Variables. in a Nutshell 1 AT Patera, M Yano September 22, 2014 Draft V1.2 @c MIT 2014. From Math, Numerics, & Programming for Mechanical Engineers . in a Nutshell by AT Patera and M Yano. All rights reserved. 1 Preamble It is often the case in engineering analysis that the outcome of a (random) experiment is a numerical value: a displacement or a velocity; a Young’s modulus or thermal conductivity; or perhaps the yield of a manufacturing process. In such cases we denote our experiment a random variable. In this nutshell we develop the theory of random variables from definition and characterization to simulation and finally estimation. In this nutshell: We describe univariate and bivariate discrete random variables: probability mass func­ tions (joint, marginal, conditional); independence; transformations of random vari­ ables. We describe continuous random variables: probably density function; cumulative dis­ tribution function; quantiles. We define expectation generally, and the mean, variance, and standard deviation in particular. We provide a frequentist interpretation of mean. We connect mean, vari­ ance, and probability through Chebyshev’s Inequality and (briefly) the Central Limit Theorem. We summarize the properties and relevance of several ubiquitous probability mass and density functions: univariate and bivariate discrete uniform; Bernoulli; binomial; univariate and bivariate continuous uniform; univariate normal. We introduce pseudo-random variates: generation; transformation; application to hy­ pothesis testing and Monte Carlo simulation. We define a random sample of “i.i.d.” random variables and the associated sample mean. We derive the properties of the sample mean, in particular the mean and variance, relevant to parameter estimation.
    [Show full text]