A Multivariate Student's T-Distribution

Total Page:16

File Type:pdf, Size:1020Kb

A Multivariate Student's T-Distribution Open Journal of Statistics, 2016, 6, 443-450 Published Online June 2016 in SciRes. http://www.scirp.org/journal/ojs http://dx.doi.org/10.4236/ojs.2016.63040 A Multivariate Student’s t-Distribution Daniel T. Cassidy Department of Engineering Physics, McMaster University, Hamilton, ON, Canada Received 29 March 2016; accepted 14 June 2016; published 17 June 2016 Copyright © 2016 by author and Scientific Research Publishing Inc. This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/ Abstract A multivariate Student’s t-distribution is derived by analogy to the derivation of a multivariate normal (Gaussian) probability density function. This multivariate Student’s t-distribution can have different shape parameters νi for the marginal probability density functions of the multi- variate distribution. Expressions for the probability density function, for the variances, and for the covariances of the multivariate t-distribution with arbitrary shape parameters for the marginals are given. Keywords Multivariate Student’s t, Variance, Covariance, Arbitrary Shape Parameters 1. Introduction An expression for a multivariate Student’s t-distribution is presented. This expression, which is different in form than the form that is commonly used, allows the shape parameter ν for each marginal probability density function (pdf) of the multivariate pdf to be different. The form that is typically used is [1] −+ν Γ+((ν n) 2) T ( n) 2 +Σ−1 n 2 (1.[xx] [ ]) (1) ΓΣ(νν2)(π ) This “typical” form attempts to generalize the univariate Student’s t-distribution and is valid when the n marginal distributions have the same shape parameter ν . The shape of this multivariate t-distribution arises from the observation that the pdf for [ xy] = [ ] σ is given by Equation (1) when [ y] is distributed as a multivariate normal distribution with covariance matrix [Σ] and σ 2 is distributed as chi-squared. The multivariate Student’s t-distribution put forth here is derived from a Cholesky decomposition of the scale matrix by analogy to the multivariate normal (Gaussian) pdf. The derivation of the multivariate normal pdf is How to cite this paper: Cassidy, D.T. (2016) A Multivariate Student’s t-Distribution. Open Journal of Statistics, 6, 443-450. http://dx.doi.org/10.4236/ojs.2016.63040 D. T. Cassidy given in Section 2 to provide background. The multivariate Student’s t-distribution and the variances and covariances for the multivariate t-distribution are given in Section 3. Section 4 is a conclusion. 2. Background Information 2.1. Cholesky Decomposition A method to produce a multivariate pdf with known scale matrix [Σs ] is presented in this section. For nor- mally distributed variables, the covariance matrix [Σ=Σ] [ s ] since the scale factor for a normal distribution is the standard deviation of the distribution. An example with n = 4 is used to provide concrete examples. Consider the transformation [ yx] = [M ][ ] where [ y] and [ x] are 41× column matrices, [M ] is 44× square matrix, and the elements of [ x] are independent random variables. The off-diagonal elements of [M ] introduce correlations between the elements of [ y] . yx11m1,1 000 mm 00 yx222,1 2,2 = (2) yxmmm3,1 3,2 3,3 0 33 yx44mmmm4,1 4,2 4,3 4,4 T The scale matrix [Σ=s ] [MM][ ] . The covariance matrix [Σ] has elements Σ=ij, E{yyij} where E{yyij} is the expectation of yyij and Σ=Σi,, j ji. If the [ x] are normally distributed, then T [Σ=Σ] [ s ] =[MM][ ] , where the superscript T indicates a transpose of the matrix. If [Σs ] is known, then [M ] is the Cholesky decomposition of the matrix [Σs ] [2]. For the 44× example of Equation (2), 2 m1,1 mm1,1 2,1 mm1,1 3,1 mm1,1 4,1 22 mm1,1 2,1 m2,1++ m 2,2 mm2,1 3,1 mm 2,2 3,2 mm2,1 4,1 + mm 2,2 4,2 Σ= . (3) [ s ] 22 2 mmmmmm1,1 3,1 2,1 3,1+ 2,2 3,2 mmm3,1++ 3,2 3,3 mmmmmm3,1 4,1+ 3,2 4,2 + 3,3 4,3 22 22 mmmmmmmmmmm1,1 4,1 2,1 4,1+ 2,2 4,2 3,1 4,1 ++ 3,2 4,2 3,3m4,3 mmmm 4,1+++ 4,2 4,3 4,4 T T 2 From linear algebra, det ([MM]) = det[ M] det[ M] = ( det [ M]) . For [M ] as defined in Equation (2), 2 det M= mm mm and det Σ=mm22 mm 2 2=det M whereas Σ=E yy =σ 2 is the va- [ ] 1,1 2,2 3,3 4,4 [ s ] 1,1 2,2 3,3 4,4 ( [ ]) ii, { ii} yi Σ=E yy =σ 2 riance of the zero-mean random variable yi and ij, { i j} yyij is the covariance of the zero-mean random variables yi and y j . 2.2. Multivariate Normal Probability Density Function To create a multivariate normal pdf, start with the joint pdf fxN ([ ]) for n unit normal, zero mean, independent random variables [ x]: n 11 1 1T f([ x]) =exp −=x2 exp −[xx] [ ] (4) N ∏ i n i=1 2π 22(2π) T where [x] is an n-row column matrix: [x] = [ xx12,,, xn ] . fN ([ x])dd xx12 d xn gives the probability that the random variables [ x] lie in the interval [x] <[ x] ≤+[ xx] [d ]. The requirement for zero mean random variables is not a restriction. If E{xi } = µi , then xxii′ = − µi is a zero mean random variable with the same shape and scale parameters as xi . Use Equation (2) to transform the variables. The Jacobian determinant of the transformation relates the products of the infinitesimals of integration such that ∂ ( xx12,,, xn ) ddxxin2 d x= ddyy12 d. yn (5) ∂ ( yy12,,, yn ) 444 D. T. Cassidy The magnitude of the Jacobian determinant of the transformation [ y] = [ Mx][ ] is (Appendix) ∂ ( xx,,, x) 11 12 n = = (6) ∂ yy, , , y det M ( 12 n ) [ ] det[Σs ] 2 where the equality det[Σ=s ] ( det [M ]) has been used. −1 − Σ= T Σ=−1 T1− = 1 Since [ s ] [MM][ ] , [ s ] ([MM] ) [ ] , and since [xMy] [ ] [ ] , the multivariate “z-score” −1 − T TT− 1= T Σ −1 T1Σ Σ=Σ [xx] [ ] becomes [ yM] ([ ] ) [ My] [ ] [ y] [ s ] [ y] , which equals [ yy] [ ] [ ] since [ s ] [ ] for normally distributed variables. The result is that the unit normal, independent, multivariate pdf, Equation (4), becomes under the trans- formation Equation (2) 11T −1 fyN ([ ]) = exp−Σ[ y] [ s ] [ y] (7) n 2 (2π) det [Σs ] T where [ y] is a n-row column matrix: [ y] = [ yy12,,, yn ] and [Σ=Σs ] [ ] . For the 44× example, 1 0 00 m 1,1 m 1 − 2,1 00 −1 mm1,1 2,2 m2,2 [M ] = , (8) mm− mm m 1 3,2 2,1 3,1 2,2 − 3,2 0 mmm1,1 2,2 3,3 mm2,2 3,3 m3,3 − mm− mm m 1 1 4,3 3,2 4,2 3,3 − 4,3 m1,4 mmm2,2 3,3 4,4 mm3,3 4,4 m 4,4 −1 Σ=−1 T1− from which [ s ] ([MM] ) [ ] can be calculated. In Equation (8), −1 mmm2,1 3,2 4,3−−+ mmm 4,2 2,1 3,3 mmm 2,2 3,1 4,3 mmm 4,1 2,2 3,3 m1,4 = − . (9) mm1,1 2,2 mm 3,3 4,4 −1 The denominator in the expression for m1,4 is det [M ] . 3. Multivariate Student’s t Probability Density Function A similar approach can be used to create a multivariate Student’s t pdf. Assume truncated or effectively truncated t-distributions, so that moments exist [3] [4]. For simplicity, assume that support is [µ−+bb βµ, β] where b is a positive, large number, β is the scale factor for the distribution, and µ is the location parameter for the distribution. If b is a large number, then a significant portion of the tails of the distribution are included. If b = ∞ then all of the tails are included. Start with the joint pdf for n independent, zero-mean (location parameters [µ] = 0 ) Student’s t pdfs with shape parameters [ν ] , and scale parameters [β ] = 1 : −+(ν 12) nn2 i Γ+((ν i 12) ) x f([ x];[νν]) = 1; +=i gx( ) (10) t ∏∏ν i ii ii=11Γ (ννii2) π i = with −∞ ≤xi ≤ +∞ . ft ([ x];[ν ]) dd xx12 d xn gives the probability that a random draw of the column matrix [ x] from the joint Student’s t-distribution lies in the interval [x] <[ x] ≤+[ xx] [d ]. The pdf gxi( ii;ν ) is a function of only xi and the shape parameter ν i , and thus is independent of any other gxj( jj;ν ) , ji≠ . 445 D. T. Cassidy Use the transformation of Equation (2) to create a multivariate pdf −+ν 2 ( i 12) i − my1 n ∑ ij, j 1 Γ+((ν i 12) ) j=1 fyt ([ ];[ν ]) = ∏ 1.+ (11) = Γ ν det [M ] i 1 (ννii2) π i = i −1 The solution xi ∑ j=1 myij, j of the transformation Equation (2) was used. The elements of the inverse −1 −1 matrix [M ] , mij, , are given in terms of the mij, by Equation (8) for the n = 4 example. Note that the shape parameters ν i of the constituent distributions need not be the same in the multivariate t-distribution given by fyt ([ ];[ν ]) . ft ([ y];[ν ]) dd yy12 d yn gives the probability that a random draw of the column matrix [ y] from the multivariate Student’s t-distribution with shape parameters [ν ] lies in the interval [ y] <≤+[ y] [ yy] [d ] .
Recommended publications
  • Moments of the Product and Ratio of Two Correlated Chi-Square Variables
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Springer - Publisher Connector Stat Papers (2009) 50:581–592 DOI 10.1007/s00362-007-0105-0 REGULAR ARTICLE Moments of the product and ratio of two correlated chi-square variables Anwar H. Joarder Received: 2 June 2006 / Revised: 8 October 2007 / Published online: 20 November 2007 © The Author(s) 2007 Abstract The exact probability density function of a bivariate chi-square distribu- tion with two correlated components is derived. Some moments of the product and ratio of two correlated chi-square random variables have been derived. The ratio of the two correlated chi-square variables is used to compare variability. One such applica- tion is referred to. Another application is pinpointed in connection with the distribution of correlation coefficient based on a bivariate t distribution. Keywords Bivariate chi-square distribution · Moments · Product of correlated chi-square variables · Ratio of correlated chi-square variables Mathematics Subject Classification (2000) 62E15 · 60E05 · 60E10 1 Introduction Fisher (1915) derived the distribution of mean-centered sum of squares and sum of products in order to study the distribution of correlation coefficient from a bivariate nor- mal sample. Let X1, X2,...,X N (N > 2) be two-dimensional independent random vectors where X j = (X1 j , X2 j ) , j = 1, 2,...,N is distributed as a bivariate normal distribution denoted by N2(θ, ) with θ = (θ1,θ2) and a 2 × 2 covariance matrix = (σik), i = 1, 2; k = 1, 2. The sample mean-centered sums of squares and sum of products are given by a = N (X − X¯ )2 = mS2, m = N − 1,(i = 1, 2) ii j=1 ij i i = N ( − ¯ )( − ¯ ) = and a12 j=1 X1 j X1 X2 j X2 mRS1 S2, respectively.
    [Show full text]
  • A Study of Non-Central Skew T Distributions and Their Applications in Data Analysis and Change Point Detection
    A STUDY OF NON-CENTRAL SKEW T DISTRIBUTIONS AND THEIR APPLICATIONS IN DATA ANALYSIS AND CHANGE POINT DETECTION Abeer M. Hasan A Dissertation Submitted to the Graduate College of Bowling Green State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY August 2013 Committee: Arjun K. Gupta, Co-advisor Wei Ning, Advisor Mark Earley, Graduate Faculty Representative Junfeng Shang. Copyright c August 2013 Abeer M. Hasan All rights reserved iii ABSTRACT Arjun K. Gupta, Co-advisor Wei Ning, Advisor Over the past three decades there has been a growing interest in searching for distribution families that are suitable to analyze skewed data with excess kurtosis. The search started by numerous papers on the skew normal distribution. Multivariate t distributions started to catch attention shortly after the development of the multivariate skew normal distribution. Many researchers proposed alternative methods to generalize the univariate t distribution to the multivariate case. Recently, skew t distribution started to become popular in research. Skew t distributions provide more flexibility and better ability to accommodate long-tailed data than skew normal distributions. In this dissertation, a new non-central skew t distribution is studied and its theoretical properties are explored. Applications of the proposed non-central skew t distribution in data analysis and model comparisons are studied. An extension of our distribution to the multivariate case is presented and properties of the multivariate non-central skew t distri- bution are discussed. We also discuss the distribution of quadratic forms of the non-central skew t distribution. In the last chapter, the change point problem of the non-central skew t distribution is discussed under different settings.
    [Show full text]
  • On the Meaning and Use of Kurtosis
    Psychological Methods Copyright 1997 by the American Psychological Association, Inc. 1997, Vol. 2, No. 3,292-307 1082-989X/97/$3.00 On the Meaning and Use of Kurtosis Lawrence T. DeCarlo Fordham University For symmetric unimodal distributions, positive kurtosis indicates heavy tails and peakedness relative to the normal distribution, whereas negative kurtosis indicates light tails and flatness. Many textbooks, however, describe or illustrate kurtosis incompletely or incorrectly. In this article, kurtosis is illustrated with well-known distributions, and aspects of its interpretation and misinterpretation are discussed. The role of kurtosis in testing univariate and multivariate normality; as a measure of departures from normality; in issues of robustness, outliers, and bimodality; in generalized tests and estimators, as well as limitations of and alternatives to the kurtosis measure [32, are discussed. It is typically noted in introductory statistics standard deviation. The normal distribution has a kur- courses that distributions can be characterized in tosis of 3, and 132 - 3 is often used so that the refer- terms of central tendency, variability, and shape. With ence normal distribution has a kurtosis of zero (132 - respect to shape, virtually every textbook defines and 3 is sometimes denoted as Y2)- A sample counterpart illustrates skewness. On the other hand, another as- to 132 can be obtained by replacing the population pect of shape, which is kurtosis, is either not discussed moments with the sample moments, which gives or, worse yet, is often described or illustrated incor- rectly. Kurtosis is also frequently not reported in re- ~(X i -- S)4/n search articles, in spite of the fact that virtually every b2 (•(X i - ~')2/n)2' statistical package provides a measure of kurtosis.
    [Show full text]
  • A Family of Skew-Normal Distributions for Modeling Proportions and Rates with Zeros/Ones Excess
    S S symmetry Article A Family of Skew-Normal Distributions for Modeling Proportions and Rates with Zeros/Ones Excess Guillermo Martínez-Flórez 1, Víctor Leiva 2,* , Emilio Gómez-Déniz 3 and Carolina Marchant 4 1 Departamento de Matemáticas y Estadística, Facultad de Ciencias Básicas, Universidad de Córdoba, Montería 14014, Colombia; [email protected] 2 Escuela de Ingeniería Industrial, Pontificia Universidad Católica de Valparaíso, 2362807 Valparaíso, Chile 3 Facultad de Economía, Empresa y Turismo, Universidad de Las Palmas de Gran Canaria and TIDES Institute, 35001 Canarias, Spain; [email protected] 4 Facultad de Ciencias Básicas, Universidad Católica del Maule, 3466706 Talca, Chile; [email protected] * Correspondence: [email protected] or [email protected] Received: 30 June 2020; Accepted: 19 August 2020; Published: 1 September 2020 Abstract: In this paper, we consider skew-normal distributions for constructing new a distribution which allows us to model proportions and rates with zero/one inflation as an alternative to the inflated beta distributions. The new distribution is a mixture between a Bernoulli distribution for explaining the zero/one excess and a censored skew-normal distribution for the continuous variable. The maximum likelihood method is used for parameter estimation. Observed and expected Fisher information matrices are derived to conduct likelihood-based inference in this new type skew-normal distribution. Given the flexibility of the new distributions, we are able to show, in real data scenarios, the good performance of our proposal. Keywords: beta distribution; centered skew-normal distribution; maximum-likelihood methods; Monte Carlo simulations; proportions; R software; rates; zero/one inflated data 1.
    [Show full text]
  • A Study of Ocean Wave Statistical Properties Using SNOW
    A study of ocean wave statistical properties using nonlinear, directional, phase- resolved ocean wavefield simulations Legena Henry PhD Candidate Course 2 – OE [email protected] abstract We study the statistics of wavefields obtained from phase-resolved simulations that are non-linear (up to the third order in surface potential). We model wave-wave interactions based on the fully non- linear Zakharov equations. We vary the simulated wavefield's input spectral properties: peak shape parameter, Phillips parameter and Directional Spreading Function. We then investigate the relationships between these input spectral properties and output physical properties via statistical analysis. We investigate: 1. Surface elevation distribution in response to input spectral properties, 2. Wave definition methods in an irregular wavefield with a two- dimensional wave number, 3. Wave height/wavelength distributions in response to input spectral properties, including the occurrence and spacing of large wave events (based on definitions in 2). 1. Surface elevation distribution in response to input spectral properties: – Peak Shape Parameter – Phillips' Parameter – Directional Spreading Function 1. Surface elevation distribution in response to input spectral properties: Peak shape parameter Peak shape parameter produces higher impact on even moments of surface elevation, (i.e. surface elevation kurtosis and surface elevation variance) than on the observed odd moments (skewness) of surface elevation. The highest elevations in a wavefield with mean peak shape parameter, 3.3 are more stable than those in wavefields with mean peak shape parameter, 1.0, and 5.0 We find surface elevation kurtosis in non-linear wavefields is much smaller than surface elevation slope kurtosis. We also see that higher values of peak shape parameter produce higher kurtosis of surface slope in the mean direction of propagation.
    [Show full text]
  • A Guide on Probability Distributions
    powered project A guide on probability distributions R-forge distributions Core Team University Year 2008-2009 LATEXpowered Mac OS' TeXShop edited Contents Introduction 4 I Discrete distributions 6 1 Classic discrete distribution 7 2 Not so-common discrete distribution 27 II Continuous distributions 34 3 Finite support distribution 35 4 The Gaussian family 47 5 Exponential distribution and its extensions 56 6 Chi-squared's ditribution and related extensions 75 7 Student and related distributions 84 8 Pareto family 88 9 Logistic ditribution and related extensions 108 10 Extrem Value Theory distributions 111 3 4 CONTENTS III Multivariate and generalized distributions 116 11 Generalization of common distributions 117 12 Multivariate distributions 132 13 Misc 134 Conclusion 135 Bibliography 135 A Mathematical tools 138 Introduction This guide is intended to provide a quite exhaustive (at least as I can) view on probability distri- butions. It is constructed in chapters of distribution family with a section for each distribution. Each section focuses on the tryptic: definition - estimation - application. Ultimate bibles for probability distributions are Wimmer & Altmann (1999) which lists 750 univariate discrete distributions and Johnson et al. (1994) which details continuous distributions. In the appendix, we recall the basics of probability distributions as well as \common" mathe- matical functions, cf. section A.2. And for all distribution, we use the following notations • X a random variable following a given distribution, • x a realization of this random variable, • f the density function (if it exists), • F the (cumulative) distribution function, • P (X = k) the mass probability function in k, • M the moment generating function (if it exists), • G the probability generating function (if it exists), • φ the characteristic function (if it exists), Finally all graphics are done the open source statistical software R and its numerous packages available on the Comprehensive R Archive Network (CRAN∗).
    [Show full text]
  • Procedures for Estimation of Weibull Parameters James W
    United States Department of Agriculture Procedures for Estimation of Weibull Parameters James W. Evans David E. Kretschmann David W. Green Forest Forest Products General Technical Report February Service Laboratory FPL–GTR–264 2019 Abstract Contents The primary purpose of this publication is to provide an 1 Introduction .................................................................. 1 overview of the information in the statistical literature on 2 Background .................................................................. 1 the different methods developed for fitting a Weibull distribution to an uncensored set of data and on any 3 Estimation Procedures .................................................. 1 comparisons between methods that have been studied in the 4 Historical Comparisons of Individual statistics literature. This should help the person using a Estimator Types ........................................................ 8 Weibull distribution to represent a data set realize some advantages and disadvantages of some basic methods. It 5 Other Methods of Estimating Parameters of should also help both in evaluating other studies using the Weibull Distribution .......................................... 11 different methods of Weibull parameter estimation and in 6 Discussion .................................................................. 12 discussions on American Society for Testing and Materials Standard D5457, which appears to allow a choice for the 7 Conclusion ................................................................
    [Show full text]
  • Location-Scale Distributions
    Location–Scale Distributions Linear Estimation and Probability Plotting Using MATLAB Horst Rinne Copyright: Prof. em. Dr. Horst Rinne Department of Economics and Management Science Justus–Liebig–University, Giessen, Germany Contents Preface VII List of Figures IX List of Tables XII 1 The family of location–scale distributions 1 1.1 Properties of location–scale distributions . 1 1.2 Genuine location–scale distributions — A short listing . 5 1.3 Distributions transformable to location–scale type . 11 2 Order statistics 18 2.1 Distributional concepts . 18 2.2 Moments of order statistics . 21 2.2.1 Definitions and basic formulas . 21 2.2.2 Identities, recurrence relations and approximations . 26 2.3 Functions of order statistics . 32 3 Statistical graphics 36 3.1 Some historical remarks . 36 3.2 The role of graphical methods in statistics . 38 3.2.1 Graphical versus numerical techniques . 38 3.2.2 Manipulation with graphs and graphical perception . 39 3.2.3 Graphical displays in statistics . 41 3.3 Distribution assessment by graphs . 43 3.3.1 PP–plots and QQ–plots . 43 3.3.2 Probability paper and plotting positions . 47 3.3.3 Hazard plot . 54 3.3.4 TTT–plot . 56 4 Linear estimation — Theory and methods 59 4.1 Types of sampling data . 59 IV Contents 4.2 Estimators based on moments of order statistics . 63 4.2.1 GLS estimators . 64 4.2.1.1 GLS for a general location–scale distribution . 65 4.2.1.2 GLS for a symmetric location–scale distribution . 71 4.2.1.3 GLS and censored samples .
    [Show full text]
  • Power Birnbaum-Saunders Student T Distribution
    Revista Integración Escuela de Matemáticas Universidad Industrial de Santander H Vol. 35, No. 1, 2017, pág. 51–70 Power Birnbaum-Saunders Student t distribution Germán Moreno-Arenas a ∗, Guillermo Martínez-Flórez b Heleno Bolfarine c a Universidad Industrial de Santander, Escuela de Matemáticas, Bucaramanga, Colombia. b Universidad de Córdoba, Departamento de Matemáticas y Estadística, Montería, Colombia. c Universidade de São Paulo, Departamento de Estatística, São Paulo, Brazil. Abstract. The fatigue life distribution proposed by Birnbaum and Saunders has been used quite effectively to model times to failure for materials sub- ject to fatigue. In this article, we introduce an extension of the classical Birnbaum-Saunders distribution substituting the normal distribution by the power Student t distribution. The new distribution is more flexible than the classical Birnbaum-Saunders distribution in terms of asymmetry and kurto- sis. We discuss maximum likelihood estimation of the model parameters and associated regression model. Two real data set are analysed and the results reveal that the proposed model better some other models proposed in the literature. Keywords: Birnbaum-Saunders distribution, alpha-power distribution, power Student t distribution. MSC2010: 62-07, 62F10, 62J02, 62N86. Distribución Birnbaum-Saunders Potencia t de Student Resumen. La distribución de probabilidad propuesta por Birnbaum y Saun- ders se ha usado con bastante eficacia para modelar tiempos de falla de ma- teriales sujetos a la fátiga. En este artículo definimos una extensión de la distribución Birnbaum-Saunders clásica sustituyendo la distribución normal por la distribución potencia t de Student. La nueva distribución es más flexi- ble que la distribución Birnbaum-Saunders clásica en términos de asimetría y curtosis.
    [Show full text]
  • Gamma and Related Distributions
    GAMMA AND RELATED DISTRIBUTIONS By Ayienda K. Carolynne Supervisor: Prof J.A.M Ottieno School of Mathematics University of Nairobi A thesis submitted to the School of Mathematics, University of Nairobi in partial fulfillment of the requirements for the degree of Master of Science in Statistics. November, 2013 Declaration I, Ayienda Kemunto Carolynne do hereby declare that this thesis is my original work and has not been submitted for a degree in any other University. Sign: ---------------------------------------- Date................................ AYIENDA KEMUNTO CAROLYNNE Sign: ---------------------------------------- Date............................... Supervisor: J.A.M OTTIENO i Dedication I wish to dedicate this thesis to my beloved husband Ezekiel Onyonka and son Steve Michael. To you all, I thank you for giving me the support needed to persue my academic dream. ii Acknowledgements I am grateful to the all-mighty God who has watched over me all my life and more so, during my challenging academic times. It is my prayer that he grants me more strength to achieve all my ambitions and aspirations in life. I am thankful to the University of Nairobi, which not only offered me a scholarship, but also moral support during this study. It is my pleasant privilege to acknowledge with gratitude one and all from whom I received guidance and encouragement during hard times of my study. Special thanks to my supervisor Prof. J.A.M Ottieno, School of Mathematics, University of Nairobi for his moral and material support, encouragement through which I have managed to do my work. Also I cannot forget to appreciate his invaluable guidance, tolerance and constructive criticism without which I would not have accomplished my objectives.
    [Show full text]
  • Hand-Book on STATISTICAL DISTRIBUTIONS for Experimentalists
    Internal Report SUF–PFY/96–01 Stockholm, 11 December 1996 1st revision, 31 October 1998 last modification 10 September 2007 Hand-book on STATISTICAL DISTRIBUTIONS for experimentalists by Christian Walck Particle Physics Group Fysikum University of Stockholm (e-mail: [email protected]) Contents 1 Introduction 1 1.1 Random Number Generation .............................. 1 2 Probability Density Functions 3 2.1 Introduction ........................................ 3 2.2 Moments ......................................... 3 2.2.1 Errors of Moments ................................ 4 2.3 Characteristic Function ................................. 4 2.4 Probability Generating Function ............................ 5 2.5 Cumulants ......................................... 6 2.6 Random Number Generation .............................. 7 2.6.1 Cumulative Technique .............................. 7 2.6.2 Accept-Reject technique ............................. 7 2.6.3 Composition Techniques ............................. 8 2.7 Multivariate Distributions ................................ 9 2.7.1 Multivariate Moments .............................. 9 2.7.2 Errors of Bivariate Moments .......................... 9 2.7.3 Joint Characteristic Function .......................... 10 2.7.4 Random Number Generation .......................... 11 3 Bernoulli Distribution 12 3.1 Introduction ........................................ 12 3.2 Relation to Other Distributions ............................. 12 4 Beta distribution 13 4.1 Introduction .......................................
    [Show full text]
  • Handbook on Probability Distributions
    R powered R-forge project Handbook on probability distributions R-forge distributions Core Team University Year 2009-2010 LATEXpowered Mac OS' TeXShop edited Contents Introduction 4 I Discrete distributions 6 1 Classic discrete distribution 7 2 Not so-common discrete distribution 27 II Continuous distributions 34 3 Finite support distribution 35 4 The Gaussian family 47 5 Exponential distribution and its extensions 56 6 Chi-squared's ditribution and related extensions 75 7 Student and related distributions 84 8 Pareto family 88 9 Logistic distribution and related extensions 108 10 Extrem Value Theory distributions 111 3 4 CONTENTS III Multivariate and generalized distributions 116 11 Generalization of common distributions 117 12 Multivariate distributions 133 13 Misc 135 Conclusion 137 Bibliography 137 A Mathematical tools 141 Introduction This guide is intended to provide a quite exhaustive (at least as I can) view on probability distri- butions. It is constructed in chapters of distribution family with a section for each distribution. Each section focuses on the tryptic: definition - estimation - application. Ultimate bibles for probability distributions are Wimmer & Altmann (1999) which lists 750 univariate discrete distributions and Johnson et al. (1994) which details continuous distributions. In the appendix, we recall the basics of probability distributions as well as \common" mathe- matical functions, cf. section A.2. And for all distribution, we use the following notations • X a random variable following a given distribution, • x a realization of this random variable, • f the density function (if it exists), • F the (cumulative) distribution function, • P (X = k) the mass probability function in k, • M the moment generating function (if it exists), • G the probability generating function (if it exists), • φ the characteristic function (if it exists), Finally all graphics are done the open source statistical software R and its numerous packages available on the Comprehensive R Archive Network (CRAN∗).
    [Show full text]