Introduction to Probability and Statistics

Total Page:16

File Type:pdf, Size:1020Kb

Introduction to Probability and Statistics Introduction to probability and statistics Prof. Dr. Alexander Drewitz Universit¨at zu K¨oln Preliminary version of October 4, 2019 2 Contents 1 Probability theory 5 1.1 A short history of classical probability theory . ............. 5 1.2 Mathematical modelling of random experiments . ........... 6 1.3 σ-algebras, probability measures, and probability spaces . ............. 8 1.4 Examplesofprobabilityspaces . ....... 13 1.4.1 Discrete probability spaces . ..... 13 1.4.2 Continuous probability spaces . ...... 14 1.5 Conditionalprobabilities. ........ 15 1.6 Independence .................................... 17 1.6.1 Finite products of probability spaces . ........ 19 1.7 Randomvariables................................. 20 1.8 Sums of independent random variables and convolutions . ............. 25 1.9 Specificdistributions . ...... 25 1.9.1 Discretedistributions . 26 1.9.2 Distributions with densities . ...... 30 1.10Expectation.................................... 33 1.10.1 Second (and higher) moments . 38 1.11 Generatingfunctions . ...... 43 1.12 Convergence of random variables . ........ 45 1.12.1 Almostsureconvergence. 45 1.12.2 Convergence in p ............................... 46 L 1.12.3 Convergence in probability . ..... 47 1.12.4 Convergence in distribution . ...... 47 1.13 Some fundamental tools and inequalities . ........... 49 1.13.1 Markov’s and Cheybshev’s inequalities . ......... 49 1.13.2 The Borel-Cantelli lemmas . 50 1.13.3 Jensen’sinequality . 51 1.14 Interdependence of types of convergence of random variables ........... 52 1.15 Lawsoflargenumbers . .. .. .. .. .. .. .. .. 54 1.15.1 Weak law of large numbers . 55 1.15.2 Stronglawoflargenumbers. 59 1.15.3 Are we investigating unicorns? . ...... 61 1.16 Centrallimittheorem . ..... 61 1.17Markovchains ................................... 65 1.17.1 Stationary distribution . ...... 68 1.17.2 Classification of states . ..... 74 1.17.3 The Perron-Frobenius theorem . ..... 75 1.17.4 Quantitative convergence of Markov chains to equilibrium......... 76 3 4 CONTENTS 2 SRW basics – insert at some point 83 2.0.1 Recurrence and transience of simple random walk . ........ 83 2.0.2 Gambler’sruin ................................. 84 2.0.3 Reflectionprinciple. 85 2.0.4 Ballottheorem ................................. 85 2.0.5 Arcsinelawforlastvisits . 86 2.0.6 Arcsinelawforsojourntimes . 88 3 Statistics 91 3.1 Estimators...................................... 93 3.1.1 Propertiesofestimators . 94 3.1.2 Maximum likelihood estimators . ..... 98 3.1.3 Fisher information and the Cram´er-Rao inequality . ........... 99 3.1.4 Consistencyofestimators . 102 3.2 Confidenceregions ................................ 106 3.2.1 One recipe for constructing confidence regions . ..........106 3.2.2 Orderstatistics ............................... 107 3.3 Tests........................................... 109 3.4 Testing for alternatives (‘Alternativtests’) . ...............113 3.4.1 Chi-square test of goodness of fit (‘Chiquadrat-Anpassungstest’) . 115 Chapter 1 Probability theory 1.1 A short history of classical probability theory Arguably the first historically acknowledged hour of birth of probability was a series of exchanges of letters between Pascal and de Fermat in 1654. Most prominently, they were investigating the so-called ’problem of points’, in which the setting is as follows: Two players, A and B, are consecutively playing a game in which each player has the same chance of winning. The first player who wins a fixed number of times (say, three) wins the prize money. Now imagine that due to unforeseen circumstances the tournament has to be stopped prematurely at a time when player A has won twice and player B has won once. How should the prize money be distributed? In fact, this problem had been around for some time already when Pascal and de Fermat solved it: Attempts to solve it had been given by Pacioli in 1494, Cardano in 1539 and Forestani in 1603, just to name a few. It was also suggested that the problem is in fact a jurisdictional problem, not a mathematical one. The bulk of those solutions either comes out of (mathematically) thin air and looks naive or at least implausible from today’s point of view. Either way, in the exchange of letters mentioned above, de Fermat gave a solution to the problem by essentially enumerating all possible outcomes for the remaining games and then distributing the money in the same ratio as these outcomes make player A or B win, respectively.1 Pascal, on the other hand, realised that this combinatorial approach is getting prohibitively expensive in the case of not three wins, but an arbitrary number of wins. He solved the problem recursively, thereby reducing the computational complexity and at the same time already introducing the fundamental concept of an ‘expectation’ (without naming it this way), see Definition 1.10.1. Further motivation came from the desire to gauge insurance premia; it is not completely clear, however, why it was exactly during the above centuries that probability theory came into being. In fact, gambling had been a favorite pastime for millenia,2 and also insurance and law (see e.g. [Ber12]) would have proved useful areas of application of probability theory. It may be argued that the means to ‘generate’ randomness had not been sufficiently sophisticated before, when for example in the place of dice people used so-called ‘Astragali’, which is a type of bone. Indeed, according to today’s knowledge, it was only in the 10th century that all possible results of a series of throwing a dice several times had been completely enumerated. A further impediment might have been that (excessive) gambling had been frowned upon by the Catholic Church. Cardano, who had already been mentioned above, and who also was 1Inherent to this approach is the assumption that all those possible outcomes of the remaining games are equally likely (i.e., they have the same probability, although the definition of probability in this setting is not clear either; there are at least two different interpretations, the ‘objective’ (i.e., derived through symmetries such as in dice or coin tosses – keep in mind, though, that coins and dice do usually have some asymmetries; for example, for dice one usually has a number of cavities on each side specifying the value of the dice roll – or by investigating frequencies with which certain events happen) and the ‘subjective’ (epistemic or Bayesian, where a probability is a measure of the intensity of belief). We will soon have a closer look at the frequentist motivation. 2in china, and also around 500 BC people were gambling (throwing dice?) on the streets of rome 5 6 CHAPTER 1. PROBABILITY THEORY a medic3, found a neat way around this moral difficulty when explaining why he dealt with gambling: ‘Even if gambling were altogheter an evil, still, on account of the very large number of people who play, it would seem to be a natural evil. For that very reason it ought to be discussed by a medical doctor like one of the incurable diseases.’4 1.2 Mathematical modelling of random experiments So while the intuition of probability has been around for a long time from a practical point of view, it was only in 19335 that A. N. Kolmogorov started developing an axiomatic mathematical theory, see [Kol33]. While it is worthwhile to have a look at this original article, other texts which are suitable for accompanying this lecture are the books by A. Renyi, [Ren69], I. Schneider [Sch88], G. Gigerenzer and C. Kr¨uger [GK99], as well as U. Krengel [Kre05] and R. Durrett [Dur10]. One of the principal goals of probability theory is to describe experiments which are either random or for which one does not have sufficient information or sufficient computing power to describe them deterministically. For the latter, consider e.g. the rules of playing roulette in a casino. Even after the croupier has spun the wheel as well as the ball, a player is still allowed to make a bet until ‘rien ne va plus’. However, once ball and wheel have been spun essentially all information necessary to compute the (now essentially deterministic apart from possible air currents, percussions, etc.) outcome of this particular instance of the game. Hence, if a player was fast enough to use all this information to compute the outcome of this game, she would be able to beat the house. However, without technical support (see e.g. so-called ‘roulette computers’), a random modeling of this situation is still very close to reality. In the same spirit, a sequence of dice rolls or coin tosses would be considered random experiments. We will not go into details of discussing the more philosophical question of whether randomness actually does exist or not (see also ‘Bell’s theorem’ for a more philosophical contribution on the existence of randomness). A possible outcome of an experiment such as the one described above is usually referred to as an elementary event and denoted by ω. The set of elementary events is most often denoted by Ω (so ω Ω). An event is a set A Ω of elementary events to which we want to be able to ∈ ⊂ associate a probability P(A) via a suitable function P : 2Ω [0, 1] defined on the power set → 2Ω of Ω (i.e., on the set of all subsets of Ω). We will later on see that wanting to associate a probability to each A Ω is generally asking too much and such a function P will have to be ⊂ restricted to a subset of 2Ω.6 However, as long as Ω is either finite or countable such problems will not arise. Given an elementary event ω Ω (i.e. the outcome of a single realisation of a random experi- ∈ ment) and an event A Ω, the interpretation is that the event A has occurred if ω A and A ⊂ ∈ has not occurred if ω / A. ∈ The first goal in describing experiments as outlined above is to find Ω and P in a way that is suitable for the respective experiment. Example 1.2.1.
Recommended publications
  • Diagonal Estimation with Probing Methods
    Diagonal Estimation with Probing Methods Bryan J. Kaperick Dissertation submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of Master of Science in Mathematics Matthias C. Chung, Chair Julianne M. Chung Serkan Gugercin Jayanth Jagalur-Mohan May 3, 2019 Blacksburg, Virginia Keywords: Probing Methods, Numerical Linear Algebra, Computational Inverse Problems Copyright 2019, Bryan J. Kaperick Diagonal Estimation with Probing Methods Bryan J. Kaperick (ABSTRACT) Probing methods for trace estimation of large, sparse matrices has been studied for several decades. In recent years, there has been some work to extend these techniques to instead estimate the diagonal entries of these systems directly. We extend some analysis of trace estimators to their corresponding diagonal estimators, propose a new class of deterministic diagonal estimators which are well-suited to parallel architectures along with heuristic ar- guments for the design choices in their construction, and conclude with numerical results on diagonal estimation and ordering problems, demonstrating the strengths of our newly- developed methods alongside existing methods. Diagonal Estimation with Probing Methods Bryan J. Kaperick (GENERAL AUDIENCE ABSTRACT) In the past several decades, as computational resources increase, a recurring problem is that of estimating certain properties very large linear systems (matrices containing real or complex entries). One particularly important quantity is the trace of a matrix, defined as the sum of the entries along its diagonal. In this thesis, we explore a problem that has only recently been studied, in estimating the diagonal entries of a particular matrix explicitly. For these methods to be computationally more efficient than existing methods, and with favorable convergence properties, we require the matrix in question to have a majority of its entries be zero (the matrix is sparse), with the largest-magnitude entries clustered near and on its diagonal, and very large in size.
    [Show full text]
  • A Guide on Probability Distributions
    powered project A guide on probability distributions R-forge distributions Core Team University Year 2008-2009 LATEXpowered Mac OS' TeXShop edited Contents Introduction 4 I Discrete distributions 6 1 Classic discrete distribution 7 2 Not so-common discrete distribution 27 II Continuous distributions 34 3 Finite support distribution 35 4 The Gaussian family 47 5 Exponential distribution and its extensions 56 6 Chi-squared's ditribution and related extensions 75 7 Student and related distributions 84 8 Pareto family 88 9 Logistic ditribution and related extensions 108 10 Extrem Value Theory distributions 111 3 4 CONTENTS III Multivariate and generalized distributions 116 11 Generalization of common distributions 117 12 Multivariate distributions 132 13 Misc 134 Conclusion 135 Bibliography 135 A Mathematical tools 138 Introduction This guide is intended to provide a quite exhaustive (at least as I can) view on probability distri- butions. It is constructed in chapters of distribution family with a section for each distribution. Each section focuses on the tryptic: definition - estimation - application. Ultimate bibles for probability distributions are Wimmer & Altmann (1999) which lists 750 univariate discrete distributions and Johnson et al. (1994) which details continuous distributions. In the appendix, we recall the basics of probability distributions as well as \common" mathe- matical functions, cf. section A.2. And for all distribution, we use the following notations • X a random variable following a given distribution, • x a realization of this random variable, • f the density function (if it exists), • F the (cumulative) distribution function, • P (X = k) the mass probability function in k, • M the moment generating function (if it exists), • G the probability generating function (if it exists), • φ the characteristic function (if it exists), Finally all graphics are done the open source statistical software R and its numerous packages available on the Comprehensive R Archive Network (CRAN∗).
    [Show full text]
  • Probability Statistics Economists
    PROBABILITY AND STATISTICS FOR ECONOMISTS BRUCE E. HANSEN Contents Preface x Acknowledgements xi Mathematical Preparation xii Notation xiii 1 Basic Probability Theory 1 1.1 Introduction . 1 1.2 Outcomes and Events . 1 1.3 Probability Function . 3 1.4 Properties of the Probability Function . 4 1.5 Equally-Likely Outcomes . 5 1.6 Joint Events . 5 1.7 Conditional Probability . 6 1.8 Independence . 7 1.9 Law of Total Probability . 9 1.10 Bayes Rule . 10 1.11 Permutations and Combinations . 11 1.12 Sampling With and Without Replacement . 13 1.13 Poker Hands . 14 1.14 Sigma Fields* . 16 1.15 Technical Proofs* . 17 1.16 Exercises . 18 2 Random Variables 22 2.1 Introduction . 22 2.2 Random Variables . 22 2.3 Discrete Random Variables . 22 2.4 Transformations . 24 2.5 Expectation . 25 2.6 Finiteness of Expectations . 26 2.7 Distribution Function . 28 2.8 Continuous Random Variables . 29 2.9 Quantiles . 31 2.10 Density Functions . 31 2.11 Transformations of Continuous Random Variables . 34 2.12 Non-Monotonic Transformations . 36 ii CONTENTS iii 2.13 Expectation of Continuous Random Variables . 37 2.14 Finiteness of Expectations . 39 2.15 Unifying Notation . 39 2.16 Mean and Variance . 40 2.17 Moments . 42 2.18 Jensen’s Inequality . 42 2.19 Applications of Jensen’s Inequality* . 43 2.20 Symmetric Distributions . 45 2.21 Truncated Distributions . 46 2.22 Censored Distributions . 47 2.23 Moment Generating Function . 48 2.24 Cumulants . 50 2.25 Characteristic Function . 51 2.26 Expectation: Mathematical Details* . 52 2.27 Exercises .
    [Show full text]
  • Robust Inversion, Dimensionality Reduction, and Randomized Sampling
    Robust inversion, dimensionality reduction, and randomized sampling Aleksandr Aravkin Michael P. Friedlander Felix J. Herrmann Tristan van Leeuwen November 16, 2011 Abstract We consider a class of inverse problems in which the forward model is the solution operator to linear ODEs or PDEs. This class admits several di- mensionality-reduction techniques based on data averaging or sampling, which are especially useful for large-scale problems. We survey these approaches and their connection to stochastic optimization. The data-averaging approach is only viable, however, for a least-squares misfit, which is sensitive to outliers in the data and artifacts unexplained by the forward model. This motivates us to propose a robust formulation based on the Student's t-distribution of the error. We demonstrate how the corresponding penalty function, together with the sampling approach, can obtain good results for a large-scale seismic inverse problem with 50% corrupted data. Keywords inverse problems · seismic inversion · stochastic optimization · robust estimation 1 Introduction Consider the generic parameter-estimation scheme in which we conduct m exper- iments, recording the corresponding experimental input vectors fq1; q2; : : : ; qmg and observation vectors fd1; d2; : : : ; dmg. We model the data for given parameters n x 2 R by di = Fi(x)qi + i for i = 1; : : : ; m; (1.1) This work was in part financially supported by the Natural Sciences and Engineering Research Council of Canada Discovery Grant (22R81254) and the Collaborative Research and Develop- ment Grant DNOISE II (375142-08). This research was carried out as part of the SINBAD II project with support from the following organizations: BG Group, BPG, BP, Chevron, Conoco Phillips, Petrobras, PGS, Total SA, and WesternGeco.
    [Show full text]
  • Handbook on Probability Distributions
    R powered R-forge project Handbook on probability distributions R-forge distributions Core Team University Year 2009-2010 LATEXpowered Mac OS' TeXShop edited Contents Introduction 4 I Discrete distributions 6 1 Classic discrete distribution 7 2 Not so-common discrete distribution 27 II Continuous distributions 34 3 Finite support distribution 35 4 The Gaussian family 47 5 Exponential distribution and its extensions 56 6 Chi-squared's ditribution and related extensions 75 7 Student and related distributions 84 8 Pareto family 88 9 Logistic distribution and related extensions 108 10 Extrem Value Theory distributions 111 3 4 CONTENTS III Multivariate and generalized distributions 116 11 Generalization of common distributions 117 12 Multivariate distributions 133 13 Misc 135 Conclusion 137 Bibliography 137 A Mathematical tools 141 Introduction This guide is intended to provide a quite exhaustive (at least as I can) view on probability distri- butions. It is constructed in chapters of distribution family with a section for each distribution. Each section focuses on the tryptic: definition - estimation - application. Ultimate bibles for probability distributions are Wimmer & Altmann (1999) which lists 750 univariate discrete distributions and Johnson et al. (1994) which details continuous distributions. In the appendix, we recall the basics of probability distributions as well as \common" mathe- matical functions, cf. section A.2. And for all distribution, we use the following notations • X a random variable following a given distribution, • x a realization of this random variable, • f the density function (if it exists), • F the (cumulative) distribution function, • P (X = k) the mass probability function in k, • M the moment generating function (if it exists), • G the probability generating function (if it exists), • φ the characteristic function (if it exists), Finally all graphics are done the open source statistical software R and its numerous packages available on the Comprehensive R Archive Network (CRAN∗).
    [Show full text]
  • Spectral Barriers in Certification Problems
    Spectral Barriers in Certification Problems by Dmitriy Kunisky A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy Department of Mathematics New York University May, 2021 Professor Afonso S. Bandeira Professor Gérard Ben Arous © Dmitriy Kunisky all rights reserved, 2021 Dedication To my grandparents. iii Acknowledgements First and foremost, I am immensely grateful to Afonso Bandeira for his generosity with his time, his broad mathematical insight, his resourceful creativity, and of course his famous reservoir of open problems. Afonso’s wonderful knack for keeping up the momentum— for always finding some new angle or somebody new to talk to when we were stuck, for mentioning “you know, just yesterday I heard a very nice problem...” right when I could use something else to work on—made five years fly by. I have Afonso to thank for my sense of taste in research, for learning to collaborate, for remembering to enjoy myself, and above all for coming to be ever on the lookout for something new to think about. I am also indebted to Gérard Ben Arous, who, though I wound up working on topics far from his main interests, was always willing to hear out what I’d been doing, to encourage and give suggestions, and to share his truly encyclopedic knowledge of mathematics and its culture and history. On many occasions I’ve learned more tracking down what Gérard meant with the smallest aside in the classroom or in our meetings than I could manage to get out of a week on my own in the library.
    [Show full text]
  • Field Guide to Continuous Probability Distributions
    Field Guide to Continuous Probability Distributions Gavin E. Crooks v 1.0.0 2019 G. E. Crooks – Field Guide to Probability Distributions v 1.0.0 Copyright © 2010-2019 Gavin E. Crooks ISBN: 978-1-7339381-0-5 http://threeplusone.com/fieldguide Berkeley Institute for Theoretical Sciences (BITS) typeset on 2019-04-10 with XeTeX version 0.99999 fonts: Trump Mediaeval (text), Euler (math) 271828182845904 2 G. E. Crooks – Field Guide to Probability Distributions Preface: The search for GUD A common problem is that of describing the probability distribution of a single, continuous variable. A few distributions, such as the normal and exponential, were discovered in the 1800’s or earlier. But about a century ago the great statistician, Karl Pearson, realized that the known probabil- ity distributions were not sufficient to handle all of the phenomena then under investigation, and set out to create new distributions with useful properties. During the 20th century this process continued with abandon and a vast menagerie of distinct mathematical forms were discovered and invented, investigated, analyzed, rediscovered and renamed, all for the purpose of de- scribing the probability of some interesting variable. There are hundreds of named distributions and synonyms in current usage. The apparent diver- sity is unending and disorienting. Fortunately, the situation is less confused than it might at first appear. Most common, continuous, univariate, unimodal distributions can be orga- nized into a small number of distinct families, which are all special cases of a single Grand Unified Distribution. This compendium details these hun- dred or so simple distributions, their properties and their interrelations.
    [Show full text]
  • Package 'Extradistr'
    Package ‘extraDistr’ September 7, 2020 Type Package Title Additional Univariate and Multivariate Distributions Version 1.9.1 Date 2020-08-20 Author Tymoteusz Wolodzko Maintainer Tymoteusz Wolodzko <[email protected]> Description Density, distribution function, quantile function and random generation for a number of univariate and multivariate distributions. This package implements the following distributions: Bernoulli, beta-binomial, beta-negative binomial, beta prime, Bhattacharjee, Birnbaum-Saunders, bivariate normal, bivariate Poisson, categorical, Dirichlet, Dirichlet-multinomial, discrete gamma, discrete Laplace, discrete normal, discrete uniform, discrete Weibull, Frechet, gamma-Poisson, generalized extreme value, Gompertz, generalized Pareto, Gumbel, half-Cauchy, half-normal, half-t, Huber density, inverse chi-squared, inverse-gamma, Kumaraswamy, Laplace, location-scale t, logarithmic, Lomax, multivariate hypergeometric, multinomial, negative hypergeometric, non-standard beta, normal mixture, Poisson mixture, Pareto, power, reparametrized beta, Rayleigh, shifted Gompertz, Skellam, slash, triangular, truncated binomial, truncated normal, truncated Poisson, Tukey lambda, Wald, zero-inflated binomial, zero-inflated negative binomial, zero-inflated Poisson. License GPL-2 URL https://github.com/twolodzko/extraDistr BugReports https://github.com/twolodzko/extraDistr/issues Encoding UTF-8 LazyData TRUE Depends R (>= 3.1.0) LinkingTo Rcpp 1 2 R topics documented: Imports Rcpp Suggests testthat, LaplacesDemon, VGAM, evd, hoa,
    [Show full text]
  • Inference Based on the Wild Bootstrap
    Inference Based on the Wild Bootstrap James G. MacKinnon Department of Economics Queen's University Kingston, Ontario, Canada K7L 3N6 email: [email protected] Ottawa September 14, 2012 The Wild Bootstrap Consider the linear regression model 2 2 yi = Xi β + ui; E(ui ) = σi ; i = 1; : : : ; n: (1) One natural way to bootstrap this model is to use the residual bootstrap. We condition on X, β^, and the empirical distribution of the residuals (perhaps transformed). Thus the bootstrap DGP is ∗ ^ ∗ ∗ ∼ yi = Xi β + ui ; ui EDF(^ui): (2) Strong assumptions! This assumes that E(yi j Xi) = Xi β and that the error terms are IID, which implies homoskedasticity. At the opposite extreme is the pairs bootstrap, which draws bootstrap samples from the joint EDF of [yi; Xi]. This assumes that there exists such a joint EDF, but it makes no assumptions about the properties of the ui or about the functional form of E(yi j Xi). The wild bootstrap is in some ways intermediate between the residual and pairs bootstraps. It assumes that E(yi j Xi) = Xi β, but it allows for heteroskedasticity by conditioning on the (possibly transformed) residuals. If no restrictions are imposed, wild bootstrap DGP is ∗ ^ ∗ yi = Xi β + f(^ui)vi ; (3) th ∗ where f(^ui) is a transformation of the i residualu ^i, and vi has mean 0. ∗ 6 Thus E f(^ui)vi = 0 even if E f(^ui) = 0. Common choices: p w1: f(^ui) = n=(n − k)u ^i; u^ w2: f(^u ) = i ; i 1=2 (1 − hi) u^i w3: f(^ui) = : 1 − hi th Here hi is the i diagonal element of the \hat matrix" > −1 > PX ≡ X(X X) X : The w1, w2, and w3 transformations are analogous to the ones used in the HC1, HC2, and HC3 covariance matrix estimators.
    [Show full text]
  • The Generalized Poisson-Binomial Distribution and the Computation of Its Distribution Function
    The Generalized Poisson-Binomial Distribution and the Computation of Its Distribution Function Man Zhang and Yili Hong Narayanaswamy Balakrishnan Department of Statistics Department of Mathematics and Statistics Virginia Tech McMaster University Blacksburg, VA 24061, USA Hamilton, ON, L8S 4K1, Canada February 10, 2018 Abstract The Poisson-binomial distribution is useful in many applied problems in engineering, actuarial science, and data mining. The Poisson-binomial distribution models the dis- tribution of the sum of independent but non-identically distributed random indicators whose success probabilities vary. In this paper, we extend the Poisson-binomial distri- bution to a generalized Poisson-binomial (GPB) distribution. The GPB distribution corresponds to the case where the random indicators are replaced by two-point random variables, which can take two arbitrary values instead of 0 and 1 as in the case of ran- dom indicators. The GPB distribution has found applications in many areas such as voting theory, actuarial science, warranty prediction, and probability theory. As the GPB distribution has not been studied in detail so far, we introduce this distribution first and then derive its theoretical properties. We develop an efficient algorithm for the computation of its distribution function, using the fast Fourier transform. We test the accuracy of the developed algorithm by comparing it with enumeration-based exact method and the results from the binomial distribution. We also study the computational time of the algorithm under various parameter settings. Finally, we discuss the factors affecting the computational efficiency of the proposed algorithm, and illustrate the use of the software package. Key Words: Actuarial Science; Discrete Distribution; Fast Fourier Transform; Rademacher Distribution; Voting Theory; Warranty Cost Prediction.
    [Show full text]
  • Probability Distribution Relationships
    STATISTICA, anno LXX, n. 1, 2010 PROBABILITY DISTRIBUTION RELATIONSHIPS Y.H. Abdelkader, Z.A. Al-Marzouq 1. INTRODUCTION In spite of the variety of the probability distributions, many of them are related to each other by different kinds of relationship. Deriving the probability distribu- tion from other probability distributions are useful in different situations, for ex- ample, parameter estimations, simulation, and finding the probability of a certain distribution depends on a table of another distribution. The relationships among the probability distributions could be one of the two classifications: the transfor- mations and limiting distributions. In the transformations, there are three most popular techniques for finding a probability distribution from another one. These three techniques are: 1 - The cumulative distribution function technique 2 - The transformation technique 3 - The moment generating function technique. The main idea of these techniques works as follows: For given functions gin(,XX12 ,...,) X, for ik 1, 2, ..., where the joint dis- tribution of random variables (r.v.’s) XX12, , ..., Xn is given, we define the func- tions YgXXii (12 , , ..., X n ), i 1, 2, ..., k (1) The joint distribution of YY12, , ..., Yn can be determined by one of the suit- able method sated above. In particular, for k 1, we seek the distribution of YgX () (2) For some function g(X ) and a given r.v. X . The equation (1) may be linear or non-linear equation. In the case of linearity, it could be taken the form n YaX ¦ ii (3) i 1 42 Y.H. Abdelkader, Z.A. Al-Marzouq Many distributions, for this linear transformation, give the same distributions for different values for ai such as: normal, gamma, chi-square and Cauchy for continuous distributions and Poisson, binomial, negative binomial for discrete distributions as indicated in the Figures by double rectangles.
    [Show full text]
  • On Strict Sub-Gaussianity, Optimal Proxy Variance and Symmetry for Bounded Random Variables Julyan Arbel, Olivier Marchal, Hien Nguyen
    On strict sub-Gaussianity, optimal proxy variance and symmetry for bounded random variables Julyan Arbel, Olivier Marchal, Hien Nguyen To cite this version: Julyan Arbel, Olivier Marchal, Hien Nguyen. On strict sub-Gaussianity, optimal proxy variance and symmetry for bounded random variables. ESAIM: Probability and Statistics, EDP Sciences, 2020, 24, pp.39-55. 10.1051/ps/2019018. hal-01998252 HAL Id: hal-01998252 https://hal.archives-ouvertes.fr/hal-01998252 Submitted on 29 Jan 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. On strict sub-Gaussianity, optimal proxy variance and symmetry for bounded random variables Julyan Arbel1∗, Olivier Marchal2, Hien D. Nguyen3 ∗Corresponding author, email: [email protected]. 1Universit´eGrenoble Alpes, Inria, CNRS, Grenoble INP, LJK, 38000 Grenoble, France. 2Universit´ede Lyon, CNRS UMR 5208, Universit´eJean Monnet, Institut Camille Jordan, 69000 Lyon, France. 3Department of Mathematics and Statistics, La Trobe University, Bundoora Melbourne 3086, Victoria Australia. Abstract We investigate the sub-Gaussian property for almost surely bounded random variables. If sub-Gaussianity per se is de facto ensured by the bounded support of said random variables, then exciting research avenues remain open.
    [Show full text]