DISTRIBUTIONS for ACTUARIES David Bahnemann

Total Page:16

File Type:pdf, Size:1020Kb

DISTRIBUTIONS for ACTUARIES David Bahnemann CAS MONOGRAPH SERIES NUMBER 2 DISTRIBUTIONS FOR ACTUARIES David Bahnemann CASUALTY ACTUARIAL SOCIETY This monograph contains a brief exposition of the standard probability distributions and their applications by property/casualty actuaries. The focus is on the use of parametric distributions fitted to empirical claim data to solve standard actuarial problems, such as creation of increased limit factors, pricing of deductibles, and evaluating the effect of aggregate limits. A native of Minnesota, David Bahnemann studied mathematics and statistics at the University of Minnesota and at Stanford University. After teaching mathematics at Northwest Missouri State University for 18 years, he joined the actuarial department at the St. Paul Companies in St. Paul, Minnesota. For the next 25 years he provided actuarial support to several excess and surplus lines underwriting departments. While at the St. Paul (later known as St. Paul Travelers, and then Travelers) he was involved in large-account and program pricing. During this time he also created several computer- based pricing tools for use by both underwriters and actuaries. During retirement he divides his time between White Bear Lake and Burntside Lake in Minnesota. DISTRIBUTIONS FOR ACTUARIES David Bahnemann Casualty Actuarial Society 4350 North Fairfax Drive, Suite 250 Arlington, Virginia 22203 www.casact.org (703) 276-3100 Distributions for Actuaries By David Bahnemann Copyright 2015 by the Casualty Actuarial Society. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or trans- mitted, in any form or by any means. Electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the publisher. For information on obtaining permission for use of material in this work, please submit a written request to the Casualty Actuarial Society. Library of Congress Cataloging-in-Publication Data Bahnemann, David Distributions for Actuaries / David Bahnemann 978-0-9624762-8-0 (print edition) 978-0-9624762-9-7 (electronic edition) 1. Actuarial science. 2. Distribution (Probability theory) 3. Insurance—Mathematical models. I. Bahnemann, David. Copyright 2015, Casualty Actuarial Society To Abbie, Lisa & Greta 2015 CAS Monograph Editorial Board C. K. Stan Khury, Editor in Chief Emmanuel Bardis Craig Davis Richard Fein Jesse Groman Ali Ishaq Leslie Marlo Sholom Feldblum, consultant Glenn Meyers, consultant Katya Prell, consultant Contents Foreword ............................................................................................................... vii Preface .....................................................................................................................ix Chapter 1 Introduction ................................................................................ 1 1.1 Probability Spaces ...................................................................................1 1.2 Random Variables and Probability Distributions ....................................7 1.3 Mathematical Expectation ...................................................................20 1.4 Random Samples .................................................................................25 1.5 Fitting Distributions ............................................................................28 1.6 Problems ..............................................................................................33 Chapter 2 Claim Size ...................................................................................37 2.1 Claim-Size Random Variables ..............................................................37 2.2 Limited Moments ................................................................................40 2.3 Gamma Distributions ..........................................................................46 2.4 Lognormal Distributions......................................................................52 2.5 Pareto Distributions .............................................................................55 2.6 Estimation with Modified Data ...........................................................60 2.7 Transformations ...................................................................................64 2.8 Inflation Effects....................................................................................68 2.9 Problems ..............................................................................................71 Chapter 3 Claim Counts ..............................................................................78 3.1 An Elementary Claim Process ..............................................................78 3.2 Poisson Claim Processes .......................................................................80 3.3 Parameter Uncertainty .........................................................................85 3.4 Negative Binomial Distributions ..........................................................88 3.5 Claim Contagion .................................................................................92 3.6 Portfolio Claims ...................................................................................98 3.7 Problems ............................................................................................100 Chapter 4 Aggregate Claims ......................................................................106 4.1 A Discrete Example ............................................................................106 4.2 Aggregate Distribution Properties ......................................................107 4.3 Approximation by Matching Moments ..............................................113 4.4 Recursion ...........................................................................................119 Contents 4.5 Fourier Approximation ......................................................................124 4.6 Discontinuities ...................................................................................128 4.7 Simulation .........................................................................................129 4.8 Problems ............................................................................................137 Chapter 5 Excess Claims ............................................................................142 5.1 Excess Claim Size ...............................................................................142 5.2 Excess Severity ...................................................................................145 5.3 Layers of Coverage .............................................................................149 5.4 Excess Claim Counts .........................................................................152 5.5 Inflation Effects..................................................................................153 5.6 Aggregate Layer Claims ......................................................................156 5.7 Problems ............................................................................................158 Chapter 6 Limits and Deductibles .............................................................162 6.1 Premium Concepts ............................................................................162 6.2 Increased Limit Factors ......................................................................165 6.3 Risk Load ...........................................................................................171 6.4 Aggregate Limits ................................................................................174 6.5 Deductibles ........................................................................................175 6.6 Problems ............................................................................................182 Appendix .....................................................................................................188 A.1 Distribution Approximation ..............................................................188 A.2 Answers to Selected Problems ............................................................191 A.3 References ..........................................................................................200 vi Casualty Actuarial Society Foreword This is the second monograph in the recently introduced CAS Monograph Series. A CAS monograph is an authoritative, peer reviewed, in-depth work on an important topic within the property and casualty actuarial practice. In this monograph David Bahnemann brings together two perennially important elements of actuarial practice: a solid academic presentation of parametric distributions coupled with the application of these distributions in the actuarial paradigm. Bahnemann taught mathematics at the university level for nineteen years, thus developing an excellent appreciation for what works and what does not work in presenting and conveying technical subject matter. Following that, he worked for more than two decades in applying this knowledge to all types of real actuarial problems that actuaries face every day. Hence, we have this rare presentation of mathematics that actuaries use whenever distributions are involved. This monograph is useful for those wishing to learn the subject matter for the first time as well as for practicing actuaries who wish to have in their bookcase a “desk reference manual” for use whenever faced with a problem involving parametric distributions. This work clearly is a labor of love in which Bahnemann has brought together in a single volume his entire professional life experience
Recommended publications
  • Severity Models - Special Families of Distributions
    Severity Models - Special Families of Distributions Sections 5.3-5.4 Stat 477 - Loss Models Sections 5.3-5.4 (Stat 477) Claim Severity Models Brian Hartman - BYU 1 / 1 Introduction Introduction Given that a claim occurs, the (individual) claim size X is typically referred to as claim severity. While typically this may be of continuous random variables, sometimes claim sizes can be considered discrete. When modeling claims severity, insurers are usually concerned with the tails of the distribution. There are certain types of insurance contracts with what are called long tails. Sections 5.3-5.4 (Stat 477) Claim Severity Models Brian Hartman - BYU 2 / 1 Introduction parametric class Parametric distributions A parametric distribution consists of a set of distribution functions with each member determined by specifying one or more values called \parameters". The parameter is fixed and finite, and it could be a one value or several in which case we call it a vector of parameters. Parameter vector could be denoted by θ. The Normal distribution is a clear example: 1 −(x−µ)2=2σ2 fX (x) = p e ; 2πσ where the parameter vector θ = (µ, σ). It is well known that µ is the mean and σ2 is the variance. Some important properties: Standard Normal when µ = 0 and σ = 1. X−µ If X ∼ N(µ, σ), then Z = σ ∼ N(0; 1). Sums of Normal random variables is again Normal. If X ∼ N(µ, σ), then cX ∼ N(cµ, cσ). Sections 5.3-5.4 (Stat 477) Claim Severity Models Brian Hartman - BYU 3 / 1 Introduction some claim size distributions Some parametric claim size distributions Normal - easy to work with, but careful with getting negative claims.
    [Show full text]
  • Package 'Distributional'
    Package ‘distributional’ February 2, 2021 Title Vectorised Probability Distributions Version 0.2.2 Description Vectorised distribution objects with tools for manipulating, visualising, and using probability distributions. Designed to allow model prediction outputs to return distributions rather than their parameters, allowing users to directly interact with predictive distributions in a data-oriented workflow. In addition to providing generic replacements for p/d/q/r functions, other useful statistics can be computed including means, variances, intervals, and highest density regions. License GPL-3 Imports vctrs (>= 0.3.0), rlang (>= 0.4.5), generics, ellipsis, stats, numDeriv, ggplot2, scales, farver, digest, utils, lifecycle Suggests testthat (>= 2.1.0), covr, mvtnorm, actuar, ggdist RdMacros lifecycle URL https://pkg.mitchelloharawild.com/distributional/, https: //github.com/mitchelloharawild/distributional BugReports https://github.com/mitchelloharawild/distributional/issues Encoding UTF-8 Language en-GB LazyData true Roxygen list(markdown = TRUE, roclets=c('rd', 'collate', 'namespace')) RoxygenNote 7.1.1 1 2 R topics documented: R topics documented: autoplot.distribution . .3 cdf..............................................4 density.distribution . .4 dist_bernoulli . .5 dist_beta . .6 dist_binomial . .7 dist_burr . .8 dist_cauchy . .9 dist_chisq . 10 dist_degenerate . 11 dist_exponential . 12 dist_f . 13 dist_gamma . 14 dist_geometric . 16 dist_gumbel . 17 dist_hypergeometric . 18 dist_inflated . 20 dist_inverse_exponential . 20 dist_inverse_gamma
    [Show full text]
  • Clinical Trial Design As a Decision Problem
    Special Issue Paper Received 2 July 2016, Revised 15 November 2016, Accepted 19 November 2016 Published online 13 January 2017 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/asmb.2222 Clinical trial design as a decision problem Peter Müllera*†, Yanxun Xub and Peter F. Thallc The intent of this discussion is to highlight opportunities and limitations of utility-based and decision theoretic arguments in clinical trial design. The discussion is based on a specific case study, but the arguments and principles remain valid in general. The exam- ple concerns the design of a randomized clinical trial to compare a gel sealant versus standard care for resolving air leaks after pulmonary resection. The design follows a principled approach to optimal decision making, including a probability model for the unknown distributions of time to resolution of air leaks under the two treatment arms and an explicit utility function that quan- tifies clinical preferences for alternative outcomes. As is typical for any real application, the final implementation includessome compromises from the initial principled setup. In particular, we use the formal decision problem only for the final decision, but use reasonable ad hoc decision boundaries for making interim group sequential decisions that stop the trial early. Beyond the discussion of the particular study, we review more general considerations of using a decision theoretic approach for clinical trial design and summarize some of the reasons why such approaches are not commonly used. Copyright © 2017 John Wiley & Sons, Ltd. Keywords: Bayesian decision problem; Bayes rule; nonparametric Bayes; optimal design; sequential stopping 1. Introduction We discuss opportunities and practical limitations of approaching clinical trial design as a formal decision problem.
    [Show full text]
  • Bivariate Burr Distributions by Extending the Defining Equation
    DECLARATION This thesis contains no material which has been accepted for the award of any other degree or diploma in any university and to the best ofmy knowledge and belief, it contains no material previously published by any other person, except where due references are made in the text ofthe thesis !~ ~ Cochin-22 BISMI.G.NADH June 2005 ACKNOWLEDGEMENTS I wish to express my profound gratitude to Dr. V.K. Ramachandran Nair, Professor, Department of Statistics, CUSAT for his guidance, encouragement and support through out the development ofthis thesis. I am deeply indebted to Dr. N. Unnikrishnan Nair, Fonner Vice Chancellor, CUSAT for the motivation, persistent encouragement and constructive criticism through out the development and writing of this thesis. The co-operation rendered by all teachers in the Department of Statistics, CUSAT is also thankfully acknowledged. I also wish to express great happiness at the steadfast encouragement rendered by my family. Finally I dedicate this thesis to the loving memory of my father Late. Sri. C.K. Gopinath. BISMI.G.NADH CONTENTS Page CHAPTER I INTRODUCTION 1 1.1 Families ofDistributions 1.2· Burr system 2 1.3 Present study 17 CHAPTER II BIVARIATE BURR SYSTEM OF DISTRIBUTIONS 18 2.1 Introduction 18 2.2 Bivariate Burr system 21 2.3 General Properties of Bivariate Burr system 23 2.4 Members ofbivariate Burr system 27 CHAPTER III CHARACTERIZATIONS OF BIVARIATE BURR 33 SYSTEM 3.1 Introduction 33 3.2 Characterizations ofbivariate Burr system using reliability 33 concepts 3.3 Characterization ofbivariate
    [Show full text]
  • Loss Distributions
    Munich Personal RePEc Archive Loss Distributions Burnecki, Krzysztof and Misiorek, Adam and Weron, Rafal 2010 Online at https://mpra.ub.uni-muenchen.de/22163/ MPRA Paper No. 22163, posted 20 Apr 2010 10:14 UTC Loss Distributions1 Krzysztof Burneckia, Adam Misiorekb,c, Rafał Weronb aHugo Steinhaus Center, Wrocław University of Technology, 50-370 Wrocław, Poland bInstitute of Organization and Management, Wrocław University of Technology, 50-370 Wrocław, Poland cSantander Consumer Bank S.A., Wrocław, Poland Abstract This paper is intended as a guide to statistical inference for loss distributions. There are three basic approaches to deriving the loss distribution in an insurance risk model: empirical, analytical, and moment based. The empirical method is based on a sufficiently smooth and accurate estimate of the cumulative distribution function (cdf) and can be used only when large data sets are available. The analytical approach is probably the most often used in practice and certainly the most frequently adopted in the actuarial literature. It reduces to finding a suitable analytical expression which fits the observed data well and which is easy to handle. In some applications the exact shape of the loss distribution is not required. We may then use the moment based approach, which consists of estimating only the lowest characteristics (moments) of the distribution, like the mean and variance. Having a large collection of distributions to choose from, we need to narrow our selection to a single model and a unique parameter estimate. The type of the objective loss distribution can be easily selected by comparing the shapes of the empirical and theoretical mean excess functions.
    [Show full text]
  • Structural Equation Models and Mixture Models with Continuous Non-Normal Skewed Distributions
    Structural Equation Models And Mixture Models With Continuous Non-Normal Skewed Distributions Tihomir Asparouhov and Bengt Muth´en Mplus Web Notes: No. 19 Version 2 July 3, 2014 1 Abstract In this paper we describe a structural equation modeling framework that allows non-normal skewed distributions for the continuous observed and latent variables. This framework is based on the multivariate restricted skew t-distribution. We demonstrate the advantages of skewed structural equation modeling over standard SEM modeling and challenge the notion that structural equation models should be based only on sample means and covariances. The skewed continuous distributions are also very useful in finite mixture modeling as they prevent the formation of spurious classes formed purely to compensate for deviations in the distributions from the standard bell curve distribution. This framework is implemented in Mplus Version 7.2. 2 1 Introduction Standard structural equation models reduce data modeling down to fitting means and covariances. All other information contained in the data is ignored. In this paper, we expand the standard structural equation model framework to take into account the skewness and kurtosis of the data in addition to the means and the covariances. This new framework looks deeper into the data to yield a more informative structural equation model. There is a preconceived notion that standard structural equation models are sufficient as long as the standard errors of the parameter estimates are adjusted for failure to meet the normality assumption, but this is not correct. Even with robust estimation, the data are reduced to means and covariances. Only the standard errors of the parameter estimates extract additional information from the data.
    [Show full text]
  • Normal Variance Mixtures: Distribution, Density and Parameter Estimation
    Normal variance mixtures: Distribution, density and parameter estimation Erik Hintz1, Marius Hofert2, Christiane Lemieux3 2020-06-16 Abstract Normal variance mixtures are a class of multivariate distributions that generalize the multivariate normal by randomizing (or mixing) the covariance matrix via multiplication by a non-negative random variable W . The multivariate t distribution is an example of such mixture, where W has an inverse-gamma distribution. Algorithms to compute the joint distribution function and perform parameter estimation for the multivariate normal and t (with integer degrees of freedom) can be found in the literature and are implemented in, e.g., the R package mvtnorm. In this paper, efficient algorithms to perform these tasks in the general case of a normal variance mixture are proposed. In addition to the above two tasks, the evaluation of the joint (logarithmic) density function of a general normal variance mixture is tackled as well, as it is needed for parameter estimation and does not always exist in closed form in this more general setup. For the evaluation of the joint distribution function, the proposed algorithms apply randomized quasi-Monte Carlo (RQMC) methods in a way that improves upon existing methods proposed for the multivariate normal and t distributions. An adaptive RQMC algorithm that similarly exploits the superior convergence properties of RQMC methods is presented for the task of evaluating the joint log-density function. In turn, this allows the parameter estimation task to be accomplished via an expectation-maximization- like algorithm where all weights and log-densities are numerically estimated. It is demonstrated through numerical examples that the suggested algorithms are quite fast; even for high dimensions around 1000 the distribution function can be estimated with moderate accuracy using only a few seconds of run time.
    [Show full text]
  • Handbook on Probability Distributions
    R powered R-forge project Handbook on probability distributions R-forge distributions Core Team University Year 2009-2010 LATEXpowered Mac OS' TeXShop edited Contents Introduction 4 I Discrete distributions 6 1 Classic discrete distribution 7 2 Not so-common discrete distribution 27 II Continuous distributions 34 3 Finite support distribution 35 4 The Gaussian family 47 5 Exponential distribution and its extensions 56 6 Chi-squared's ditribution and related extensions 75 7 Student and related distributions 84 8 Pareto family 88 9 Logistic distribution and related extensions 108 10 Extrem Value Theory distributions 111 3 4 CONTENTS III Multivariate and generalized distributions 116 11 Generalization of common distributions 117 12 Multivariate distributions 133 13 Misc 135 Conclusion 137 Bibliography 137 A Mathematical tools 141 Introduction This guide is intended to provide a quite exhaustive (at least as I can) view on probability distri- butions. It is constructed in chapters of distribution family with a section for each distribution. Each section focuses on the tryptic: definition - estimation - application. Ultimate bibles for probability distributions are Wimmer & Altmann (1999) which lists 750 univariate discrete distributions and Johnson et al. (1994) which details continuous distributions. In the appendix, we recall the basics of probability distributions as well as \common" mathe- matical functions, cf. section A.2. And for all distribution, we use the following notations • X a random variable following a given distribution, • x a realization of this random variable, • f the density function (if it exists), • F the (cumulative) distribution function, • P (X = k) the mass probability function in k, • M the moment generating function (if it exists), • G the probability generating function (if it exists), • φ the characteristic function (if it exists), Finally all graphics are done the open source statistical software R and its numerous packages available on the Comprehensive R Archive Network (CRAN∗).
    [Show full text]
  • Tables for Exam STAM the Reading Material for Exam STAM Includes a Variety of Textbooks
    Tables for Exam STAM The reading material for Exam STAM includes a variety of textbooks. Each text has a set of probability distributions that are used in its readings. For those distributions used in more than one text, the choices of parameterization may not be the same in all of the books. This may be of educational value while you study, but could add a layer of uncertainty in the examination. For this latter reason, we have adopted one set of parameterizations to be used in examinations. This set will be based on Appendices A & B of Loss Models: From Data to Decisions by Klugman, Panjer and Willmot. A slightly revised version of these appendices is included in this note. A copy of this note will also be distributed to each candidate at the examination. Each text also has its own system of dedicated notation and terminology. Sometimes these may conflict. If alternative meanings could apply in an examination question, the symbols will be defined. For Exam STAM, in addition to the abridged table from Loss Models, sets of values from the standard normal and chi-square distributions will be available for use in examinations. These are also included in this note. When using the normal distribution, choose the nearest z-value to find the probability, or if the probability is given, choose the nearest z-value. No interpolation should be used. Example: If the given z-value is 0.759, and you need to find Pr(Z < 0.759) from the normal distribution table, then choose the probability for z-value = 0.76: Pr(Z < 0.76) = 0.7764.
    [Show full text]
  • On Curved Exponential Families
    U.U.D.M. Project Report 2019:10 On Curved Exponential Families Emma Angetun Examensarbete i matematik, 15 hp Handledare: Silvelyn Zwanzig Examinator: Örjan Stenflo Mars 2019 Department of Mathematics Uppsala University On Curved Exponential Families Emma Angetun March 4, 2019 Abstract This paper covers theory of inference statistical models that belongs to curved exponential families. Some of these models are the normal distribution, binomial distribution, bivariate normal distribution and the SSR model. The purpose was to evaluate the belonging properties such as sufficiency, completeness and strictly k-parametric. Where it was shown that sufficiency holds and therefore the Rao Blackwell The- orem but completeness does not so Lehmann Scheffé Theorem cannot be applied. 1 Contents 1 exponential families ...................... 3 1.1 natural parameter space ................... 6 2 curved exponential families ................. 8 2.1 Natural parameter space of curved families . 15 3 Sufficiency ............................ 15 4 Completeness .......................... 18 5 the rao-blackwell and lehmann-scheffé theorems .. 20 2 1 exponential families Statistical inference is concerned with looking for a way to use the informa- tion in observations x from the sample space X to get information about the partly unknown distribution of the random variable X. Essentially one want to find a function called statistic that describe the data without loss of im- portant information. The exponential family is in probability and statistics a class of probability measures which can be written in a certain form. The exponential family is a useful tool because if a conclusion can be made that the given statistical model or sample distribution belongs to the class then one can thereon apply the general framework that the class gives.
    [Show full text]
  • A New Generalized Burr Family of Distributions Based on Quantile Function -.:: Natural Sciences Publishing
    J. Stat. Appl. Pro. 6, No. 3, 499-504 (2017) 499 Journal of Statistics Applications & Probability An International Journal http://dx.doi.org/10.18576/jsap/060306 A New Generalized Burr Family of Distributions Based on Quantile Function Arslan Nasir1, Mohammad Aljarrah2, Farrukh Jamal3,∗ and M. H. Tahir4 1 Goverment Degree College Lodhran, Punjab, Pakistan. 2 Department of Mathematics, Tafila Technical University, At Tafila 66110, Jordan. 3 Goverment S.A. Post Graduate College Dera Nawab, Punjab, Pakistan. 4 Department of Statistics, The Islamia University Bahawalpur, Bahawalpur-63100, Pakistan. Received: 29 May 2017, Revised: 20 Aug. 2017, Accepted: 25 Aug. 2017 Published online: 1 Nov. 2017 Abstract: We propose a new family from Burr XII distribution, called T-Burr family of distributions based on the T-R{Y} framework. For this family, we consider the quantile functions of three well-known distributions, namely, Lomax, logistic and Weibull, and further developed three sub-families T-Burr{Lomax}, T-Burr{Log-logistic} and T-Burr{Weibull}. Some mathematical properties such as quantile function, mode, Shannon entropy, moments, and mean deviations, of T-R{Y} family are obtained. One special model, namely, Weibull-Burr{Lomax} from T-Burr{Lomax} family is considered and its properties are obtained. This model is flexible and can produce the shapes of the density such as left-skewed, right-skewed, symmetrical, J, and reversed-J, and can have constant, increasing and decreasing hazard rate shapes. The usefulness of this model is demonstrated through applications to censored and complete data sets. Keywords: Burr XII distribution, generalization, quantile function, T-X family, T-R{Y} family.
    [Show full text]
  • Maximum Likelihood Characterization of Distributions
    Bernoulli 20(2), 2014, 775–802 DOI: 10.3150/13-BEJ506 Maximum likelihood characterization of distributions MITIA DUERINCKX1,*, CHRISTOPHE LEY1,** and YVIK SWAN2 1D´epartement de Math´ematique, Universit´elibre de Bruxelles, Campus Plaine CP 210, Boule- vard du triomphe, B-1050 Bruxelles, Belgique. * ** E-mail: [email protected]; [email protected] 2 Universit´edu Luxembourg, Campus Kirchberg, UR en math´ematiques, 6, rue Richard Couden- hove-Kalergi, L-1359 Luxembourg, Grand-Duch´ede Luxembourg. E-mail: [email protected] A famous characterization theorem due to C.F. Gauss states that the maximum likelihood es- timator (MLE) of the parameter in a location family is the sample mean for all samples of all sample sizes if and only if the family is Gaussian. There exist many extensions of this re- sult in diverse directions, most of them focussing on location and scale families. In this paper, we propose a unified treatment of this literature by providing general MLE characterization theorems for one-parameter group families (with particular attention on location and scale pa- rameters). In doing so, we provide tools for determining whether or not a given such family is MLE-characterizable, and, in case it is, we define the fundamental concept of minimal necessary sample size at which a given characterization holds. Many of the cornerstone references on this topic are retrieved and discussed in the light of our findings, and several new characterization theorems are provided. Of particular interest is that one part of our work, namely the intro- duction of so-called equivalence classes for MLE characterizations, is a modernized version of Daniel Bernoulli’s viewpoint on maximum likelihood estimation.
    [Show full text]