ON a MULTIVARIATE PARETO DISTRIBUTION 1 Alexandru V

Total Page:16

File Type:pdf, Size:1020Kb

ON a MULTIVARIATE PARETO DISTRIBUTION 1 Alexandru V ON A MULTIVARIATE PARETO DISTRIBUTION 1 Alexandru V. Asimit School of Mathematics, University of Manchester, Manchester M13 9PL, United Kingdom. E-mail: [email protected] Edward Furman2; 3 Department of Mathematics and Statistics, York University, Toronto, Ontario M3J 1P3, Canada. E-mail: [email protected] Raluca Vernic Faculty of Mathematics and Informatics, Ovidius University of Constanta, Constanta, Romania. E-mail: [email protected] Abstract. A multivariate distribution possessing arbitrarily parameter- ized Pareto margins is formulated and studied. The distribution is believed to allow for an adequate modeling of dependent heavy tailed risks with a non-zero probability of simultaneous loss. Numerous links to certain nowa- days existing probabilistic models, as well as seemingly useful character- istic results are proved. Expressions for, e.g., decumulative distribution functions, densities, (joint) moments and regressions are developed. An application to the classical pricing problem is considered, and some formu- las are derived using the recently introduced economic weighted premium calculation principles. Keywords and phrases: Multivariate Pareto distributions, characteriza- tions, mixtures, dependence, simultaneous loss, economic weighted pricing. Mathematics Subject Classi¯cation: IM01, IM10, IM54. 1We are indebted to Professors Gheoghita Zbaganu (University of Bucharest, Romania) and Ri·cardas Zitikis (University of Western Ontario, Canada) for many illuminating discussions, and we also sincerely thank the anonymous referee for helpful comments and suggestions. 2The ¯nancial support of Natural Sciences and Engineering Research Council (NSERC) of Canada is gratefully appreciated. 3Corresponding author. 2 1. Introduction Pareto probability laws were formulated by Pareto (1897) and initially dealt with the distribution of income over a population. They are nowadays popular in describing a variety of other phenomena such as city population sizes, occurrence of natural resources, stock price fluctuations, to name a few. In actuarial mathematics, the excess-of-loss random variable (rv) Y = X ¡d given that X > d, possesses the decumulative distribution function (ddf) F X;d(x) = P[X ¡ d > xj X > d]; (1.1) and it is of central importance in both life and non-life insurance contexts. Loosely speaking, ddf (1.1) on the one hand describes a risk inherent in a reinsurance treaty bearing a deductible d, and on the other hand it corresponds to the age-at-death of a new born child given survival to age d. Interestingly, it can be shown that for large d, appropriately chosen function σ(d) > 0 and a constant ® > 0, the approximation µ ¶ x ¡® F (x) ¼ 1 + ; x > 0 (1.2) X;d σ(d) holds for a large class of distribution functions (see, e.g., Balkema and de Haan, 1974; Pickands, 1975). Thus, the excess-of-loss rv can be generally approximated by a Pareto of the second kind. This, in turn, immediately hints at utilizing this probability law for modeling, e.g., catastrophe insurance treaties, which are known to be characterized by rather heavy tailed risks. In view of the high popularity of the univariate Pareto distributions and the aforemen- tioned implication of equation (1.2), it is quite natural to ask for a multivariate extension. Indeed, the concept of dependence, unfairly neglected in the `classical' actuarial science, has been receiving its merited attention in the recent years, thus leading to numerous papers touching on various aspects of the multivariate distribution theory with applica- tions to insurance (see, e.g., Vernic, 1997, 2000; Pfeifer and Ne·slehov¶a,2004; Bauerle and Grubel, 2005; Roger et al., 2005; Centeno, 2005; Furman, 2008; Furman and Landsman, 2008, 2009; Chiragiev and Landsman, 2009). Recently, probably due to the observation formulated by equation (1.2), the multivariate Pareto distribution of the second kind having the ddf à !¡® Xn x ¡ ¹ F (x ; : : : ; x ) = 1 + j j ; x > ¹ ; (1.3) 1 n σ j j j=1 j 3 where ¹j 2 (¡1; 1) := R, σj 2 (0; 1) := R+ and j = 1; 2; : : : ; n, has been employed to model insurance risks in the context of risk capital allocations and optimal reinsur- ance retentions by Chiragiev and Landsman (2007) (see, also, Vernic, 2009) and Cai and Tan (2007), respectively. In what follows, we denote the random vectors distributed ¤ multivariate Pareto of the second kind with ddf (1.3) as X v P an(II)(¹; σ; ®), where 0 n n 0 n n ¹ = (¹1; : : : ; ¹n) 2 (¡1; 1) := R and σ = (σ1; : : : ; σn) 2 (0; 1) := R+ are two constant vectors, and ® is a positive constant. Although the model seems to be quite tractable, it inconveniently results in common shape parameters for all univariate mar- gins Xj; j = 1; 2; : : : ; n, and thus imposes the somewhat restrictive correlation coe±cient given by 1 Corr[X ;X ] = ; for all j 6= l 2 f1; 2; : : : ; ng and ® > 2: j l ® Another well-known inconvenience associated with (1.3) is that it does not allow to model independent Pareto distributed risks. In the present paper we propose an alternative to (1.3) multivariate generalization of the univariate Pareto probability laws. The probabilistic model introduced herein is constructed using the multivariate reduction method (see, e.g., Furman and Landsman, 2009; and references therein). The motivation for, as well as the interpretation of the aforementioned construction stem from, e.g., the background economy and the common shock models (see, e.g., Tsanakas, 2008; Boucher et al., 2008). The resulting multivariate Pareto, denoted in the sequel as X v P an(II)(¹; σ; ®; ®), with ¹, σ and ® as before, 0 n and ® = (®1; : : : ; ®n) 2 R+, is marginally closed, allows for non-negative probabilities of simultaneous losses and possesses a more flexible than (1.3) dependence structure. In addition, setting ® ´ 0 yields a probabilistic model having independent Pareto distributed margins. The rest of the paper is organized as follows: In Section 2 the multivariate Pareto of interest is introduced, and its various properties are derived. In Section 3 the discussion is specialized to the bivariate case, for which the (joint) moments, the conditional dis- tributions along with the conditional moments are developed. Finally, in Section 4 it is shown that the regression function of the multivariate Pareto introduced herein is `sepa- rable', which is then employed to facilitate its applications to insurance pricing. Section 5 concludes the paper. 4 2. The multivariate Pareto of the second kind In the sequel, we ¯x the probability space (­; F; P), and we are interested in construct- 0 ing a random vector X = (X1;:::;Xn) , which is a map from the aformentioned space n n into the n-dimensional Borel space (R+; B ), such that the j-th coordinate, j = 1; : : : ; n, of X is a univariate Pareto of the second kind. In other words, Xj v P a(II)(¹j; σj; ®j), and µ ¶¡®j x ¡ ¹j F Xj (x) = 1 + ; x > ¹j; and σj; ®j > 0: σj 0 To this end, let Y = (Y0;Y1;:::;Yn) be an (n + 1) variate random vector possess- ing mutually independent univariate Pareto margins Y0 v P a(II)(0; 1; ®0) and Yj v n+1 n P a(II)(¹j; σj; ®j); j = 1; : : : ; n. Denote by T the functional map T : R+ ! R+. De¯nition 2.1. The random vector X = T (Y), such that Xj = min (σjY0 + ¹j;Yj) ; j = 1; : : : ; n; is said to follow the multivariate Pareto distribution of the second kind P an (II)(¹; σ; ®; ®0), n n with ¹ and σ, ® being constant vectors in R and R+, respectively, and ®0 > 0. Note 2.1. It is possible to formulate the de¯nition above for a slightly more general random vector Y by assuming Y0 v P a(II)(¹0; σ0; ®0), which is a non-standardized Pareto n+1 n of the second kind. In such a case, the map T : R+ ! R+ is such that µ ¶ σj Xj = min (Y0 ¡ ¹0) + ¹j;Yj ; j = 1; : : : ; n: σ0 However, as it is readily observed from Proposition 2.1, neither ¹0 nor σ0 influences the ddf of X, and they are hence set to ¹0 ´ 0 and σ0 ´ 1 without loss of generality. We note in passing that various particular cases of the just de¯ned multivariate Pareto have been studied. Namely, when ¹j = σj ´ const for j = 1; : : : ; n, the model is called the Marshall-Olkin type (see, Marshall and Olkin, 1967) multivariate Pareto in Kotz et al. (2000), and it has been explored by Muliere and Scarsini (1987) in the two dimensional, and by Hanagal (1996) in the multidimensional contexts. We further derive the multivariate ddf of the distribution de¯ned in De¯nition 2.1. To enhance the readability of the paper, we relegate the proof of the next proposition, as well as other technical proofs to the Appendix. 5 Proposition 2.1. Let X v P an(II)(¹; σ; ®; ®0). The ddf of X is then µ ¶ µ ¶ ¡®0 Yn ¡®j ¹ xj ¡ ¹j xj ¡ ¹j FX (x) = 1 + max 1 + ; xj > ¹j; j = 1; : : : ; n; (2.1) j=1;:::;n σ σ j j=1 j where ¹j are real, and σj; ®j; ®0 are positive constants. Note 2.2. It is straightforwardly observed that ddf (2.1) is invariant under linear trans- formations, and it thus belongs to a (multivariate) location-scale family of distributions. Consequently, in the sequel, we shall frequently use the so-called standardized form of the ddf, which certainly does not reduce the level of generality, but strongly contributes to the clarity of the results. More speci¯cally, we shall often consider the multivariate Pareto Z v P an(II)(0; 1; ®; ®0), with ¹ = 0 and σ = 1 being n-variate constant vectors of zeros and ones, respectively, and the ddf µ ¶ ¡®0 Yn ¹ ¡®j FZ (z1; : : : ; zn) = 1 + max zj (1 + zj) ; zj > 0; j = 1; : : : ; n: (2.2) j=1;:::;n j=1 It seems worth noticing that there exist alternative to De¯nition 2.1 ways to arrive at ddf (2.1).
Recommended publications
  • A New Parameter Estimator for the Generalized Pareto Distribution Under the Peaks Over Threshold Framework
    mathematics Article A New Parameter Estimator for the Generalized Pareto Distribution under the Peaks over Threshold Framework Xu Zhao 1,*, Zhongxian Zhang 1, Weihu Cheng 1 and Pengyue Zhang 2 1 College of Applied Sciences, Beijing University of Technology, Beijing 100124, China; [email protected] (Z.Z.); [email protected] (W.C.) 2 Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, USA; [email protected] * Correspondence: [email protected] Received: 1 April 2019; Accepted: 30 April 2019 ; Published: 7 May 2019 Abstract: Techniques used to analyze exceedances over a high threshold are in great demand for research in economics, environmental science, and other fields. The generalized Pareto distribution (GPD) has been widely used to fit observations exceeding the tail threshold in the peaks over threshold (POT) framework. Parameter estimation and threshold selection are two critical issues for threshold-based GPD inference. In this work, we propose a new GPD-based estimation approach by combining the method of moments and likelihood moment techniques based on the least squares concept, in which the shape and scale parameters of the GPD can be simultaneously estimated. To analyze extreme data, the proposed approach estimates the parameters by minimizing the sum of squared deviations between the theoretical GPD function and its expectation. Additionally, we introduce a recently developed stopping rule to choose the suitable threshold above which the GPD asymptotically fits the exceedances. Simulation studies show that the proposed approach performs better or similar to existing approaches, in terms of bias and the mean square error, in estimating the shape parameter.
    [Show full text]
  • Independent Approximates Enable Closed-Form Estimation of Heavy-Tailed Distributions
    Independent Approximates enable closed-form estimation of heavy-tailed distributions Kenric P. Nelson Abstract – Independent Approximates (IAs) are proven to enable a closed-form estimation of heavy-tailed distributions with an analytical density such as the generalized Pareto and Student’s t distributions. A broader proof using convolution of the characteristic function is described for future research. (IAs) are selected from independent, identically distributed samples by partitioning the samples into groups of size n and retaining the median of the samples in those groups which have approximately equal samples. The marginal distribution along the diagonal of equal values has a density proportional to the nth power of the original density. This nth-power-density, which the IAs approximate, has faster tail decay enabling closed-form estimation of its moments and retains a functional relationship with the original density. Computational experiments with between 1000 to 100,000 Student’s t samples are reported for over a range of location, scale, and shape (inverse of degree of freedom) parameters. IA pairs are used to estimate the location, IA triplets for the scale, and the geometric mean of the original samples for the shape. With 10,000 samples the relative bias of the parameter estimates is less than 0.01 and a relative precision is less than ±0.1. The theoretical bias is zero for the location and the finite bias for the scale can be subtracted out. The theoretical precision has a finite range when the shape is less than 2 for the location estimate and less than 3/2 for the scale estimate.
    [Show full text]
  • ACTS 4304 FORMULA SUMMARY Lesson 1: Basic Probability Summary of Probability Concepts Probability Functions
    ACTS 4304 FORMULA SUMMARY Lesson 1: Basic Probability Summary of Probability Concepts Probability Functions F (x) = P r(X ≤ x) S(x) = 1 − F (x) dF (x) f(x) = dx H(x) = − ln S(x) dH(x) f(x) h(x) = = dx S(x) Functions of random variables Z 1 Expected Value E[g(x)] = g(x)f(x)dx −∞ 0 n n-th raw moment µn = E[X ] n n-th central moment µn = E[(X − µ) ] Variance σ2 = E[(X − µ)2] = E[X2] − µ2 µ µ0 − 3µ0 µ + 2µ3 Skewness γ = 3 = 3 2 1 σ3 σ3 µ µ0 − 4µ0 µ + 6µ0 µ2 − 3µ4 Kurtosis γ = 4 = 4 3 2 2 σ4 σ4 Moment generating function M(t) = E[etX ] Probability generating function P (z) = E[zX ] More concepts • Standard deviation (σ) is positive square root of variance • Coefficient of variation is CV = σ/µ • 100p-th percentile π is any point satisfying F (π−) ≤ p and F (π) ≥ p. If F is continuous, it is the unique point satisfying F (π) = p • Median is 50-th percentile; n-th quartile is 25n-th percentile • Mode is x which maximizes f(x) (n) n (n) • MX (0) = E[X ], where M is the n-th derivative (n) PX (0) • n! = P r(X = n) (n) • PX (1) is the n-th factorial moment of X. Bayes' Theorem P r(BjA)P r(A) P r(AjB) = P r(B) fY (yjx)fX (x) fX (xjy) = fY (y) Law of total probability 2 If Bi is a set of exhaustive (in other words, P r([iBi) = 1) and mutually exclusive (in other words P r(Bi \ Bj) = 0 for i 6= j) events, then for any event A, X X P r(A) = P r(A \ Bi) = P r(Bi)P r(AjBi) i i Correspondingly, for continuous distributions, Z P r(A) = P r(Ajx)f(x)dx Conditional Expectation Formula EX [X] = EY [EX [XjY ]] 3 Lesson 2: Parametric Distributions Forms of probability
    [Show full text]
  • Fixed-K Asymptotic Inference About Tail Properties
    Journal of the American Statistical Association ISSN: 0162-1459 (Print) 1537-274X (Online) Journal homepage: http://www.tandfonline.com/loi/uasa20 Fixed-k Asymptotic Inference About Tail Properties Ulrich K. Müller & Yulong Wang To cite this article: Ulrich K. Müller & Yulong Wang (2017) Fixed-k Asymptotic Inference About Tail Properties, Journal of the American Statistical Association, 112:519, 1334-1343, DOI: 10.1080/01621459.2016.1215990 To link to this article: https://doi.org/10.1080/01621459.2016.1215990 Accepted author version posted online: 12 Aug 2016. Published online: 13 Jun 2017. Submit your article to this journal Article views: 206 View related articles View Crossmark data Full Terms & Conditions of access and use can be found at http://www.tandfonline.com/action/journalInformation?journalCode=uasa20 JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION , VOL. , NO. , –, Theory and Methods https://doi.org/./.. Fixed-k Asymptotic Inference About Tail Properties Ulrich K. Müller and Yulong Wang Department of Economics, Princeton University, Princeton, NJ ABSTRACT ARTICLE HISTORY We consider inference about tail properties of a distribution from an iid sample, based on extreme value the- Received February ory. All of the numerous previous suggestions rely on asymptotics where eventually, an infinite number of Accepted July observations from the tail behave as predicted by extreme value theory, enabling the consistent estimation KEYWORDS of the key tail index, and the construction of confidence intervals using the delta method or other classic Extreme quantiles; tail approaches. In small samples, however, extreme value theory might well provide good approximations for conditional expectations; only a relatively small number of tail observations.
    [Show full text]
  • Package 'Renext'
    Package ‘Renext’ July 4, 2016 Type Package Title Renewal Method for Extreme Values Extrapolation Version 3.1-0 Date 2016-07-01 Author Yves Deville <[email protected]>, IRSN <[email protected]> Maintainer Lise Bardet <[email protected]> Depends R (>= 2.8.0), stats, graphics, evd Imports numDeriv, splines, methods Suggests MASS, ismev, XML Description Peaks Over Threshold (POT) or 'methode du renouvellement'. The distribution for the ex- ceedances can be chosen, and heterogeneous data (including histori- cal data or block data) can be used in a Maximum-Likelihood framework. License GPL (>= 2) LazyData yes NeedsCompilation no Repository CRAN Date/Publication 2016-07-04 20:29:04 R topics documented: Renext-package . .3 anova.Renouv . .5 barplotRenouv . .6 Brest . .8 Brest.years . 10 Brest.years.missing . 11 CV2............................................. 11 CV2.test . 12 Dunkerque . 13 EM.mixexp . 15 expplot . 16 1 2 R topics documented: fgamma . 17 fGEV.MAX . 19 fGPD ............................................ 22 flomax . 24 fmaxlo . 26 fweibull . 29 Garonne . 30 gev2Ren . 32 gof.date . 34 gofExp.test . 37 GPD............................................. 38 gumbel2Ren . 40 Hpoints . 41 ini.mixexp2 . 42 interevt . 43 Jackson . 45 Jackson.test . 46 logLik.Renouv . 47 Lomax . 48 LRExp . 50 LRExp.test . 51 LRGumbel . 53 LRGumbel.test . 54 Maxlo . 55 MixExp2 . 57 mom.mixexp2 . 58 mom2par . 60 NBlevy . 61 OT2MAX . 63 OTjitter . 66 parDeriv . 67 parIni.MAX . 70 pGreenwood1 . 72 plot.Rendata . 73 plot.Renouv . 74 PPplot . 78 predict.Renouv . 79 qStat . 80 readXML . 82 Ren2gev . 84 Ren2gumbel . 85 Renouv . 87 RenouvNoEst . 94 RLlegend . 96 RLpar . 98 RLplot . 99 roundPred . 101 rRendata . 102 Renext-package 3 SandT . 105 skip2noskip .
    [Show full text]
  • Estimation of Pareto Distribution Functions from Samples Contaminated by Measurement Errors
    UNIVERSITY of the WESTERN CAPE Estimation of Pareto Distribution Functions from Samples Contaminated by Measurement Errors Lwando Orbet Kondlo A thesis submitted in partial fulfillment of the requirements for the degree of Magister Scientiae in Statistics in the Faculty of Natural Sciences at the University of the Western Cape. Supervisor : Professor Chris Koen March 1, 2010 Keywords Deconvolution Distribution functions Error-Contaminated samples Errors-in-variables Jackknife Maximum likelihood method Measurement errors Nonparametric estimation Pareto distribution ABSTRACT MSc Statistics Thesis, Department of Statistics, University of the Western Cape. Estimation of population distributions, from samples which are contaminated by measurement errors, is a common problem. This study considers the prob- lem of estimating the population distribution of independent random variables Xj, from error-contaminated samples Yj (j = 1, . , n) such that Yj = Xj +j, where is the measurement error, which is assumed independent of X. The measurement error is also assumed to be normally distributed. Since the observed distribution function is a convolution of the error distribution with the true underlying distribution, estimation of the latter is often referred to as a deconvolution problem. A thorough study of the relevant deconvolution literature in statistics is reported. We also deal with the specific case when X is assumed to follow a truncated Pareto form. If observations are subject to Gaussian errors, then the observed Y is distributed as the convolution of the finite-support Pareto and Gaus- sian error distributions. The convolved probability density function (PDF) and cumulative distribution function (CDF) of the finite-support Pareto and Gaussian distributions are derived. The intention is to draw more specific connections between certain deconvolu- tion methods and also to demonstrate the application of the statistical theory of estimation in the presence of measurement error.
    [Show full text]
  • Handbook on Probability Distributions
    R powered R-forge project Handbook on probability distributions R-forge distributions Core Team University Year 2009-2010 LATEXpowered Mac OS' TeXShop edited Contents Introduction 4 I Discrete distributions 6 1 Classic discrete distribution 7 2 Not so-common discrete distribution 27 II Continuous distributions 34 3 Finite support distribution 35 4 The Gaussian family 47 5 Exponential distribution and its extensions 56 6 Chi-squared's ditribution and related extensions 75 7 Student and related distributions 84 8 Pareto family 88 9 Logistic distribution and related extensions 108 10 Extrem Value Theory distributions 111 3 4 CONTENTS III Multivariate and generalized distributions 116 11 Generalization of common distributions 117 12 Multivariate distributions 133 13 Misc 135 Conclusion 137 Bibliography 137 A Mathematical tools 141 Introduction This guide is intended to provide a quite exhaustive (at least as I can) view on probability distri- butions. It is constructed in chapters of distribution family with a section for each distribution. Each section focuses on the tryptic: definition - estimation - application. Ultimate bibles for probability distributions are Wimmer & Altmann (1999) which lists 750 univariate discrete distributions and Johnson et al. (1994) which details continuous distributions. In the appendix, we recall the basics of probability distributions as well as \common" mathe- matical functions, cf. section A.2. And for all distribution, we use the following notations • X a random variable following a given distribution, • x a realization of this random variable, • f the density function (if it exists), • F the (cumulative) distribution function, • P (X = k) the mass probability function in k, • M the moment generating function (if it exists), • G the probability generating function (if it exists), • φ the characteristic function (if it exists), Finally all graphics are done the open source statistical software R and its numerous packages available on the Comprehensive R Archive Network (CRAN∗).
    [Show full text]
  • Theoretical Properties of the Weighted Feller-Pareto Distributions." Asian Journal of Mathematics and Applications, 2014: 1-12
    CORE Metadata, citation and similar papers at core.ac.uk Provided by Georgia Southern University: Digital Commons@Georgia Southern Georgia Southern University Digital Commons@Georgia Southern Mathematical Sciences Faculty Publications Mathematical Sciences, Department of 2014 Theoretical Properties of the Weighted Feller- Pareto Distributions Oluseyi Odubote Georgia Southern University Broderick O. Oluyede Georgia Southern University, [email protected] Follow this and additional works at: https://digitalcommons.georgiasouthern.edu/math-sci-facpubs Part of the Mathematics Commons Recommended Citation Odubote, Oluseyi, Broderick O. Oluyede. 2014. "Theoretical Properties of the Weighted Feller-Pareto Distributions." Asian Journal of Mathematics and Applications, 2014: 1-12. source: http://scienceasia.asia/index.php/ama/article/view/173/ https://digitalcommons.georgiasouthern.edu/math-sci-facpubs/307 This article is brought to you for free and open access by the Mathematical Sciences, Department of at Digital Commons@Georgia Southern. It has been accepted for inclusion in Mathematical Sciences Faculty Publications by an authorized administrator of Digital Commons@Georgia Southern. For more information, please contact [email protected]. ASIAN JOURNAL OF MATHEMATICS AND APPLICATIONS Volume 2014, Article ID ama0173, 12 pages ISSN 2307-7743 http://scienceasia.asia THEORETICAL PROPERTIES OF THE WEIGHTED FELLER-PARETO AND RELATED DISTRIBUTIONS OLUSEYI ODUBOTE AND BRODERICK O. OLUYEDE Abstract. In this paper, for the first time, a new six-parameter class of distributions called weighted Feller-Pareto (WFP) and related family of distributions is proposed. This new class of distributions contains several other Pareto-type distributions such as length-biased (LB) Pareto, weighted Pareto (WP I, II, III, and IV), and Pareto (P I, II, III, and IV) distributions as special cases.
    [Show full text]
  • On Families of Generalized Pareto Distributions: Properties and Applications
    Journal of Data Science 377-396 , DOI: 10.6339/JDS.201804_16(2).0008 On Families of Generalized Pareto Distributions: Properties and Applications D. Hamed1 and F. Famoye2 and C. Lee2 1 Department of Mathematics, Winthrop University, Rock Hill, SC, USA; 2Department of Mathematics, Central Michigan University, Mount Pleasant, Michigan, USA In this paper, we introduce some new families of generalized Pareto distributions using the T-R{Y} framework. These families of distributions are named T-Pareto{Y} families, and they arise from the quantile functions of exponential, log-logistic, logistic, extreme value, Cauchy and Weibull distributions. The shapes of these T-Pareto families can be unimodal or bimodal, skewed to the left or skewed to the right with heavy tail. Some general properties of the T-Pareto{Y} family are investigated and these include the moments, modes, mean deviations from the mean and from the median, and Shannon entropy. Several new generalized Pareto distributions are also discussed. Four real data sets from engineering, biomedical and social science are analyzed to demonstrate the flexibility and usefulness of the T-Pareto{Y} families of distributions. Key words: Shannon entropy; quantile function; moment; T-X family. 1. Introduction The Pareto distribution is named after the well-known Italian-born Swiss sociologist and economist Vilfredo Pareto (1848-1923). Pareto [1] defined Pareto’s Law, which can be stated as N Axa , where N represents the number of persons having income x in a population. Pareto distribution is commonly used in modelling heavy tailed distributions, including but not limited to income, insurance and city size populations.
    [Show full text]
  • A New Generalization of the Pareto Distribution and Its Application to Insurance Data
    Journal of Risk and Financial Management Article A New Generalization of the Pareto Distribution and Its Application to Insurance Data Mohamed E. Ghitany 1,*, Emilio Gómez-Déniz 2 and Saralees Nadarajah 3 1 Department of Statistics and Operations Research, Faculty of Science, Kuwait University, Safat 13060, Kuwait 2 Department of Quantitative Methods and TiDES Institute, University of Las Palmas de Gran Canaria, 35017 Gran Canaria, Spain; [email protected] 3 School of Mathematics, University of Manchester, Manchester M13 9PL, UK; [email protected] * Correspondence: [email protected] Received: 26 November 2017; Accepted: 2 February 2018; Published: 7 February 2018 Abstract: The Pareto classical distribution is one of the most attractive in statistics and particularly in the scenario of actuarial statistics and finance. For example, it is widely used when calculating reinsurance premiums. In the last years, many alternative distributions have been proposed to obtain better adjustments especially when the tail of the empirical distribution of the data is very long. In this work, an alternative generalization of the Pareto distribution is proposed and its properties are studied. Finally, application of the proposed model to the earthquake insurance data set is presented. Keywords: gamma distribution; estimation; financial risk; fit; pareto distribution MSC: 62E10; 62F10; 62F15; 62P05 1. Introduction In general insurance, only a few large claims arising in the portfolio represent the largest part of the payments made by the insurance company. Appropriate estimation of these extreme events is crucial for the practitioner to correctly assess insurance and reinsurance premiums. On this subject, the single parameter Pareto distribution (Arnold 1983; Brazauskas and Serfling 2003; Rytgaard 1990), among others has been traditionally considered as a suitable claim size distribution in relation to rating problems.
    [Show full text]
  • Exploring Heavy Tails Pareto and Generalized Pareto Distributions
    Exploring Heavy Tails Pareto and Generalized Pareto Distributions September 25, 2019 This vignette is designed to give a short overview about Pareto Distributions and Generalized Pareto Distributions (GPD). We will work with the SPC.we data of our quantmod vignette. Therefore we have to reproduce the SPC.we data in exactly the same way as described the quantmod vignette. In financial data analysis stock indices as the S&P 500 index are typically analyzed by using the returns of the index. We use the log-returns > WSPLRet <- diff(log(SPC.we)) We start to analyze these by plotting a histogram > hist(WSPLRet) Histogram of WSPLRet 1000 800 600 Frequency 400 200 0 −0.20 −0.15 −0.10 −0.05 0.00 0.05 0.10 0.15 WSPLRet Figure 1: Histogram of the log-returns of the S&P 500 from 1960-01-04 to 2009-01-01. This histogram shows a unimodal distribution of values with the peak around 0, which nourishes the hypothesis that the log-returns are normally distributed. A very intuitive method to test this is the Q-Q plot. The slope of the (linear regression) line and its intercept determine the parameters of the corresponding Gaussian distribution. If the points are close to this line the empirical distribution of the sample can 1 > qqnorm(WSPLRet) > qqline(WSPLRet) Normal Q−Q Plot 0.10 0.05 0.00 −0.05 Sample Quantiles −0.10 −0.15 −0.20 −3 −2 −1 0 1 2 3 Theoretical Quantiles Figure 2: Q-Q plot of WSPLRet values.
    [Show full text]
  • The Pareto Distribution
    The Pareto Distribution Background Power Function Consider an arbitrary power function, x↦kx α where k is a constant and the exponent α gov- erns the relationship. Note that if y=kx α, then Log[y] = Log[k] +α Log[x]. That is, the logarith- mic transformation of this power function is linear in Log[x]. Another way to say this is that the elasticity of y with respect to x is constant: d Log[y] =α. Since the elasticity does not depend on d Log[x] the size of x, we sometimes say that the power relationship is scale invariant. Levels Log Transform 4 5 3 1 0.50 2 0.10 1 0.05 0.5 1.0 1.5 2.0 0.2 0.5 1 2 Two Power Functions Power Law A power law is a theoretical or empirical relationship governed by a power function. ◼ In geometry, the area of a regular polygon is proportional to the square of the length of a side. ◼ In physics, the gravitational attraction of two objects is inversely proportional to the square of their distance. ◼ In ecology, Taylor’s Law states that the variance of population density is a power-function of mean population density. ◼ In economics, Gabaix (1999) finds the population of cities follows a power law (with an inequality parameter close to 1 (see below). ◼ In economics, Luttmer (2007) finds the distribution of employment in US firms follows a power law with inequality parameter close to 1. 2 temp99.nb ◼ In economics and business, the Pareto Principle (or 80-20 rule) says that 80% of income accrues to the top 20% of income recipients.
    [Show full text]