Beta-Normal Distribution: Bimodality Properties and Application Felix Famoye Central Michigan University, [email protected]

Total Page:16

File Type:pdf, Size:1020Kb

Beta-Normal Distribution: Bimodality Properties and Application Felix Famoye Central Michigan University, Felix.Famoye@Cmich.Edu Journal of Modern Applied Statistical Methods Volume 3 | Issue 1 Article 10 5-1-2004 Beta-Normal Distribution: Bimodality Properties and Application Felix Famoye Central Michigan University, [email protected] Carl Lee Central Michigan University Nicholas Eugene Coppin State College Follow this and additional works at: http://digitalcommons.wayne.edu/jmasm Part of the Applied Statistics Commons, Social and Behavioral Sciences Commons, and the Statistical Theory Commons Recommended Citation Famoye, Felix; Lee, Carl; and Eugene, Nicholas (2004) "Beta-Normal Distribution: Bimodality Properties and Application," Journal of Modern Applied Statistical Methods: Vol. 3 : Iss. 1 , Article 10. DOI: 10.22237/jmasm/1083370200 Available at: http://digitalcommons.wayne.edu/jmasm/vol3/iss1/10 This Regular Article is brought to you for free and open access by the Open Access Journals at DigitalCommons@WayneState. It has been accepted for inclusion in Journal of Modern Applied Statistical Methods by an authorized editor of DigitalCommons@WayneState. Journal of Modern Applied Statistical Methods Copyright © 2004 JMASM, Inc. May, 2004, Vol. 3, No. 1, 85- 103 1538 – 9472/04/$95.00 Beta-Normal Distribution: Bimodality Properties and Application Felix Famoye Carl Lee Nicholas Eugene Department of Mathematics Department of Mathematics Computer Science & Mathematics Central Michigan University Central Michigan University Coppin State College The beta-normal distribution is characterized by four parameters that jointly describe the location, the scale and the shape properties. The beta-normal distribution can be unimodal or bimodal. This paper studies the bimodality properties of the beta-normal distribution. The region of bimodality in the parameter space is obtained. The beta-normal distribution is applied to fit a numerical bimodal data set. The beta-normal fits are compared with the fits of mixture-normal distribution through simulation. Key words: Bimodal region, percentiles, curve estimation, egg size distribution Introduction Let Fx() be the cumulative distribution Bimodal distributions occur in many function (CDF) of a random variable X. The areas of science. Withington et al. (2000), in cumulative distribution function for a their study of cardiopulmonary bypass in infants generalized class of distributions for the random showed that plasma vecuronium and vecuronium variable X can be defined as the logit of the beta clearance requirements have bimodal random variable given by distributions. They concluded that their findings on bimodal distributions for plasma vecuronium Γ()αβ+ Fx() Gx()= ∫ tαβ−−11 (1− t ) dt , 0<<αβ , ∞ . and vecuronium clearance requirements ΓΓ()()αβ0 highlight the need for individual monitoring of (1.1) neuromuscular blockade. Espinoza et al. (2001) discussed the importance of bimodal distributions in the study of size distribution of Eugene et al. (2002) considered Fx() as the metals in aerosols. Bimodal distributions also CDF of the normal distribution with parameters occur in the study of genetic diversity (Freeland µ and σ. Thus, the random variable X has the et al., 2000), in the study of agricultural farm beta-normal distribution with probability density size distribution (Wolf & Sumner, 2001), in the function (pdf) study of atmospheric pressure (Zangvil et al., 2001), and in the study of anabolic steroids on Γ()αβ+ xxx−−−µµµαβ−−11 gx()= ⎡⎤⎡⎤Φ−Φ ( ) 1 ( )σ−1φ ( ) animals (Isaacson, 2000). ΓΓ()()αβ⎣⎦⎣⎦⎢⎥⎢⎥σσσ (1.2) Acknowledgement ⎛⎞x − µ where φ ⎜⎟ is the normal pdf and The first two authors gratefully acknowledge the ⎝⎠σ support received from the President Research ⎛⎞x − µ Investment Fund at Central Michigan University Φ⎜⎟ is the normal CDF. We denote the under the Grant #65139. Address ⎝⎠σ correspondence to the first author at Department beta-normal distribution with parameters α, β, µ, of Mathematics, Central Michigan University Mt. Pleasant, Michigan 48859-0001. Email: and σ as BN (α, β, µ, σ). [email protected]. Telephone: (989) 774- The distribution in (1.2) may be 5497. symmetric, skewed to the left, or skewed to the 85 BETA-NORMAL DISTRIBUTION 86 right. The distribution may be unimodal or Corollary 1: If α = β and one mode of bimodal. Eugene et al. (2002) discussed the BN(αβ , , µ, σ) is atx , then the other mode shape properties of the unimodal beta-normal 0 distribution. Furthermore, they considered the is at the point 2µ − x0 . estimation of its parameters by the method of maximum likelihood. Proof: If BN(αβ , , µ, σ) is unimodal, then In the analysis of bimodal data, a mixture of two normal densities is often used as the only mode occurs at the point x0 = µ. For a model (e.g., Cobb et al., 1983). The mixture of bimodal case, we need to show that if we replace normal distribution is used as a model to analyze x0 with 2µ − x0 , then equation (2.1) remains bimodal data because the mixture of normal the same. When α = β , equation (2.1) becomes densities can take on bimodal shapes depending on the parameters of the distribution. x − µ Eisenberger (1964) showed how the parameters σφ()(1)0 α − σ x0 − µ x0 = xx−−µµ{}12− Φ().+ µ of a mixture of normal distributions determine Φ−Φ()[1()]00 σ its shape. When a mixture assumption is not σσ required or justified the beta-normal distribution (2.2) can serve as a model to analyze data since only one distribution has to be used and one less If x0 in (2.2) is replaced with 2µ − x0 , we parameter to estimate. obtain In the rest of the paper, we provide some bimodality properties of the beta-normal µ −x σφ()(1)0 α − distribution. We obtain the region of bimodality σ µ −x0 2()µµ−Φx0 = µµ−−xx{}12− + in the parameter space. We also illustrate the Φ−Φ()[1()]00σ application of beta-normal distribution to a σσ numerical data set that exhibits two modes and (2.3) compare the fit with mixture-normal distribution. A simulation study is conducted to ⎛⎞⎛⎞x00−−µµx By using φφ⎜⎟⎜⎟= and compare the performance between beta-normal ⎝⎠⎝⎠σσ and mixture-normal distributions in fitting bimodal data. ⎛⎞x00−−µµ ⎛⎞x Φ=−Φ⎜⎟1 ⎜⎟ in (2.3) and on ⎝⎠σσ ⎝⎠ Bimodality Properties simplification, we get the result in (2.2). In this section, some results on the bimodality properties of beta-normal distribution Corollary 2: If BN(αβ , , µ, σ) has a mode at are obtained. x , then BN(,βα , µ, σ) has a mode at 0 Fact: A mode of the BN(αβ , , µ, σ) is any 2µ − x0 . point xx00= (,αβ ) that satisfies Proof: We need to show that if we replace α xx−−µµ σφ()00(1)α− φ () σ with β, and 2µ − x withx , equation (2.1) σσ 0 0 x0 = xxx−−−µµµ{}2.−−αβ++µ 1(−Φ000 ) Φ ( )[1()] −Φ remains the same. Equation (2.1) can be written σσσas (2.1) x −µ σφ()0 σ x0 −µ Proof: Differentiating BN(αβ , , µ, σ) in x0 =+xx−−µµ{}(2−−Φαβ ) ( )+ ( α− 1)µ . Φ−Φ()[1()]00 σ (1.2) with respect to x, setting it equal to zero, σσ and solving for x gives the result in (2.1). (2.4) 87 FAMOYE, LEE, & EUGENE If x0 is replaced with 2µ − x0 and α is replaced Hence x0(,αβ ) is an increasing function of α ⎛⎞⎛⎞x00−−µµx and a decreasing function of β. with β in (2.4), using φφ⎜⎟⎜⎟= ⎝⎠⎝⎠σσ Eugene et al. (2002) showed that the ⎛⎞x00−−µµ ⎛⎞x and Φ=−Φ⎜⎟1 ⎜⎟, and on beta-normal distribution is symmetric about µ ⎝⎠σσ ⎝⎠ when α = β. From this result and corollary 3, the simplification, we obtain the result in (2.4). modal value is greater than µ if α > β. Also, the modal value is less than µ if α < β. The beta- Corollary 3: The modal point x0(,αβ ) is an normal distribution has a very distinct property increasing function of α and a decreasing in that it can be used to describe both bimodal and unimodal data. function of β. Region of Bimodality Proof: Differentiating the result in (2.1) with The beta-normal distribution becomes respect to α and β gives bimodal for certain values of the parameters α and β, and the analytical solution of α and β , x − µ where the distribution becomes bimodal, cannot σφ()0 ∂x0(,αβ ) σ be solved algebraically. A numerical solution is =>x − µ 0 ∂α Φ()0 obtained, however, by solving the number of σ roots of the derivative of BN(α, β, µ, σ ). Table 1 shows a grid of values where the distribution and is bimodal. The “2” in Table 1 indicates that the x − µ −σφ()0 beta-normal distribution has two turning points ∂x0(,αβ ) σ =<x − µ 0 . which implies that the distribution is bimodal ∂β 1(−Φ 0 ) σ and the “1” indicates that the beta-normal distribution has one turning point which implies that the distribution is unimodal. BETA-NORMAL DISTRIBUTION 88 Table 1. Number of turning points of BN(,αβ , 0, 1) for various values of α and β Beta .01 .02 .03 .04 .05 .06 .07 .08 .09 .10 .11 .12 .13 .14 .15 .16 .17 .18 .19 .02 .21 .22 Alpha .01 2 2 2 2 2 2222222222211 1 1 1 1 .02 2 2 2 2 2 2222222222211 1 1 1 1 .03 2 2 2 2 2 2222222222211 1 1 1 1 .04 2 2 2 2 2 2222222222211 1 1 1 1 .05 2 2 2 2 2 2222222222211 1 1 1 1 .06 2 2 2 2 2 2222222222211 1 1 1 1 .07 2 2 2 2 2 2222222222211 1 1 1 1 .08 2 2 2 2 2 2222222222221 1 1 1 1 .09 2 2 2 2 2 2222222222221 1 1 1 1 .1 2 2 2 2 2 2222222222221 1 1 1 1 .11 2 2 2 2 2 2222222222221 1 1 1 1 .12 2 2 2 2 2 2222222222221 1 1 1 1 .13 2 2 2 2 2 2222222222222 1 1 1 1 .14 2 2 2 2 2 2222222222222 1 1 1 1 .15 2 2 2 2 2 2222222222222 1 1 1 1 .16 2 2 2 2 2 2222222222222 1 1 1 1 .17 1 1 1 1 1 1122222222222 2 1 1 1 .18 1 1 1 1 1 1111111222222 2 1 1 1 .19 1 1 1 1 1 1111111111122 2 2 1 1 .2 1 1 1 1 1 1111111111111 2 2 1 1 .21 1 1 1 1 1 1111111111111 1 1 2 1 .22 1 1 1 1 1 1111111111111 1 1 1 1 .23 1 1 1 1 1 1111111111111 1 1 1 1 .24 1 1 1 1 1 1111111111111 1 1 1 1 .25 1 1 1 1 1 1111111111111 1 1 1 1 .26 1 1 1 1 1 1111111111111 1 1 1 1 Note: “2” indicates where bimodality occurs and “1” indicates where unimodality occurs Numerically, the largest value of α or β that gives bimodal property is approximately 0.214.
Recommended publications
  • Exploring Bimodality in Introductory Computer Science Performance Distributions
    EURASIA Journal of Mathematics, Science and Technology Education, 2018, 14(10), em1591 ISSN:1305-8223 (online) OPEN ACCESS Research Paper https://doi.org/10.29333/ejmste/93190 Exploring Bimodality in Introductory Computer Science Performance Distributions Ram B Basnet 1, Lori K Payne 1, Tenzin Doleck 2*, David John Lemay 2, Paul Bazelais 2 1 Colorado Mesa University, Grand Junction, Colorado, USA 2 McGill University, Montreal, Quebec, CANADA Received 7 April 2018 ▪ Revised 19 June 2018 ▪ Accepted 21 June 2018 ABSTRACT This study examines student performance distributions evidence bimodality, or whether there are two distinct populations in three introductory computer science courses grades at a four-year southwestern university in the United States for the period 2014-2017. Results suggest that computer science course grades are not bimodal. These findings counter the double hump assertion and suggest that proper course sequencing can address the needs of students with varying levels of prior knowledge and obviate the double-hump phenomenon. Studying performance helps to improve delivery of introductory computer science courses by ensuring that courses are aligned with student needs and address preconceptions and prior knowledge and experience. Keywords: computer science performance, coding, programming, double hump, grade distribution, bimodal distribution, unimodal distribution BACKGROUND In recent years, there has been a resurgence of interest in the practice of coding (Kafai & Burke, 2013), with many pushing for making it a core competency for students (Lye & Koh, 2014). There are inherent challenges in learning to code evidenced by high failure and dropout rates in programming courses (Ma, Ferguson, Roper, & Wood, 2011; (Qian & Lehman, 2017; Robins, 2010).
    [Show full text]
  • Recombining Partitions Via Unimodality Tests
    Working Paper 13-07 Departamento de Estadística Statistics and Econometrics Series (06) Universidad Carlos III de Madrid March 2013 Calle Madrid, 126 28903 Getafe (Spain) Fax (34) 91 624-98-48 RECOMBINING PARTITIONS VIA UNIMODALITY TESTS Adolfo Álvarez (1) , Daniel Peña(2) Abstract In this article we propose a recombination procedure for previously split data. It is based on the study of modes in the density of the data, since departing from unimodality can be a sign of the presence of clusters. We develop an algorithm that integrates a splitting process inherited from the SAR algorithm (Peña et al., 2004) with unimodality tests such as the dip test proposed by Hartigan and Hartigan (1985), and finally, we use a network configuration to visualize the results. We show that this can be a useful tool to detect heterogeneity in the data, but limited to univariate data because of the nature of the dip test. In a second stage we discuss the use of multivariate mode detection tests to avoid dimensionality reduction techniques such as projecting multivariate data into one dimension. The results of the application of multivariate unimodality tests show that is possible to detect the cluster structure of the data, although more research can be oriented to estimate the proper fine-tuning of some parameters of the test for a given dataset or distribution. Keywords: Cluster analysis, unimodality, dip test. (1) Álvarez, Adolfo, Departamento de Estadística, Universidad Carlos III de Madrid, C/ Madrid 126, 28903 Getafe (Madrid), Spain, e-mail: [email protected] . (2) Peña, Daniel, Departamento de Estadística, Universidad Carlos III de Madrid, C/ Madrid 126, 28903 Getafe (Madrid), Spain, e-mail: [email protected] .
    [Show full text]
  • UNIMODALITY and the DIP STATISTIC; HANDOUT I 1. Unimodality Definition. a Probability Density F Defined on the Real Line R Is C
    UNIMODALITY AND THE DIP STATISTIC; HANDOUT I 1. Unimodality Definition. A probability density f defined on the real line R is called unimodal if the following hold: (i) f is nondecreasing on the half-line ( ,b) for some real b; (ii) f is nonincreasing on the half-line (−∞a, + ) for some real a; (iii) For the largest possible b in (i), which∞ exists, and the smallest possible a in (ii), which also exists, we have a b. ≤ Here a largest possible b exists since if f is nondecreasing on ( ,bk) −∞ for all k and bk b then f is also nondecreasing on ( ,b). Also b < + since f is↑ a probability density. Likewise a smallest−∞ possible a exists∞ and is finite. In the definition of unimodal density, if a = b it is called the mode of f. It is possible that f is undefined or + there, or that f(x) + as x a and/or as x a. ∞ ↑ ∞ If a<b↑ then f is constant↓ on the interval (a,b) and we can take it to be constant on [a,b], which will be called the interval of modes of f. Any x [a,b] will be called a mode of f. When a = b the interval of modes∈ reduces to the singleton a . { } Examples. Each normal distribution N(µ,σ2) has a unimodal density with a unique mode at µ. Each uniform U[a,b] distribution has a unimodal density with an interval of modes equal to [a,b]. Consider a−1 −x the gamma density γa(x) = x e /Γ(a) for x > 0 and 0 for x 0, where the shape parameter a > 0.
    [Show full text]
  • Field Guide to Continuous Probability Distributions
    Field Guide to Continuous Probability Distributions Gavin E. Crooks v 1.0.0 2019 G. E. Crooks – Field Guide to Probability Distributions v 1.0.0 Copyright © 2010-2019 Gavin E. Crooks ISBN: 978-1-7339381-0-5 http://threeplusone.com/fieldguide Berkeley Institute for Theoretical Sciences (BITS) typeset on 2019-04-10 with XeTeX version 0.99999 fonts: Trump Mediaeval (text), Euler (math) 271828182845904 2 G. E. Crooks – Field Guide to Probability Distributions Preface: The search for GUD A common problem is that of describing the probability distribution of a single, continuous variable. A few distributions, such as the normal and exponential, were discovered in the 1800’s or earlier. But about a century ago the great statistician, Karl Pearson, realized that the known probabil- ity distributions were not sufficient to handle all of the phenomena then under investigation, and set out to create new distributions with useful properties. During the 20th century this process continued with abandon and a vast menagerie of distinct mathematical forms were discovered and invented, investigated, analyzed, rediscovered and renamed, all for the purpose of de- scribing the probability of some interesting variable. There are hundreds of named distributions and synonyms in current usage. The apparent diver- sity is unending and disorienting. Fortunately, the situation is less confused than it might at first appear. Most common, continuous, univariate, unimodal distributions can be orga- nized into a small number of distinct families, which are all special cases of a single Grand Unified Distribution. This compendium details these hun- dred or so simple distributions, their properties and their interrelations.
    [Show full text]
  • On Multivariate Unimodal Distributions
    ON MULTIVARIATE UNIMODAL DISTRIBUTIONS By Tao Dai B. Sc. (Mathematics), Wuhan University, 1982 A THESIS SUBMITTED IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF SCIENCE in THE FACULTY OF GRADUATE STUDIES DEPARTMENT OF STATISTICS We accept this thesis as conforming to the required standard THE UNIVERSITY OF BRITISH COLUMBIA June 1989 © Tao Dai, 1989 In presenting this thesis in partial fulfilment of the requirements for an advanced degree at the University of British Columbia, I agree that the Library shall make it freely available for reference and study. I further agree that permission for extensive copying of this thesis for scholarly purposes may be granted by the head of my department or by his or her representatives. It is understood that copying or publication of this thesis for financial gain shall not be allowed without my written permission. Department of £ tasW> tj-v*> The University of British Columbia Vancouver, Canada Date ./ v^y^J. DE-6 (2/88) ABSTRACT In this thesis, Kanter's representation of multivariate unimodal distributions is shown equivalent to the usual mixture of uniform distributions on symmetric, compact and convex sets. Kanter's idea is utilized in several contexts by viewing multivariate distributions as mixtures of uniform distributions on sets of various shapes. This provides a unifying viewpoint of what is in the literature and .gives some important new classes of multivariate unimodal distributions. The closure properties of these new classes under convolution, marginality and weak convergence, etc. and their relationships with other notions of multivariate unimodality are discussed. Some interesting examples and their 2- or 3-dimensional pictures are presented.
    [Show full text]
  • Large Time Unimodality for Classical and Free Brownian Motions with Initial Distributions
    ALEA, Lat. Am. J. Probab. Math. Stat. 15, 353–374 (2018) DOI: 10.30757/ALEA.v15-15 Large time unimodality for classical and free Brownian motions with initial distributions Takahiro Hasebe and Yuki Ueda Department of Mathematics, Hokkaido University, Kita 10, Nishi 8, Kita-Ku, Sapporo, Hokkaido, 060-0810, Japan E-mail address: [email protected] URL: https://www.math.sci.hokudai.ac.jp/~thasebe/ Department of Mathematics, Hokkaido University, Kita 10, Nishi 8, Kita-Ku, Sapporo, Hokkaido, 060-0810, Japan E-mail address: [email protected] Abstract. We prove that classical and free Brownian motions with initial distri- butions are unimodal for sufficiently large time, under some assumption on the initial distributions. The assumption is almost optimal in some sense. Similar re- sults are shown for a symmetric stable process with index 1 and a positive stable process with index 1/2. We also prove that free Brownian motion with initial sym- metric unimodal distribution is unimodal, and discuss strong unimodality for free convolution. 1. Introduction A Borel measure µ on R is unimodal if there exist a R and a function f : R [0, ) which is non-decreasing on ( ,a) and non-increasing∈ on (a, ), such that→ ∞ −∞ ∞ µ(dx)= µ( a )δ + f(x) dx. (1.1) { } a The most outstanding result on unimodality is Yamazato’s theorem (Yamazato, 1978) saying that all classical selfdecomposable distributions are unimodal. After this result, in Hasebe and Thorbjørnsen (2016), Hasebe and Thorbjørnsen proved the free analog of Yamazato’s result: all freely selfdecomposable distributions are unimodal.
    [Show full text]
  • Nber Working Paper Series the Distribution of the Size
    NBER WORKING PAPER SERIES THE DISTRIBUTION OF THE SIZE OF PRICE CHANGES Alberto Cavallo Roberto Rigobon Working Paper 16760 http://www.nber.org/papers/w16760 NATIONAL BUREAU OF ECONOMIC RESEARCH 1050 Massachusetts Avenue Cambridge, MA 02138 February 2011 We are grateful to Alberto Alesina, Emmanuel Farhi, Oleksiy Kryvtsov, Greg Mankiw, Dan Sichel, and seminar participants at Harvard, MIT, NBER, AEA, BLS, ITAM, Oxford, Queen’s University at Belfast, and the Central Bank of Norway for helpful comments and suggestions. All remaining errors are ours. The views expressed herein are those of the authors and do not necessarily reflect the views of the National Bureau of Economic Research. NBER working papers are circulated for discussion and comment purposes. They have not been peer- reviewed or been subject to the review by the NBER Board of Directors that accompanies official NBER publications. © 2011 by Alberto Cavallo and Roberto Rigobon. All rights reserved. Short sections of text, not to exceed two paragraphs, may be quoted without explicit permission provided that full credit, including © notice, is given to the source. The Distribution of the Size of Price Changes Alberto Cavallo and Roberto Rigobon NBER Working Paper No. 16760 February 2011 JEL No. E00,E3 ABSTRACT Different theories of price stickiness have distinct implications on the properties of the distribution of price changes. One of those characteristics is the number of modes in the distribution. We formally test for the number of modes in the price change distribution of 32 supermarkets, spanning 23 countries and 5 continents. We present results for three modality tests: the two best-known tests in the statistical literature, Hartigan’s Dip and Silverman’s Bandwith, and a test designed in this paper, called the Proportional Mass test (PM).
    [Show full text]
  • The Basic Distributional Theory for the Product of Zero Mean Correlated
    The basic distributional theory for the product of zero mean correlated normal random variables Robert E. Gaunt∗ Abstract The product of two zero mean correlated normal random variables has received much attention in the statistics literature and appears in many application areas. However, many important distributional properties are yet to tabulated. This paper fills this gap by providing the basic distributional theory for the product of two zero mean correlated normal random variables. Keywords: Product of correlated normal random variables; distributional theory AMS 2010 Subject Classification: Primary 60E05; 62E15 1 Introduction 2 2 Let (X,Y ) be a bivariate normal random vector with zero mean vector, variances (σX , σY ) and correlation coefficient ρ. The distribution of the product Z = XY arises in nu- merous applications, with recent examples including statistics of Lagrangian power in two-dimensional turbulence (Bandi and Connaughton [3]), product confidence limits for indirect effects (MacKinnon et al. [28]) and statistical mediation analysis (MacKinnon −1 [27]). The mean Zn = n (Z1 + Z2 + + Zn), where Z1,Z2,...,Zn are independent and identical copies of Z, has also found··· application in areas such as electrical engineering (Ware and Lad [42]), astrophysics (Mangilli, Plaszczynski and Tristram [29], Watts et al. [43]) and quantum cosmology (Grishchuk [18]). The exact distribution of the product Z = XY has been studied since 1936 (Craig arXiv:2106.02897v1 [math.ST] 5 Jun 2021 [5]), with subsequent contributions coming from Aroian [1], Aroian, Taneja and Cornwell [2], Bandi and Connaughton [3], Haldane [20], Meeker et al. [31]; see Nadarajah and Pog´any [32] for an overview of these and further contributions.
    [Show full text]
  • Distributionally Robust Chance Constrained Optimal Power Flow Assuming Log-Concave Distributions
    Distributionally Robust Chance Constrained Optimal Power Flow Assuming Log-Concave Distributions B. Li 1, R. Jiang 2 and J.L. Mathieu 1 1EECS, University of Michigan 2IOE, University of Michigan November 2016 Bowen Li University of Michigan November 2016 1 / 17 Background: Chance Constrained Optimization Widely used in different problems with uncertainty But hard to accurately estimate uncertainty distributions with historical data Example: Physical constraints should be satisfied for most of the possible realizations of renewable/load uncertainty in power systems planning problems Solving methodologies Scenario approach: large number of scenarios/constraints Probabilistically robust method: overconservativeness Analytical reformulation: case dependent reliability General formulation P(L(x; ξ) ≤ 0) ≥ 1 − Bowen Li University of Michigan November 2016 2 / 17 Background: Distributionally Robust Optimization [1] Key properties A probabilistic constraint is satisfied within an ambiguity set of distributions The ambiguity set can be calibrated using the following information Moment-based information: mean, covariance, higher-order moments Density-based information: likelihood of a known probability density Distribution structure: unimodality, support, smoothness, symmetry Good trade-off between performance and computational tractability Example: distributionally robust chance constraint (DRCC) inf (L(x; ξ) ≤ 0) ≥ 1 − U P P2P U n T o P = Unimodal; EP(ξ) = µ, EP(ξξ ) = S [1] E. Delage and Y. Ye, \Distributionally Robust Optimization under Moment
    [Show full text]
  • Alpha-Skew Generalized T Distribution
    Revista Colombiana de Estadística July 2015, Volume 38, Issue 2, pp. 353 to 370 DOI: http://dx.doi.org/10.15446/rce.v38n2.51665 Alpha-Skew Generalized t Distribution Distribución t generalizada alfa sesgada Sukru Acitas1;a, Birdal Senoglu2;b, Olcay Arslan2;c 1Department of Statistics, Faculty of Science, Anadolu University, Eskisehir, Turkey 2Department of Statistics, Faculty of Science, Ankara University, Ankara, Turkey Abstract The alpha-skew normal (ASN) distribution has been proposed recently in the literature by using standard normal distribution and a skewing ap- proach. Although ASN distribution is able to model both skew and bimodal data, it is shortcoming when data has thinner or thicker tails than normal. Therefore, we propose an alpha-skew generalized t (ASGT) by using the gen- eralized t (GT) distribution and a new skewing procedure. From this point of view, ASGT can be seen as an alternative skew version of GT distribution. However, ASGT differs from the previous skew versions of GT distribution since it is able to model bimodal data sest as well as it nests most commonly used density functions. In this paper, moments and maximum likelihood estimation of the parameters of ASGT distribution are given. Skewness and kurtosis measures are derived based on the first four noncentral moments. The cumulative distribution function (cdf) of ASGT distribution is also ob- tained. In the application part of the study, two real life problems taken from the literature are modeled by using ASGT distribution. Key words: Bimodality, Kurtosis, Maximum Likelihood Estimation, Mod- eling, Skewness. Resumen La distribución normal alfa-sesgada (ASN por sus siglas en inglés) ha sido propuesta recientemente en la literatura mediante el uso de una distribución normal estándar y procedimientos de sesgo.
    [Show full text]
  • The Modes of a Mixture of Two Normal Distributions
    Silesian J. Pure Appl. Math. vol. 6, is. 1 (2016), 59–67 Grzegorz SITEK Department of Statistics, University of Economics in Katowice, Poland THE MODES OF A MIXTURE OF TWO NORMAL DISTRIBUTIONS Abstract. Mixture distributions arise naturally where a statistical pop- ulation contains two or more subpopulations. Finite mixture distributions refer to composite distributions constructed by mixing a number K of com- ponent distributions. The first account of mixture data being analyzed was documented by Pearson in 1894. We consider the distribution of a mix- ture of two normal distributions and investigate the conditions for which the distribution is bimodal. This paper presents a procedure for answer- ing the question of whether a mixture of two normal distributions which five known parameters µ1, µ2, σ1, σ2, p is unimodal or not. For finding the modes, a simple iterative procedure is given. This article presents the possibility of estimation of modes using biaverage. 1. Introduction Mixture models have been widely used in econometrics and social science, and the theories for mixture models have been well studied Lindsay (see [5]). The importance of the research for unimodality or bimodality in statistics have been described by Murphy (see [6]). Consider f(x, p)= pf (x)+(1 p)f (x) (1) 1 − 2 2010 Mathematics Subject Classification: 60E05, 62E99. Keywords: mixture of normal distributions, bimodal distributions, biaverage. Corresponding author: G. Sitek ([email protected]). Received: 25.07.2016. 60 G. Sitek where, for i =1, 2, 1 − − 2 2 (x µi) /2σi fi(x)= e σi√2π with 0 <p< 1. The function f(x, p) is the probability density function of a mixture of two normal distributions.
    [Show full text]
  • Distributionally Robust Chance Constrained Optimal Power
    1 Distributionally Robust Chance Constrained Optimal Power Flow Assuming Unimodal Distributions with Misspecified Modes Bowen Li, Student Member, IEEE, Ruiwei Jiang, Member, IEEE, and Johanna L. Mathieu, Member, IEEE Abstract—Chance constrained optimal power flow (CC-OPF) approximation [8], [9], which provides a priori guarantees formulations have been proposed to minimize operational costs on reliability, require the constraints to be satisfied over a while controlling the risk arising from uncertainties like renew- large number of uncertainty samples. The solutions from these able generation and load consumption. To solve CC-OPF, we often need access to the (true) joint probability distribution of approaches are usually overly conservative with high costs all uncertainties, which is rarely known in practice. A solution [7], [10]. Another popular approach is to assume that the based on a biased estimate of the distribution can result in uncertainties follow a parametric distribution such as Gaussian poor reliability. To overcome this challenge, recent work has [4], [5], [7]. The resulting CC-OPF is often easier to solve explored distributionally robust chance constraints, in which the but the solution may have low reliability unless the assumed chance constraints are satisfied over a family of distributions called the ambiguity set. Commonly, ambiguity sets are only probability distribution happens to be close to the true one. based on moment information (e.g., mean and covariance) of the As an alternative, distributionally robust chance constrained random variables; however, specifying additional characteristics (DRCC) OPF models do not depend on a single estimate of the random variables reduces conservatism and cost. Here, we of the probability distribution [10]–[17].
    [Show full text]