Extreme Events: the Amazing Weibull Distribution

Total Page:16

File Type:pdf, Size:1020Kb

Extreme Events: the Amazing Weibull Distribution Extreme events: the amazing Weibull distribution Jerzy Szulga Department of Mathematics and Statistics, Auburn University, AL 36849 Summary 1. Extreme probability distributions in eleven easy steps 2. Weibull distribution 3. The earthquakes: waiting for the Big One http://www.math.auburn.edu/~szulgje/talks/csu.pdf 1 There are two competing yet complementary avenues to develop mathemat- ical models of real life phenomena. The ¯rst approach refers to the laws of science, physics, logic, and deduction. In the second \black box approach", in spite of ungrasped causality one can still seek and ¯nd patterns in the data. In time, the discovered patterns will become rules, then laws, giving rise to a theory. Next, its laws will be either con¯rmed or contradicted by new observations, and the process will continue. We live in the extreme world, struggling with extreme forces of nature, atop of which come extremities of our own. Sometimes, their partial understanding is possible. We will focus on a probabilistic model that emerged in 1930s. The model may apply to a structure under the action of many random forces. Perhaps, the forces are too complicated to be described or simply unknown. The breaking point is reached when the maximum of these forces exceeds a certain level. Surprisingly, it can be proven that the behavior of the maximum is no longer arbitrary, for it will be constrained by just three types of probability distributions. One of the cases encompasses the Weibull distribution. On the other hand, the Weibull distribution is detected in many natural processes, time and time again, although the causes and reasons behind its presence remain foggy. I will present a mathematical outline of the extreme probabilities, and explain the role and properties of the Weibull distribution. Worldwide earthquake data will illustrate its connection to the real world. 2 From chaos to order A typical random noise 4 3 2 1 0 −1 −2 −3 0 100 200 300 400 500 600 700 800 900 1000 Figure 1: Independent random variables. 2 χ (6) binomial, sixes in 100 rolls of a die 25 30 20 15 20 10 10 5 0 0 0 200 400 600 800 1000 0 200 400 600 800 1000 geometric, waiting for the first six on a die Cauchy 200 30 150 20 100 50 10 0 0 −50 0 200 400 600 800 1000 0 200 400 600 800 1000 Figure 2: At the top of the chaotic behavior, maxima behave rather orderly. 3 History 1928 Ronald Fisher and Leonard Tippet distinguished only three types of limit distributions of the maxima of a sequence of i.i.d. random variables, subject to shift and scale. Two random variables X and Y are of the same type, if, within some a±ne transformation, they have the same distribution X =D aY + b; a > 0 1943 Boris Gnedenko proves that, if µ ¶ P max Xk · anx + bn ! P(Y · x); n ! 1; (1) k·n at the points of continuity of the limit distribution for some translation con- stants bn and scale constants an > 0, then the three types (excluding the trivial constant) are the distributions of the following random variables: ¯ ª¯, negative Weibull: Y = ¡V , ¡¯ ©¯, Fr¶echet, inverse Weibull: Y = V (2) ¤, Gumbel, log-Weibull: Y = ln V , where ¯ > 0 and V is an exponential random variable with the unit intensity. Gumbel's variable is but a cluster point of the Weibull variables: V ¯ ¡ 1 ln V = lim : ¯!0 ¯ 4 Weibull variables Frechet variables Gumbel variables 100 20 15 80 10 15 60 5 10 40 0 5 20 −5 0 0 −10 0 1 0 1 0 1 Weibull densities Frechet densities Gumbel densities 0 0 0 Figure 3: Three types of growth, three types of probability densities. 5 Karamata theory, the bliss and curse 1933 Jovan Karamata's concepts of variations of functions has had a great impact on the probability theory (see Feller's book). Consider a function F (t) in a vicinity of a point a which could be §1. The condition F (ct) ¯ lim = c (F 2 Va(¯) in short) t!a F (t) describes, accordingly: ¯ = 0: slow variation (like a logarithm); ¯nite ¯ 6= 0 regular variation (like a power); j¯j = 1 rapid variation (like an exponential). By simple algebra: 1 2 V (¡¯) () F (t) 2 V (¯) () F (1=s) 2 V (¯¡1) F (t) 1 1 0 The concepts are simple but many techniques are hard, cumbersome, and deep. One needs a great deal of real and complex analysis, function theory, measure theory, probability, etc. A typical presentation of the extreme value theory begins with Karamata. 6 Main issues 1. The proof of the extreme types theorem; 2. The choice of stabilizing constants; 3. The domains of attraction: which distributions will be attracted to the speci¯c type? The language: A. in terms of quantiles or inverse tails; B. in terms of tails or survival functions. Claim: \A" is easy. \B" is harder. The connection \A"$\B" is the hardest. We will outline 1{2{3, using the language\A". No external reference is needed, except for basic algebra, calculus, and ele- mentary probability and analysis. 7 All random variables are transforms of a uniform random variable Let U have the uniform distribution on [0; 1]. 1=¯ Example: V = ¡ ln U is exponential, W¯ = V is Weibull, etc. For the survival function G(x) = P(X > x), its right-continuous inverse: G¡1(u) =def inf f a : G(a) · u g =) X =D G¡1(U) Example. For a Weibull random variable X = W = V 1=¯; ¯ > 0, © ª G(w) = exp ¡w¯ ; w ¸ 0;G¡1(x) = (¡ ln x)1=¯; 0 < x · 1: 90 1 G(x) 0.5 G−1(x) 40 0 0 20 40 90 20 0 0 0.5 1 Figure 4: The survival (tail) function, and its inverse (quantile) function. Convergence of distributions D Xn ! X: P(Xn > x) ! P(X > x) at the continuity points of the limit distribution. Now: D ¡1 ¡1 ¡1 Xn ! X () Gn (u) ! G (u) at the continuity points of G : 8 Order statistics (k) Sort a ¯nite sequence (xk) to the ascending x(k) % or descending x &: (n) (1) x(1) = x = min xk; x = x(n) = max xk: k k For a sequence U1;:::;Un of independent uniform variables on [0; 1]: D 1=n (n) D D 1=n D ¡V=n U(n) = U ;U = 1 ¡ U(n) = 1 ¡ U = 1 ¡ e : (3) Indeed: µ ¶ h in ¡ ¢ n 1=n P U(n) · x = P max Uk · x = P(U · x) = x = P(U · x): k·n So, for X with the tail G(x) and its independent copies (Xk), we have ¡ ¢ ¡ ¢ (n) D ¡1 D ¡1 1=n X = G U(n) = G U ; ¡ ¢ ¡ ¢ (4) D ¡1 (n) D ¡1 ¡V=n X(n) = G U = G 1 ¡ e ; (U - uniform, V - exponential). The tempting simple replacement µ ¶ V V U (n) =D ; hence X =D G¡1 ; n (n) n is illegal! We have only a seemingly useless limit property: µ ¶ ³ ´ ³ ´n (n) x x x ¡x P U > = P min Uk > = 1 ¡ ! e = P(V > x): n k·n n n 9 Functions instead of probabilities We will examine the function g(x) = G¡1(1 ¡ e¡x) when x ! 0. Translation: Nondegenerate limit distributions in the the Extreme Values Theorem appear as distributions of Á(V ), where g(v=n) ¡ b Á(v) = lim n : (5) n!0 an De¯nition. Quantities © and Á are of the same type if © = Á ® + ¯; ® > 0; ¯; Step 1. \Types Lemma". The type is preserved in the limit. Let Án be a sequence of real functions on a set W that has at least two points and (an; bn) and (An;Bn) be two pairs of numerical sequences such that an > 0;An > 0. Suppose that there are non-constant functions on W , Á (v) ¡ b Á (v) ¡ A Á(v) = lim n n ; ©(v) = lim n n : n an n Bn Then a b ¡ B there exist numbers ® = lim n ; ¯ = lim n n (6) n An n An Conversely, if (6) holds and either Á or © exists, so exists the other. Under both assumptions, ©(v) = Á(v)® + ¯; v 2 W: Step 2. Let Á(v) exist everywhere, be right-continuous and non-constant. Then for every u > 0 there exist numbers ¯u; ®u > 0 such that Á(uv) = ®u Á(v) + ¯u; v > 0: 10 A±ne stability The multiplicative vs additive notation: f(x) = g(e¡x); x = ¡ ln u: (7) Let T ½ R be unbounded from above. Let f : (0; 1) ! R be nonconstant & nondecreasing. f is a±nely stable if for some functions a(t) > 0 and b(t) the limit exists f(x + y + t) ¡ b(t) L(a;b)(x) = lim (8) T 3t!1;y!1 a(y + t) The function Á(e¡x) (5) is a±nely stable by Step 2. © ª 1 Without this step, we would have only a poor set T = n . The limit along such T would yield a much weaker notion of a±ne stability, insu±cient for our purposes. For example, the integer value function f(x) = [x] would be stable in this weak sense for T = f n g, but only with bn = n; an = 1. 11 Immediate properties Step 3. All limit functions L(a;b) are of the same type. (Step 1, Types Lemma.) Step 4. There is a main form L = L(a0;b0) such that L(0) = 0;L(1) = 1: (use a±ne transformations: x0 7! 0; x1 7! 1;L(0) 7! 0;L(1) 7! 1) Step 5.
Recommended publications
  • A New Parameter Estimator for the Generalized Pareto Distribution Under the Peaks Over Threshold Framework
    mathematics Article A New Parameter Estimator for the Generalized Pareto Distribution under the Peaks over Threshold Framework Xu Zhao 1,*, Zhongxian Zhang 1, Weihu Cheng 1 and Pengyue Zhang 2 1 College of Applied Sciences, Beijing University of Technology, Beijing 100124, China; [email protected] (Z.Z.); [email protected] (W.C.) 2 Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, USA; [email protected] * Correspondence: [email protected] Received: 1 April 2019; Accepted: 30 April 2019 ; Published: 7 May 2019 Abstract: Techniques used to analyze exceedances over a high threshold are in great demand for research in economics, environmental science, and other fields. The generalized Pareto distribution (GPD) has been widely used to fit observations exceeding the tail threshold in the peaks over threshold (POT) framework. Parameter estimation and threshold selection are two critical issues for threshold-based GPD inference. In this work, we propose a new GPD-based estimation approach by combining the method of moments and likelihood moment techniques based on the least squares concept, in which the shape and scale parameters of the GPD can be simultaneously estimated. To analyze extreme data, the proposed approach estimates the parameters by minimizing the sum of squared deviations between the theoretical GPD function and its expectation. Additionally, we introduce a recently developed stopping rule to choose the suitable threshold above which the GPD asymptotically fits the exceedances. Simulation studies show that the proposed approach performs better or similar to existing approaches, in terms of bias and the mean square error, in estimating the shape parameter.
    [Show full text]
  • A Family of Skew-Normal Distributions for Modeling Proportions and Rates with Zeros/Ones Excess
    S S symmetry Article A Family of Skew-Normal Distributions for Modeling Proportions and Rates with Zeros/Ones Excess Guillermo Martínez-Flórez 1, Víctor Leiva 2,* , Emilio Gómez-Déniz 3 and Carolina Marchant 4 1 Departamento de Matemáticas y Estadística, Facultad de Ciencias Básicas, Universidad de Córdoba, Montería 14014, Colombia; [email protected] 2 Escuela de Ingeniería Industrial, Pontificia Universidad Católica de Valparaíso, 2362807 Valparaíso, Chile 3 Facultad de Economía, Empresa y Turismo, Universidad de Las Palmas de Gran Canaria and TIDES Institute, 35001 Canarias, Spain; [email protected] 4 Facultad de Ciencias Básicas, Universidad Católica del Maule, 3466706 Talca, Chile; [email protected] * Correspondence: [email protected] or [email protected] Received: 30 June 2020; Accepted: 19 August 2020; Published: 1 September 2020 Abstract: In this paper, we consider skew-normal distributions for constructing new a distribution which allows us to model proportions and rates with zero/one inflation as an alternative to the inflated beta distributions. The new distribution is a mixture between a Bernoulli distribution for explaining the zero/one excess and a censored skew-normal distribution for the continuous variable. The maximum likelihood method is used for parameter estimation. Observed and expected Fisher information matrices are derived to conduct likelihood-based inference in this new type skew-normal distribution. Given the flexibility of the new distributions, we are able to show, in real data scenarios, the good performance of our proposal. Keywords: beta distribution; centered skew-normal distribution; maximum-likelihood methods; Monte Carlo simulations; proportions; R software; rates; zero/one inflated data 1.
    [Show full text]
  • Procedures for Estimation of Weibull Parameters James W
    United States Department of Agriculture Procedures for Estimation of Weibull Parameters James W. Evans David E. Kretschmann David W. Green Forest Forest Products General Technical Report February Service Laboratory FPL–GTR–264 2019 Abstract Contents The primary purpose of this publication is to provide an 1 Introduction .................................................................. 1 overview of the information in the statistical literature on 2 Background .................................................................. 1 the different methods developed for fitting a Weibull distribution to an uncensored set of data and on any 3 Estimation Procedures .................................................. 1 comparisons between methods that have been studied in the 4 Historical Comparisons of Individual statistics literature. This should help the person using a Estimator Types ........................................................ 8 Weibull distribution to represent a data set realize some advantages and disadvantages of some basic methods. It 5 Other Methods of Estimating Parameters of should also help both in evaluating other studies using the Weibull Distribution .......................................... 11 different methods of Weibull parameter estimation and in 6 Discussion .................................................................. 12 discussions on American Society for Testing and Materials Standard D5457, which appears to allow a choice for the 7 Conclusion ................................................................
    [Show full text]
  • A New Weibull-X Family of Distributions: Properties, Characterizations and Applications Zubair Ahmad1* , M
    Ahmad et al. Journal of Statistical Distributions and Applications (2018) 5:5 https://doi.org/10.1186/s40488-018-0087-6 RESEARCH Open Access A new Weibull-X family of distributions: properties, characterizations and applications Zubair Ahmad1* , M. Elgarhy2 and G. G. Hamedani3 * Correspondence: [email protected] Abstract 1Department of Statistics, Quaid-i-Azam University 45320, We propose a new family of univariate distributions generated from the Weibull random Islamabad 44000, Pakistan variable, called a new Weibull-X family of distributions. Two special sub-models of the Full list of author information is proposed family are presented and the shapes of density and hazard functions are available at the end of the article investigated. General expressions for some statistical properties are discussed. For the new family, three useful characterizations based on truncated moments are presented. Three different methods to estimate the model parameters are discussed. Monti Carlo simulation study is conducted to evaluate the performances of these estimators. Finally, the importance of the new family is illustrated empirically via two real life applications. Keywords: Weibull distribution, T-X family, Moment, Characterizations, Order statistics, Estimation 1. Introduction In the field of reliability theory, modeling of lifetime data is very crucial. A number of statistical distributions such as Weibull, Pareto, Gompertz, linear failure rate, Rayleigh, Exonential etc., are available for modeling lifetime data. However, in many practical areas, these classical distributions do not provide adequate fit in modeling data, and there is a clear need for the extended version of these classical distributions. In this re- gard, serious attempts have been made to propose new families of continuous prob- ability distributions that extend the existing well-known distributions by adding additional parameter(s) to the model of the baseline random variable.
    [Show full text]
  • Package 'Renext'
    Package ‘Renext’ July 4, 2016 Type Package Title Renewal Method for Extreme Values Extrapolation Version 3.1-0 Date 2016-07-01 Author Yves Deville <[email protected]>, IRSN <[email protected]> Maintainer Lise Bardet <[email protected]> Depends R (>= 2.8.0), stats, graphics, evd Imports numDeriv, splines, methods Suggests MASS, ismev, XML Description Peaks Over Threshold (POT) or 'methode du renouvellement'. The distribution for the ex- ceedances can be chosen, and heterogeneous data (including histori- cal data or block data) can be used in a Maximum-Likelihood framework. License GPL (>= 2) LazyData yes NeedsCompilation no Repository CRAN Date/Publication 2016-07-04 20:29:04 R topics documented: Renext-package . .3 anova.Renouv . .5 barplotRenouv . .6 Brest . .8 Brest.years . 10 Brest.years.missing . 11 CV2............................................. 11 CV2.test . 12 Dunkerque . 13 EM.mixexp . 15 expplot . 16 1 2 R topics documented: fgamma . 17 fGEV.MAX . 19 fGPD ............................................ 22 flomax . 24 fmaxlo . 26 fweibull . 29 Garonne . 30 gev2Ren . 32 gof.date . 34 gofExp.test . 37 GPD............................................. 38 gumbel2Ren . 40 Hpoints . 41 ini.mixexp2 . 42 interevt . 43 Jackson . 45 Jackson.test . 46 logLik.Renouv . 47 Lomax . 48 LRExp . 50 LRExp.test . 51 LRGumbel . 53 LRGumbel.test . 54 Maxlo . 55 MixExp2 . 57 mom.mixexp2 . 58 mom2par . 60 NBlevy . 61 OT2MAX . 63 OTjitter . 66 parDeriv . 67 parIni.MAX . 70 pGreenwood1 . 72 plot.Rendata . 73 plot.Renouv . 74 PPplot . 78 predict.Renouv . 79 qStat . 80 readXML . 82 Ren2gev . 84 Ren2gumbel . 85 Renouv . 87 RenouvNoEst . 94 RLlegend . 96 RLpar . 98 RLplot . 99 roundPred . 101 rRendata . 102 Renext-package 3 SandT . 105 skip2noskip .
    [Show full text]
  • Hand-Book on STATISTICAL DISTRIBUTIONS for Experimentalists
    Internal Report SUF–PFY/96–01 Stockholm, 11 December 1996 1st revision, 31 October 1998 last modification 10 September 2007 Hand-book on STATISTICAL DISTRIBUTIONS for experimentalists by Christian Walck Particle Physics Group Fysikum University of Stockholm (e-mail: [email protected]) Contents 1 Introduction 1 1.1 Random Number Generation .............................. 1 2 Probability Density Functions 3 2.1 Introduction ........................................ 3 2.2 Moments ......................................... 3 2.2.1 Errors of Moments ................................ 4 2.3 Characteristic Function ................................. 4 2.4 Probability Generating Function ............................ 5 2.5 Cumulants ......................................... 6 2.6 Random Number Generation .............................. 7 2.6.1 Cumulative Technique .............................. 7 2.6.2 Accept-Reject technique ............................. 7 2.6.3 Composition Techniques ............................. 8 2.7 Multivariate Distributions ................................ 9 2.7.1 Multivariate Moments .............................. 9 2.7.2 Errors of Bivariate Moments .......................... 9 2.7.3 Joint Characteristic Function .......................... 10 2.7.4 Random Number Generation .......................... 11 3 Bernoulli Distribution 12 3.1 Introduction ........................................ 12 3.2 Relation to Other Distributions ............................. 12 4 Beta distribution 13 4.1 Introduction .......................................
    [Show full text]
  • Probability Distributions Used in Reliability Engineering
    Probability Distributions Used in Reliability Engineering Probability Distributions Used in Reliability Engineering Andrew N. O’Connor Mohammad Modarres Ali Mosleh Center for Risk and Reliability 0151 Glenn L Martin Hall University of Maryland College Park, Maryland Published by the Center for Risk and Reliability International Standard Book Number (ISBN): 978-0-9966468-1-9 Copyright © 2016 by the Center for Reliability Engineering University of Maryland, College Park, Maryland, USA All rights reserved. No part of this book may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or by any information storage and retrieval system, without permission in writing from The Center for Reliability Engineering, Reliability Engineering Program. The Center for Risk and Reliability University of Maryland College Park, Maryland 20742-7531 In memory of Willie Mae Webb This book is dedicated to the memory of Miss Willie Webb who passed away on April 10 2007 while working at the Center for Risk and Reliability at the University of Maryland (UMD). She initiated the concept of this book, as an aid for students conducting studies in Reliability Engineering at the University of Maryland. Upon passing, Willie bequeathed her belongings to fund a scholarship providing financial support to Reliability Engineering students at UMD. Preface Reliability Engineers are required to combine a practical understanding of science and engineering with statistics. The reliability engineer’s understanding of statistics is focused on the practical application of a wide variety of accepted statistical methods. Most reliability texts provide only a basic introduction to probability distributions or only provide a detailed reference to few distributions.
    [Show full text]
  • On Families of Generalized Pareto Distributions: Properties and Applications
    Journal of Data Science 377-396 , DOI: 10.6339/JDS.201804_16(2).0008 On Families of Generalized Pareto Distributions: Properties and Applications D. Hamed1 and F. Famoye2 and C. Lee2 1 Department of Mathematics, Winthrop University, Rock Hill, SC, USA; 2Department of Mathematics, Central Michigan University, Mount Pleasant, Michigan, USA In this paper, we introduce some new families of generalized Pareto distributions using the T-R{Y} framework. These families of distributions are named T-Pareto{Y} families, and they arise from the quantile functions of exponential, log-logistic, logistic, extreme value, Cauchy and Weibull distributions. The shapes of these T-Pareto families can be unimodal or bimodal, skewed to the left or skewed to the right with heavy tail. Some general properties of the T-Pareto{Y} family are investigated and these include the moments, modes, mean deviations from the mean and from the median, and Shannon entropy. Several new generalized Pareto distributions are also discussed. Four real data sets from engineering, biomedical and social science are analyzed to demonstrate the flexibility and usefulness of the T-Pareto{Y} families of distributions. Key words: Shannon entropy; quantile function; moment; T-X family. 1. Introduction The Pareto distribution is named after the well-known Italian-born Swiss sociologist and economist Vilfredo Pareto (1848-1923). Pareto [1] defined Pareto’s Law, which can be stated as N Axa , where N represents the number of persons having income x in a population. Pareto distribution is commonly used in modelling heavy tailed distributions, including but not limited to income, insurance and city size populations.
    [Show full text]
  • Discriminating Between the Weibull and Log-Normal Distributions
    Discriminating Between The Weibull and Log-Normal Distributions Debasis Kundu1 Anubhav Manglick2 Abstract Log-Normal and Weibull distributions are the most popular distributions for modeling skewed data. In this paper, we consider the ratio of the maximized likelihood in choosing between the two distributions. The asymptotic distribution of the logarithm of the maximized likeli- hood ratio has been obtained. It is observed that the asymptotic distribution is independent of the unknown parameters. The asymptotic distribution has been used to determine the minimum sample size required to discriminate between two families of distributions for a user speci¯ed probability of correct selection. We perform some numerical experiments to observe how the asymptotic methods work for di®erent sample sizes. It is observed that the asymptotic results work quite well even for small samples also. Two real data sets have been analyzed. Key Words and Phrases Asymptotic distributions; Likelihood ratio tests; Probability of correct selection; Location Scale Family. Address of Correspondence: Professor Debasis Kundu, Department of Mathematics, Indian Institute of Technology Kanpur, Pin 208016, INDIA. e-mail: [email protected]. 1 Department of Mathematics, Indian Institute of Technology Kanpur, Pin 208016, INDIA. 2 Faculty of Mathematics and Informatics, University of Passau, GERMANY. 1 1 Introduction: We address the following problem in this paper. Suppose an experimenter has observed n data points, say x1; : : : ; xn and he wants to use either two-parameter log-normal model or two parameter Weibull model, which one is preferable? It is well known that both the log-normal and Weibull models can be used quite e®ectively to analyze skewed data set.
    [Show full text]
  • Table of Contents
    The Student's SEMSTAT Author: Jack Prins Table of Contents Chapter 1........................................................................................................................................ 2 Chapter 2...................................................................................................................................... 12 Chapter 3...................................................................................................................................... 31 Chapter 4...................................................................................................................................... 45 Chapter 5...................................................................................................................................... 65 Chapter 6.................................................................................................................................... 126 Chapter 7.................................................................................................................................... 152 Chapter 8.................................................................................................................................... 170 Chapter 9.................................................................................................................................... 195 Chapter 9A................................................................................................................................. 223 Chapter 10.................................................................................................................................
    [Show full text]
  • FITTING WEIBULL and LOGNORMAL DISTRIBUTIONS to MEDIUM-DENSITY FIBERBOARD FIBER and WOOD PARTICLE LENGTH John Z
    FITTING WEIBULL AND LOGNORMAL DISTRIBUTIONS TO MEDIUM-DENSITY FIBERBOARD FIBER AND WOOD PARTICLE LENGTH John Z. Lu† Postdoctoral Research Associate Department of Wood Science and Engineering Oregon State University Corvallis, OR 97330 Charles J. Monlezun Associate Professor Department of Experimental Statistics Qinglin Wu† Roy O. Martin Sr. Professor School of Renewable Natural Resources and Quang V. Cao Professor School of Renewable Natural Resources Louisiana State University Agricultural Center Baton Rouge, LA 70803-6202 (Received July 2005) ABSTRACT Fiber lengths were analyzed for random samples of medium-density fiberboard (MDF) fibers and wood particles taken from eleven different populations. For six of the samples, the lognormal distribution fit the data, while the Weibull distribution did not. For three of the samples, the Weibull fit the data, while the lognormal did not. For two of the samples, both the lognormal and Weibull fit the data. Conclusions were based on hypothesis tests imposing a bound of 0.05 on the probability of making a Type I error for each test. Tests were based on large sample 95% nonparametric simultaneous confidence bands for the un- derlying cumulative distribution functions of the data. Keywords: Goodness of fit tests, MDF fiber, maximum likelihood estimation, non-parametric confi- dence bands, probability plots, statistical analyses. INTRODUCTION (MDF), hardboard, and wood-polymer compos- ites (Takahashi et al. 1979; Mark and Gillis Several studies have established the effect of 1983; Eckert et al. 1997; Marklund et al. 1998; wood fiber dimension and morphology on cer- Lee et al. 2001; Myers 2002; Huber et al. 2003). tain mechanical properties (e.g., bending However, suitable statistical functions that can strength and internal bonding strength) of wood be used to accurately describe wood fiber length fiber-based products such as paper, paper board, distribution over various fiber length regimes are insulation board, medium-density fiberboard still missing.
    [Show full text]
  • Exponential and Weibull Distributions
    Exponential and Weibull Distributions Exponential and Weibull Distributions Exponential Random Variables 1 Poisson random variables are concerned with the number of `arrivals', ‘flaws' or `successes' in a given interval of time or length. 2 The mean number of successes in a unit interval is λ. 3 The waiting time or distance between `successes' is an exponential random variable with density λe−λx x ≥ 0 f (x) = 0 x < 0 Exponential and Weibull Distributions The Exponential Distribution 1 The expected value and variance of an exponential distribution with parameter λ are 1 1 E(X ) = ; V (X ) = : λ λ2 2 It can be easily verified that for an exponential random variable with parameter λ, P(X > x) = e−λx P(X ≤ x) = 1 − e−λx : for x ≥ 0. Exponential and Weibull Distributions Exponential Distribution - Example Example Log ons to a departmental computer network follow a Poisson distribution with a mean of 2 log ons per minute. (i) What is the probability that there are no log ons in the next 3 minutes? (ii) What is the probability that the time to the next log on is between 1 and 2.5 minutes? (iii) What is the mean time between successive log ons to this network? Here the unit of time is the minute. Exponential and Weibull Distributions Exponential Distribution - Example Example (i) Let X denote the waiting time in minutes to the next log on. Then X has an exponential distribution with λ = 2. Hence P(X > 3) = e−2(3) = 0:0025: (ii) P(1 ≤ X ≤ 2:5) = P(X ≤ 2:5) − P(X < 1) = e−2(1) − e−2(2:5) = 0:129 1 (iii) The mean of an exponential distribution with parameter λ is λ so the mean time between successive log ons is 0:5 minutes.
    [Show full text]