POLS 7050 the Exponential Family

Total Page:16

File Type:pdf, Size:1020Kb

POLS 7050 the Exponential Family POLS 7050 Spring 2009 April 23, 2009 Pulling it All Together: Generalized Linear Models Throughout the course, we've considered the various models we've discussed discretely. In fact, however, most of the models we've discussed can actually be considered under a unified framework, as examples of a class of models known as generalized linear models (usually abbreviated GLMs). To understand GLMs, we first have to get a handle on the exponential family of probability distributions. Once we have that, we can show how a bunch of the models we've been learning about are special cases of the more general class of GLMs. The Exponential Family We'll start with a random variable Z, which as realizations called z. We are typically interested in the conditional density (PDF) of Z, where the conditioning is on some parameter or parameters : f(zjθ) = Pr(Z = zj ) A PDF is a member of the exponential family of probability functions if it can be written in the form:1 f(zjθ) = r(z)s( ) exp[q(z)h( )] (1) where r(·) and q(·) are functions of z that do not depend on , and (conversely) s(·) and h(·) are functions of that do not depend on z. For reasons that will be apparent in a minute, it is also necessary that r(z) > 0 and s( ) > 0. With a little bit of algebra, we can rewrite (1) as: f(zj ) = exp[ln r(z) + ln s( ) + q(z)h( )]: (2) Gill dubs the first two components the \additive" part, and the second the \interactive" bit. Here, z will be the \response" variable, and we'll use to introduce covariates. It's important to emphasize that any PDF that can be written in this way is a member of the exponential family; as we'll see, this encompasses a really broad range of interesting densities. 1This presentation owes a good deal to Jeff Gill's exposition of GLMs at the Boston area JASA meeting, Feb. 28, 1998. 1 \Canonical" Forms The \canonical" form of the distribution results when q(z) = z. This means that we can always \get to" the canonical form by transforming Z into Y according to y = q(z). In canonical form, then, Y = Z is our \response" variable; in such circumstances, h( ) is referred to as the natural parameter of the distribution. We can write θ = h( ) and recognize that, in some instances, h( ) = (so that θ = ). This is known as the \link" function. Writing this allows us to collect terms from (2) into a simpler formulation: f[yjθ] = exp[yθ − b(θ) + c(y)]: (3) Here, • b(θ) { sometimes called a \normalizing constant" { is a function solely of the parame- ter(s) θ, • c(y) is a function solely of y, the (potentially transformed) response variable, and • yθ is a multiplicative term between the response and the parameter(s). Some Familiar Exponential Family Examples If all of this seems a bit abstract, it shouldn't. The key point to remember is that any probability distribution that can be written in the forms in (1) - (3) is a member of the exponential family. Consider three quick examples. First, we know that if a variable Y follows a Poisson distribution, we can write its density as: exp(−λ)λy f(yjλ) = : (4) y! Rearranging a bit, we can rewrite (4) as: exp(−λ)λy f(yjλ) = exp ln y! = exp[y ln(λ) − λ − ln(y!)] (5) Here, we have θ = ln(λ), and (equivalently) λ ≡ b(θ) = exp(θ), while c(y) = ln(y!). The Poisson is thus an example of a one-parameter exponential family distribution. 2 Distributional parameters that are outside of the formulation in (1) - (3) are usually called nuisance parameters. They are typically not of central interest to researchers, though we still must treat them in the exponential family formulation. A common kind of nuisance parameter is a \scale parameter" { a parameter that defines the scale of the response variable (that is, it \stretches" or \shrinks" the axis of that variable). We can incorporate such a parameter (call it φ) through a slight generalization of (3): yθ − b(θ) f(yjθ; φ) = exp + c(y; φ) (6) a(φ) When no scale parameter is present, a(φ) = 1 and (6) reverts to (3). We can illustrate such a distribution by considering our old friend, the normal distribution: 1 (y − µ)2 f(yjµ, σ2) = p exp (7) 2πσ2 2σ2 In the normal, σ2 can be thought of as a scale parameter { it defines the \scale" (variance) of Y . We can rewrite (7) in exponential form as: 1 1 f(yjµ, σ2) = exp − ln(2πσ2) − (y2 − 2yµ + µ2) 2 2σ2 1 1 1 1 = exp − ln(2πσ2) − y2 + 2yµ − µ2 2 2σ2 2σ2 2σ2 yµ µ2 y2 1 = exp − − − ln(2πσ2) σ2 2σ2 2σ2 2 " 2 # yµ − µ −1 y2 = exp 2 + + ln(2πσ2) (8) σ2 2 σ2 Here, θ = µ, which means that: • yθ = yµ, µ2 • b(θ) = 2 , • a(φ) = σ2 is the scale (nuisance) parameter, and −1 y2 2 • c(y; φ) = 2 σ2 + ln(2πσ ) . The Normal distribution is thus a member of the exponential family, and is in canonical form as well. Other distributions that are canonical members of the family include the binomial (with p as the canonical parameter), the gamma, and the inverse gaussian. Non-canonical family members include the lognormal and Weibull distributions. 3 Little Red Likelihood To estimate and make inferences about the parameters of interest, we can derive a general likelihood for the class of models expressable as (3). That (log-)likelihood is: ln L(θ; φjy) = ln f(yjθ; φ) yθ − b(θ) = ln exp + c(y; φ) a(φ) yθ − b(θ) = + c(y; φ): (9) a(φ) This expresses the log-likelihood of the parameters in terms of the data. To obtain estimates, we maximize this function in the usual way: by taking its first derivative with respect to the parameter(s) of interest θ, setting it equal to zero, and solving. The former is: @ ln L(θ; φjy) @ yθ − b(θ) ≡ S = + c(y; φ) @θ @θ a(φ) y − @ b(θ) = @θ : (10) a(φ) In GLM-speak, the quantity in (10) { the partial derivative of the log-likelihood with respect to the parameter(s) of interest { is typically called the \score function." It has a number of important properties: • S is a sufficient statistic for θ. • E(S) = 0. • Var(S) ≡ I(θ) = E[(S)2jθ], because E(S) = 0. This is known as the Fisher information (hence, I(θ)). Finally, assuming general regularity conditions (which all exponential family distributions meet), it is the case that the conditional expectation of Y is just: @ E(Y ) = b(θ) (11) @θ and @2 Var(Y ) = a(φ) b(θ) (12) @θ2 (see, e.g., McCullagh and Nelder 1989, pp. 26-29 for a derivation). This means that { for exponential family distributions { we can get the mean of Y directly from b(θ), and the variance of Y from b(θ) and a(φ). So, for our Poisson example, (11) and (12) mean that: 4 @ E(Y ) = exp(θ) @θ = exp(θ)jθ=ln(λ) = λ and @2 Var(Y ) = 1 × exp(θ)j @θ2 θ=ln(λ) = exp[ln(λ)] = λ which is exactly what we'd expect for the Poisson. Likewise, from our normal example above, (11) and (12) imply that: @ θ2 E(Y ) = @θ 2 = θjθ=µ = µ and @2 θ2 Var(Y ) = σ2 × @θ2 2 @ = σ2 × θ @θ = σ2 There are lots more interesting things about exponential family distributions, but we'll skip them in the interest of saving time. 5 Generalized Linear Models What does all this have to do with regression-like models? To answer that, let's return to our old friend, the continuous linear-normal regression model: Yi = Xiβ + ui (13) where Yi is the N × 1 vector for the response/\dependent" variable, Xi is the N × k matrix of covariates, β is a k × 1 vector of parameters, and ui is a N × 1 vector of i.i.d. Normal errors with mean zero and constant variance σ2. In this standard formulation, we usually think of Xiβ as the \systematic component" of Yi, such that E(Yi) ≡ µi = Xiβ (14) 2 In this setup, the variable Y is distributed Normally, with mean µi and variance σ , and wed can estimate the parameters β (and σ2) in the usual way. The \Generalized" Part We can generalize (14) by allowing the expected value of Y to vary with Xβ according to a function g(·): g(µi) = Xiβ: (15) g(·) is generally referred to as a \link function," because its purpose is to \link" the ex- pected value of Yi to Xiβ in a general, flexible way. We require g(·) to be continuously twice-differentiable (that is, \smooth") and monotonic in µ. Importantly, note that we are not transforming the value of Y itself, but rather its expected value. In fact, in this formulation, Xiβ informs a function of E(Yi), not E(Yi) itself. In this way, g[E(Yi)] is required to meet all the usual Gauss-Markov assumptions (linearity, homoscedasticity, normality, etc.) but E(Yi) itself need not do so. It is useful to write ηi = Xiβ where ηi is often known as the \linear predictor" (or \index"); this means that we can rewrite (15) as: ηi = g(µi) (16) Of course, this also means that we can invert the function g(·) to express µi as a function of Xiβ: 6 −1 µi = g (ηi) −1 = g (Xiβ) (17) Pulling all this together, we can think of a GLM as having three key components: 1.
Recommended publications
  • Concentration and Consistency Results for Canonical and Curved Exponential-Family Models of Random Graphs
    CONCENTRATION AND CONSISTENCY RESULTS FOR CANONICAL AND CURVED EXPONENTIAL-FAMILY MODELS OF RANDOM GRAPHS BY MICHAEL SCHWEINBERGER AND JONATHAN STEWART Rice University Statistical inference for exponential-family models of random graphs with dependent edges is challenging. We stress the importance of additional structure and show that additional structure facilitates statistical inference. A simple example of a random graph with additional structure is a random graph with neighborhoods and local dependence within neighborhoods. We develop the first concentration and consistency results for maximum likeli- hood and M-estimators of a wide range of canonical and curved exponential- family models of random graphs with local dependence. All results are non- asymptotic and applicable to random graphs with finite populations of nodes, although asymptotic consistency results can be obtained as well. In addition, we show that additional structure can facilitate subgraph-to-graph estimation, and present concentration results for subgraph-to-graph estimators. As an ap- plication, we consider popular curved exponential-family models of random graphs, with local dependence induced by transitivity and parameter vectors whose dimensions depend on the number of nodes. 1. Introduction. Models of network data have witnessed a surge of interest in statistics and related areas [e.g., 31]. Such data arise in the study of, e.g., social networks, epidemics, insurgencies, and terrorist networks. Since the work of Holland and Leinhardt in the 1970s [e.g., 21], it is known that network data exhibit a wide range of dependencies induced by transitivity and other interesting network phenomena [e.g., 39]. Transitivity is a form of triadic closure in the sense that, when a node k is connected to two distinct nodes i and j, then i and j are likely to be connected as well, which suggests that edges are dependent [e.g., 39].
    [Show full text]
  • Use of Statistical Tables
    TUTORIAL | SCOPE USE OF STATISTICAL TABLES Lucy Radford, Jenny V Freeman and Stephen J Walters introduce three important statistical distributions: the standard Normal, t and Chi-squared distributions PREVIOUS TUTORIALS HAVE LOOKED at hypothesis testing1 and basic statistical tests.2–4 As part of the process of statistical hypothesis testing, a test statistic is calculated and compared to a hypothesised critical value and this is used to obtain a P- value. This P-value is then used to decide whether the study results are statistically significant or not. It will explain how statistical tables are used to link test statistics to P-values. This tutorial introduces tables for three important statistical distributions (the TABLE 1. Extract from two-tailed standard Normal, t and Chi-squared standard Normal table. Values distributions) and explains how to use tabulated are P-values corresponding them with the help of some simple to particular cut-offs and are for z examples. values calculated to two decimal places. STANDARD NORMAL DISTRIBUTION TABLE 1 The Normal distribution is widely used in statistics and has been discussed in z 0.00 0.01 0.02 0.03 0.050.04 0.05 0.06 0.07 0.08 0.09 detail previously.5 As the mean of a Normally distributed variable can take 0.00 1.0000 0.9920 0.9840 0.9761 0.9681 0.9601 0.9522 0.9442 0.9362 0.9283 any value (−∞ to ∞) and the standard 0.10 0.9203 0.9124 0.9045 0.8966 0.8887 0.8808 0.8729 0.8650 0.8572 0.8493 deviation any positive value (0 to ∞), 0.20 0.8415 0.8337 0.8259 0.8181 0.8103 0.8206 0.7949 0.7872 0.7795 0.7718 there are an infinite number of possible 0.30 0.7642 0.7566 0.7490 0.7414 0.7339 0.7263 0.7188 0.7114 0.7039 0.6965 Normal distributions.
    [Show full text]
  • On the Scale Parameter of Exponential Distribution
    Review of the Air Force Academy No.2 (34)/2017 ON THE SCALE PARAMETER OF EXPONENTIAL DISTRIBUTION Anca Ileana LUPAŞ Military Technical Academy, Bucharest, Romania ([email protected]) DOI: 10.19062/1842-9238.2017.15.2.16 Abstract: Exponential distribution is one of the widely used continuous distributions in various fields for statistical applications. In this paper we study the exact and asymptotical distribution of the scale parameter for this distribution. We will also define the confidence intervals for the studied parameter as well as the fixed length confidence intervals. 1. INTRODUCTION Exponential distribution is used in various statistical applications. Therefore, we often encounter exponential distribution in applications such as: life tables, reliability studies, extreme values analysis and others. In the following paper, we focus our attention on the exact and asymptotical repartition of the exponential distribution scale parameter estimator. 2. SCALE PARAMETER ESTIMATOR OF THE EXPONENTIAL DISTRIBUTION We will consider the random variable X with the following cumulative distribution function: x F(x ; ) 1 e ( x 0 , 0) (1) where is an unknown scale parameter Using the relationships between MXXX( ) ; 22( ) ; ( ) , we obtain ()X a theoretical variation coefficient 1. This is a useful indicator, especially if MX() you have observational data which seems to be exponential and with variation coefficient of the selection closed to 1. If we consider x12, x ,... xn as a part of a population that follows an exponential distribution, then by using the maximum likelihood estimation method we obtain the following estimate n ˆ 1 xi (2) n i1 119 On the Scale Parameter of Exponential Distribution Since M ˆ , it follows that ˆ is an unbiased estimator for .
    [Show full text]
  • The Probability Lifesaver: Order Statistics and the Median Theorem
    The Probability Lifesaver: Order Statistics and the Median Theorem Steven J. Miller December 30, 2015 Contents 1 Order Statistics and the Median Theorem 3 1.1 Definition of the Median 5 1.2 Order Statistics 10 1.3 Examples of Order Statistics 15 1.4 TheSampleDistributionoftheMedian 17 1.5 TechnicalboundsforproofofMedianTheorem 20 1.6 TheMedianofNormalRandomVariables 22 2 • Greetings again! In this supplemental chapter we develop the theory of order statistics in order to prove The Median Theorem. This is a beautiful result in its own, but also extremely important as a substitute for the Central Limit Theorem, and allows us to say non- trivial things when the CLT is unavailable. Chapter 1 Order Statistics and the Median Theorem The Central Limit Theorem is one of the gems of probability. It’s easy to use and its hypotheses are satisfied in a wealth of problems. Many courses build towards a proof of this beautiful and powerful result, as it truly is ‘central’ to the entire subject. Not to detract from the majesty of this wonderful result, however, what happens in those instances where it’s unavailable? For example, one of the key assumptions that must be met is that our random variables need to have finite higher moments, or at the very least a finite variance. What if we were to consider sums of Cauchy random variables? Is there anything we can say? This is not just a question of theoretical interest, of mathematicians generalizing for the sake of generalization. The following example from economics highlights why this chapter is more than just of theoretical interest.
    [Show full text]
  • Notes Mean, Median, Mode & Range
    Notes Mean, Median, Mode & Range How Do You Use Mode, Median, Mean, and Range to Describe Data? There are many ways to describe the characteristics of a set of data. The mode, median, and mean are all called measures of central tendency. These measures of central tendency and range are described in the table below. The mode of a set of data Use the mode to show which describes which value occurs value in a set of data occurs most frequently. If two or more most often. For the set numbers occur the same number {1, 1, 2, 3, 5, 6, 10}, of times and occur more often the mode is 1 because it occurs Mode than all the other numbers in the most frequently. set, those numbers are all modes for the data set. If each number in the set occurs the same number of times, the set of data has no mode. The median of a set of data Use the median to show which describes what value is in the number in a set of data is in the middle if the set is ordered from middle when the numbers are greatest to least or from least to listed in order. greatest. If there are an even For the set {1, 1, 2, 3, 5, 6, 10}, number of values, the median is the median is 3 because it is in the average of the two middle the middle when the numbers are Median values. Half of the values are listed in order. greater than the median, and half of the values are less than the median.
    [Show full text]
  • A Family of Skew-Normal Distributions for Modeling Proportions and Rates with Zeros/Ones Excess
    S S symmetry Article A Family of Skew-Normal Distributions for Modeling Proportions and Rates with Zeros/Ones Excess Guillermo Martínez-Flórez 1, Víctor Leiva 2,* , Emilio Gómez-Déniz 3 and Carolina Marchant 4 1 Departamento de Matemáticas y Estadística, Facultad de Ciencias Básicas, Universidad de Córdoba, Montería 14014, Colombia; [email protected] 2 Escuela de Ingeniería Industrial, Pontificia Universidad Católica de Valparaíso, 2362807 Valparaíso, Chile 3 Facultad de Economía, Empresa y Turismo, Universidad de Las Palmas de Gran Canaria and TIDES Institute, 35001 Canarias, Spain; [email protected] 4 Facultad de Ciencias Básicas, Universidad Católica del Maule, 3466706 Talca, Chile; [email protected] * Correspondence: [email protected] or [email protected] Received: 30 June 2020; Accepted: 19 August 2020; Published: 1 September 2020 Abstract: In this paper, we consider skew-normal distributions for constructing new a distribution which allows us to model proportions and rates with zero/one inflation as an alternative to the inflated beta distributions. The new distribution is a mixture between a Bernoulli distribution for explaining the zero/one excess and a censored skew-normal distribution for the continuous variable. The maximum likelihood method is used for parameter estimation. Observed and expected Fisher information matrices are derived to conduct likelihood-based inference in this new type skew-normal distribution. Given the flexibility of the new distributions, we are able to show, in real data scenarios, the good performance of our proposal. Keywords: beta distribution; centered skew-normal distribution; maximum-likelihood methods; Monte Carlo simulations; proportions; R software; rates; zero/one inflated data 1.
    [Show full text]
  • Generalized Inferences for the Common Scale Parameter of Several Pareto Populations∗
    InternationalInternational JournalJournal of of Statistics Statistics and and Management Management System System Vol.Vol. 4 No. 12 (January-June,(July-December, 2019) 2019) International Journal of Statistics and Management System, 2010, Vol. 5, No. 1–2, pp. 118–126. c 2010 Serials Publications Generalized inferences for the common scale parameter of several Pareto populations∗ Sumith Gunasekera†and Malwane M. A. Ananda‡ Received: 13th August 2018 Revised: 24th December 2018 Accepted: 10th March 2019 Abstract A problem of interest in this article is statistical inferences concerning the com- mon scale parameter of several Pareto distributions. Using the generalized p-value approach, exact confidence intervals and the exact tests for testing the common scale parameter are given. Examples are given in order to illustrate our procedures. A limited simulation study is given to demonstrate the performance of the proposed procedures. 1 Introduction In this paper, we consider k (k ≥ 2) independent Pareto distributions with an unknown common scale parameter θ (sometimes referred to as the “location pa- rameter” and also as the “truncation parameter”) and unknown possibly unequal shape parameters αi’s (i = 1, 2, ..., k). Using the generalized variable approach (Tsui and Weer- ahandi [8]), we construct an exact test for testing θ. Furthermore, using the generalized confidence interval (Weerahandi [11]), we construct an exact confidence interval for θ as well. A limited simulation study was carried out to compare the performance of these gen- eralized procedures with the approximate procedures based on the large sample method as well as with the other test procedures based on the combination of p-values.
    [Show full text]
  • Sampling Student's T Distribution – Use of the Inverse Cumulative
    Sampling Student’s T distribution – use of the inverse cumulative distribution function William T. Shaw Department of Mathematics, King’s College, The Strand, London WC2R 2LS, UK With the current interest in copula methods, and fat-tailed or other non-normal distributions, it is appropriate to investigate technologies for managing marginal distributions of interest. We explore “Student’s” T distribution, survey its simulation, and present some new techniques for simulation. In particular, for a given real (not necessarily integer) value n of the number of degrees of freedom, −1 we give a pair of power series approximations for the inverse, Fn ,ofthe cumulative distribution function (CDF), Fn.Wealsogivesomesimpleandvery fast exact and iterative techniques for defining this function when n is an even −1 integer, based on the observation that for such cases the calculation of Fn amounts to the solution of a reduced-form polynomial equation of degree n − 1. We also explain the use of Cornish–Fisher expansions to define the inverse CDF as the composition of the inverse CDF for the normal case with a simple polynomial map. The methods presented are well adapted for use with copula and quasi-Monte-Carlo techniques. 1 Introduction There is much interest in many areas of financial modeling on the use of copulas to glue together marginal univariate distributions where there is no easy canonical multivariate distribution, or one wishes to have flexibility in the mechanism for combination. One of the more interesting marginal distributions is the “Student’s” T distribution. This statistical distribution was published by W. Gosset in 1908.
    [Show full text]
  • Procedures for Estimation of Weibull Parameters James W
    United States Department of Agriculture Procedures for Estimation of Weibull Parameters James W. Evans David E. Kretschmann David W. Green Forest Forest Products General Technical Report February Service Laboratory FPL–GTR–264 2019 Abstract Contents The primary purpose of this publication is to provide an 1 Introduction .................................................................. 1 overview of the information in the statistical literature on 2 Background .................................................................. 1 the different methods developed for fitting a Weibull distribution to an uncensored set of data and on any 3 Estimation Procedures .................................................. 1 comparisons between methods that have been studied in the 4 Historical Comparisons of Individual statistics literature. This should help the person using a Estimator Types ........................................................ 8 Weibull distribution to represent a data set realize some advantages and disadvantages of some basic methods. It 5 Other Methods of Estimating Parameters of should also help both in evaluating other studies using the Weibull Distribution .......................................... 11 different methods of Weibull parameter estimation and in 6 Discussion .................................................................. 12 discussions on American Society for Testing and Materials Standard D5457, which appears to allow a choice for the 7 Conclusion ................................................................
    [Show full text]
  • Volatility Modeling Using the Student's T Distribution
    Volatility Modeling Using the Student’s t Distribution Maria S. Heracleous Dissertation submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Economics Aris Spanos, Chair Richard Ashley Raman Kumar Anya McGuirk Dennis Yang August 29, 2003 Blacksburg, Virginia Keywords: Student’s t Distribution, Multivariate GARCH, VAR, Exchange Rates Copyright 2003, Maria S. Heracleous Volatility Modeling Using the Student’s t Distribution Maria S. Heracleous (ABSTRACT) Over the last twenty years or so the Dynamic Volatility literature has produced a wealth of uni- variateandmultivariateGARCHtypemodels.Whiletheunivariatemodelshavebeenrelatively successful in empirical studies, they suffer from a number of weaknesses, such as unverifiable param- eter restrictions, existence of moment conditions and the retention of Normality. These problems are naturally more acute in the multivariate GARCH type models, which in addition have the problem of overparameterization. This dissertation uses the Student’s t distribution and follows the Probabilistic Reduction (PR) methodology to modify and extend the univariate and multivariate volatility models viewed as alternative to the GARCH models. Its most important advantage is that it gives rise to internally consistent statistical models that do not require ad hoc parameter restrictions unlike the GARCH formulations. Chapters 1 and 2 provide an overview of my dissertation and recent developments in the volatil- ity literature. In Chapter 3 we provide an empirical illustration of the PR approach for modeling univariate volatility. Estimation results suggest that the Student’s t AR model is a parsimonious and statistically adequate representation of exchange rate returns and Dow Jones returns data.
    [Show full text]
  • Determination of the Weibull Parameters from the Mean Value and the Coefficient of Variation of the Measured Strength for Brittle Ceramics
    Journal of Advanced Ceramics 2017, 6(2): 149–156 ISSN 2226-4108 DOI: 10.1007/s40145-017-0227-3 CN 10-1154/TQ Research Article Determination of the Weibull parameters from the mean value and the coefficient of variation of the measured strength for brittle ceramics Bin DENGa, Danyu JIANGb,* aDepartment of the Prosthodontics, Chinese PLA General Hospital, Beijing 100853, China bAnalysis and Testing Center for Inorganic Materials, State Key Laboratory of High Performance Ceramics and Superfine Microstructure, Shanghai Institute of Ceramics, Shanghai 200050, China Received: January 11, 2017; Accepted: April 7, 2017 © The Author(s) 2017. This article is published with open access at Springerlink.com Abstract: Accurate estimation of Weibull parameters is an important issue for the characterization of the strength variability of brittle ceramics with Weibull statistics. In this paper, a simple method was established for the determination of the Weibull parameters, Weibull modulus m and scale parameter 0 , based on Monte Carlo simulation. It was shown that an unbiased estimation for Weibull modulus can be yielded directly from the coefficient of variation of the considered strength sample. Furthermore, using the yielded Weibull estimator and the mean value of the strength in the considered sample, the scale parameter 0 can also be estimated accurately. Keywords: Weibull distribution; Weibull parameters; strength variability; unbiased estimation; coefficient of variation 1 Introduction likelihood (ML) method. Many studies [9–18] based on Monte Carlo simulation have shown that each of these Weibull statistics [1] has been widely employed to methods has its benefits and drawbacks. model the variability in the fracture strength of brittle According to the theory of statistics, the expected ceramics [2–8].
    [Show full text]
  • Power Birnbaum-Saunders Student T Distribution
    Revista Integración Escuela de Matemáticas Universidad Industrial de Santander H Vol. 35, No. 1, 2017, pág. 51–70 Power Birnbaum-Saunders Student t distribution Germán Moreno-Arenas a ∗, Guillermo Martínez-Flórez b Heleno Bolfarine c a Universidad Industrial de Santander, Escuela de Matemáticas, Bucaramanga, Colombia. b Universidad de Córdoba, Departamento de Matemáticas y Estadística, Montería, Colombia. c Universidade de São Paulo, Departamento de Estatística, São Paulo, Brazil. Abstract. The fatigue life distribution proposed by Birnbaum and Saunders has been used quite effectively to model times to failure for materials sub- ject to fatigue. In this article, we introduce an extension of the classical Birnbaum-Saunders distribution substituting the normal distribution by the power Student t distribution. The new distribution is more flexible than the classical Birnbaum-Saunders distribution in terms of asymmetry and kurto- sis. We discuss maximum likelihood estimation of the model parameters and associated regression model. Two real data set are analysed and the results reveal that the proposed model better some other models proposed in the literature. Keywords: Birnbaum-Saunders distribution, alpha-power distribution, power Student t distribution. MSC2010: 62-07, 62F10, 62J02, 62N86. Distribución Birnbaum-Saunders Potencia t de Student Resumen. La distribución de probabilidad propuesta por Birnbaum y Saun- ders se ha usado con bastante eficacia para modelar tiempos de falla de ma- teriales sujetos a la fátiga. En este artículo definimos una extensión de la distribución Birnbaum-Saunders clásica sustituyendo la distribución normal por la distribución potencia t de Student. La nueva distribución es más flexi- ble que la distribución Birnbaum-Saunders clásica en términos de asimetría y curtosis.
    [Show full text]