Logistic Distribution
Total Page:16
File Type:pdf, Size:1020Kb
Logistic Distribution Raminta Stockute, Andrea Veaux, Paul Johnson June 10, 2013 1 Introduction The Logistic distribution is a continuous probability density function that is symmetric and uni-modal. It is similar in appearance to the Normal distribution and in practical applications, the two distributions cannot be distinguished from one another. 2 Mathematical Definition This distribution is characterized by two main parameters: location µ and scale σ. The probability density function is: (Raminta: I believe these signs are wrong. Do you agree?) 1 e−(x−µ)/σ f(x) = ∗ . σ 1 + e−(x−µ)/σ)2 The cumulative distribution of the Logistic is the famous S-shaped curve on which Logistic regression is based: 1 F (x) = . 1 + e−(x−µ)/σ 3 Moments The Compendium of the distributions (citation?) points out that the expected value is equal to the location parameter, µ: E(x) = µ Because the distribution is symmetric, the median and the mode are also equal to α. The variance of this distribution is: 1 V ar(x) = (πσ)2. 3 Compare that value against the Normal distribution, N(µ,σ2). The variance of the Logistic is different from the variance of the normal only by the scaling value of π2/3 1 4 Illustrations In Figure 1, some examples of Logistic distributions are offered. In Figures 2 and 3, the density functions that correspond to various location and scale parameters are presented. If we look at Figure 2, we see that scale controls the width and the height of the distribution. As you see in Figure 3, if we vary the location of the density, the density curve shifts to the left and right, while the “spread” and “height” remains the same. The Logistic density appears to be very similar to the Normal distribution. If we super- imposed this distribution on the Normal, however, we would see that the Logistic has thicker tails. Consider Figure 4, in which a Logistic with location 0 and scale 1 is superimposed on a graph of the Normal density in which the variance is the same. That is to say, the variance 2 2 of the Logistic√ is π /3 and the variance of the Normal is π /3 (or, equivalently, the standard deviation is π/ 3). Figure 5 illustrates the variations we see in the cumulative distribution function, when vary- ing the scale parameter. 5 Conclusion When would we use a logistic distribution? The Compendium says that is is often used instead “as an approximation to other symmetrical distributions due to the mathematical tractability of its CDF.” Put in a more simple way, the Logistic gives a nice looking S-shaped curve with a relatively simple mathematical formula. The S-shaped curve, as seen in Figure 5, is used in the so-called Logistic regression model, which uses input variables to make predictions about how likelihood of certain outcomes. The S-shaped curve of the Logistic CDF is thought to be a substantively useful description of how the probability of an “event” or other outcome rises as a function of some input variables. 2 Figure 1: Logistic Variety location=1, scale=0.5 location=1, scale=0.75 0.0 0.5 0.0 0.5 probability of x −2 0 2 4 6 probability of x −2 0 2 4 6 x x location=1, scale=1 location=1, scale=1.25 0.0 0.4 0.0 0.4 probability of x −2 0 2 4 6 probability of x −2 0 2 4 6 x x location=1, scale=1.5 location=1, scale=1.75 0.00 0.00 probability of x −2 0 2 4 6 probability of x −2 0 2 4 6 x x location=1, scale=2 location=1, scale=2.25 0.00 0.00 probability of x −2 0 2 4 6 probability of x −2 0 2 4 6 x x 3 Figure 2: Logistic Distribution: Various Scales (location=0) scale=1 scale=2 scale=3 P(x) 0.0 0.1 0.2 0.3 0.4 0.5 −10 −5 0 5 10 x 4 Figure 3: Logistric Distribution: Various Locations (scale=1) location=0 location=2 location=4 P(x) 0.0 0.1 0.2 0.3 0.4 0.5 −10 −5 0 5 10 x 5 Figure 4: Logistric and Normal Distributions Logistic(0,1) Normal(0,pi^2/3)) P(x) 0.0 0.1 0.2 0.3 0.4 0.5 −6 −4 −2 0 2 4 6 x 6 Figure 5: Cumulative Distribution of the Logistic scale=1 scale=2 scale=3 P(x) 0.0 0.2 0.4 0.6 0.8 1.0 −10 −5 0 5 10 x 7.