11/2/2016
CHE384, From Data to Decisions: Measurement, Generalized Linear Model (GLM) Uncertainty, Analysis, and Modeling
• Let yi have any probability distribution so long as it Lecture 58 is from the “exponential” family – e.g., normal, log-normal, exponential, gamma, Generalized Linear Modeling chi-squared, beta, Bernoulli, Poisson, etc. – Not included are Student’s t, mixed distributions Chris A. Mack • Allow for any transformation of y (the link function) Adjunct Associate Professor – Must be monotonic and differentiable • Linear in the model parameters Link function http://www.lithoguru.com/scientist/statistics/ ⋯ © Chris Mack, 2016Data to Decisions 1 © Chris Mack, 2016Data to Decisions 2
Non-Normal Distributions Typical Distribution/Link Pairings
• For a non-normal distribution, Least Distribution Typical uses Link Name Link function Squares ≠ MLE Normal Linear-response data Identity • Combine any link function with any Exponential Exponential-response data, Inverse 1/ (exponential family) probability distribution or Gamma scale parameters for y, then find the maximum likelihood count of occurrences in Poisson Log ln estimates for the parameters fixed amount of time/space – Solve with iteratively reweighted least squares Bernoulli or outcome of single yes/no Logit ln – Many software packages can do this Binomial occurrence (logistic model) regression
© Chris Mack, 2016 Data to Decisions 3 © Chris Mack, 2016 Data to Decisions 4
What if our Response is Binary? Predicting Proportions • Sometimes the response is binary • We want to predict a proportion (or – The patient lives or dies probability), p, for a categorical variable – The part passes or fails – Fraction of people that die of a heart attack – The customer buys or doesn’t buy – Fraction of molecules that decompose • The response y will follow a Bernoulli • Consider the following linear model distribution Suppose ⋯ – , the probability of “success” (y = 1) – 1 (a function of the mean) • Problem: p is constrained to between 0 and 1 • We want to model the probability of success – This model does not force a constraint on – – Additionally, the variance is not constant
© Chris Mack, 2016Data to Decisions 5 © Chris Mack, 2016Data to Decisions 6
1 11/2/2016
Probit Regression Logistic Distribution