
Least Squares Estimates of $\beta_0$ and $\beta_1$

Simple linear regression involves the model $\hat{Y} = \mu_{Y|X} = \beta_0 + \beta_1 X$.

This document derives the least squares estimates of $\beta_0$ and $\beta_1$. It is simply for your own information. You will not be held responsible for this derivation.

The least squares estimates of $\beta_0$ and $\beta_1$ are:

$$
\hat{\beta}_1 = \frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n}(X_i - \bar{X})^2}, \qquad
\hat{\beta}_0 = \bar{Y} - \hat{\beta}_1 \bar{X}
$$
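To make these formulas concrete, here is a minimal numerical sketch (not part of the handout) that computes the estimates directly from the closed-form expressions above and checks them against NumPy's `np.polyfit`; the data are made up purely for illustration.

```python
# A minimal sketch (not from the handout) computing the estimates
# directly from the closed-form expressions above.
import numpy as np

def least_squares_estimates(x, y):
    """Return (beta0_hat, beta1_hat) for simple linear regression."""
    x_bar, y_bar = x.mean(), y.mean()
    beta1_hat = np.sum((x - x_bar) * (y - y_bar)) / np.sum((x - x_bar) ** 2)
    beta0_hat = y_bar - beta1_hat * x_bar
    return beta0_hat, beta1_hat

# Made-up data, purely for illustration.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

b0_hat, b1_hat = least_squares_estimates(x, y)
print(b0_hat, b1_hat)
print(np.polyfit(x, y, 1))  # returns [slope, intercept]; should agree
```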

The classic derivation of the least squares estimates uses calculus to find the $\beta_0$ and $\beta_1$ estimates that minimize the error sum of squares $SSE = \sum_{i=1}^{n}(Y_i - \hat{Y}_i)^2$. This derivation uses no calculus, only some lengthy algebra. It uses a very clever method that may be found in:

Im, Eric Iksoon, A Note On Derivation of the Least Squares Estimator, Working Paper Series No. 96-11, University of Hawai'i at Manoa Department of Economics, 1996.
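For comparison, here is a brief sketch of the classic calculus route mentioned above (an illustration of the standard approach, not taken from the paper), assuming SymPy is available: build the SSE for a small symbolic sample, set both partial derivatives to zero, and solve.

```python
# A sketch of the classic calculus route, assuming SymPy is available.
# The sample size of 4 is purely for illustration.
import sympy as sp

b0, b1 = sp.symbols('beta0 beta1')
X = sp.symbols('X1:5')  # symbols X1, X2, X3, X4
Y = sp.symbols('Y1:5')  # symbols Y1, Y2, Y3, Y4

SSE = sum((Y[i] - b0 - b1 * X[i]) ** 2 for i in range(4))

# Normal equations: dSSE/dbeta0 = 0 and dSSE/dbeta1 = 0.
sol = sp.solve([sp.diff(SSE, b0), sp.diff(SSE, b1)], [b0, b1], dict=True)[0]

# The slope solution is algebraically equivalent to
# sum((Xi - Xbar)(Yi - Ybar)) / sum((Xi - Xbar)^2).
print(sp.simplify(sol[b1]))
```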

The Derivation

The least squares estimates are the values $\hat{\beta}_0$ and $\hat{\beta}_1$ that minimize the error sum of squares

$$
SSE = \sum_{i=1}^{n}(Y_i - \hat{Y}_i)^2.
$$

We can algebraically manipulate things to get

$$
\begin{aligned}
SSE &= \sum_{i=1}^{n}(Y_i - \hat{Y}_i)^2 \\
&= \sum_{i=1}^{n}(Y_i - \beta_0 - \beta_1 X_i)^2 \\
&= \sum_{i=1}^{n}\left[(Y_i + \bar{Y} - \bar{Y}) - \beta_0 - \beta_1(X_i + \bar{X} - \bar{X})\right]^2 \\
&= \sum_{i=1}^{n}\left[(\bar{Y} - \beta_0 - \beta_1\bar{X}) + Y_i - \bar{Y} - \beta_1 X_i + \beta_1\bar{X}\right]^2 \\
&= \sum_{i=1}^{n}\left[(\bar{Y} - \beta_0 - \beta_1\bar{X}) - (\beta_1 X_i - \beta_1\bar{X} - Y_i + \bar{Y})\right]^2 \\
&= \sum_{i=1}^{n}(\bar{Y} - \beta_0 - \beta_1\bar{X})^2 + \sum_{i=1}^{n}(\beta_1 X_i - \beta_1\bar{X} - Y_i + \bar{Y})^2 \\
&= n(\bar{Y} - \beta_0 - \beta_1\bar{X})^2 + \sum_{i=1}^{n}(\beta_1 X_i - \beta_1\bar{X} - Y_i + \bar{Y})^2 \\
&= n(\bar{Y} - \beta_0 - \beta_1\bar{X})^2 + \sum_{i=1}^{n}\left[\beta_1(X_i - \bar{X}) - (Y_i - \bar{Y})\right]^2 \\
&= n(\bar{Y} - \beta_0 - \beta_1\bar{X})^2 + \sum_{i=1}^{n}\left[\beta_1^2(X_i - \bar{X})^2 - 2\beta_1(X_i - \bar{X})(Y_i - \bar{Y}) + (Y_i - \bar{Y})^2\right] \\
&= n(\bar{Y} - \beta_0 - \beta_1\bar{X})^2 + \beta_1^2\sum_{i=1}^{n}(X_i - \bar{X})^2 - 2\beta_1\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y}) + \sum_{i=1}^{n}(Y_i - \bar{Y})^2 \\
&= n(\bar{Y} - \beta_0 - \beta_1\bar{X})^2 + \left(\beta_1^2 - \frac{2\beta_1\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n}(X_i - \bar{X})^2} + \frac{\sum_{i=1}^{n}(Y_i - \bar{Y})^2}{\sum_{i=1}^{n}(X_i - \bar{X})^2}\right)\sum_{i=1}^{n}(X_i - \bar{X})^2 \\
&= n(\bar{Y} - \beta_0 - \beta_1\bar{X})^2 \\
&\quad + \left(\beta_1^2 - \frac{2\beta_1\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n}(X_i - \bar{X})^2} + \left[\frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n}(X_i - \bar{X})^2}\right]^2\right. \\
&\qquad\left. + \frac{\sum_{i=1}^{n}(Y_i - \bar{Y})^2}{\sum_{i=1}^{n}(X_i - \bar{X})^2} - \left[\frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n}(X_i - \bar{X})^2}\right]^2\right)\sum_{i=1}^{n}(X_i - \bar{X})^2 \\
&= n(\bar{Y} - \beta_0 - \beta_1\bar{X})^2 + \left(\beta_1 - \frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n}(X_i - \bar{X})^2}\right)^2\sum_{i=1}^{n}(X_i - \bar{X})^2 \\
&\quad + \sum_{i=1}^{n}(Y_i - \bar{Y})^2\left[1 - \left(\frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sqrt{\sum_{i=1}^{n}(X_i - \bar{X})^2\sum_{i=1}^{n}(Y_i - \bar{Y})^2}}\right)^2\right]
\end{aligned}
$$

(Two steps deserve a word of explanation. Expanding the square in the fifth line produces a cross-product term $-2(\bar{Y} - \beta_0 - \beta_1\bar{X})\sum_{i=1}^{n}(\beta_1 X_i - \beta_1\bar{X} - Y_i + \bar{Y})$, which vanishes because $\sum_{i=1}^{n}(X_i - \bar{X}) = 0$ and $\sum_{i=1}^{n}(Y_i - \bar{Y}) = 0$. The step that adds and subtracts $\left[\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})/\sum_{i=1}^{n}(X_i - \bar{X})^2\right]^2$ is completing the square in $\beta_1$.)
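As a sanity check, here is a short numerical sketch (made-up data, not from the handout) confirming that this three-term decomposition reproduces the SSE exactly for arbitrary values of $\beta_0$ and $\beta_1$.

```python
# A numerical sanity check (made-up data) that the three-term
# decomposition above reproduces the SSE for arbitrary beta0 and beta1.
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=50)
y = 2.0 + 3.0 * x + rng.normal(size=50)
b0, b1 = -1.0, 0.5  # arbitrary parameter values, not the estimates

sxx = np.sum((x - x.mean()) ** 2)
syy = np.sum((y - y.mean()) ** 2)
sxy = np.sum((x - x.mean()) * (y - y.mean()))

sse = np.sum((y - b0 - b1 * x) ** 2)
three_terms = (
    len(x) * (y.mean() - b0 - b1 * x.mean()) ** 2   # first squared term
    + (b1 - sxy / sxx) ** 2 * sxx                   # second squared term
    + syy * (1 - sxy ** 2 / (sxx * syy))            # data-only term
)
print(np.isclose(sse, three_terms))  # True
```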

We're still trying to minimize the SSE, and we've split the SSE into the sum of three terms. Note that the first two terms involve the parameters $\beta_0$ and $\beta_1$. The first two terms are also squared terms, so they can never be less than zero. The third term is a function of the data only, not the parameters. So, we know that

$$
\begin{aligned}
SSE &= n(\bar{Y} - \beta_0 - \beta_1\bar{X})^2 + \left(\beta_1 - \frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n}(X_i - \bar{X})^2}\right)^2\sum_{i=1}^{n}(X_i - \bar{X})^2 \\
&\quad + \sum_{i=1}^{n}(Y_i - \bar{Y})^2\left[1 - \left(\frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sqrt{\sum_{i=1}^{n}(X_i - \bar{X})^2\sum_{i=1}^{n}(Y_i - \bar{Y})^2}}\right)^2\right] \\
&\ge \sum_{i=1}^{n}(Y_i - \bar{Y})^2\left[1 - \left(\frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sqrt{\sum_{i=1}^{n}(X_i - \bar{X})^2\sum_{i=1}^{n}(Y_i - \bar{Y})^2}}\right)^2\right]
\end{aligned}
$$
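Here is a small self-contained check (again with made-up data) that plugging the estimates into the SSE attains this lower bound, which is $\sum_{i=1}^{n}(Y_i - \bar{Y})^2\,(1 - r^2)$ with $r$ the sample correlation.

```python
# A self-contained check (made-up data) that the estimates attain the
# lower bound: the minimized SSE equals Syy * (1 - r^2).
import numpy as np

rng = np.random.default_rng(2)
x = rng.normal(size=40)
y = 4.0 - 2.0 * x + rng.normal(size=40)

sxx = np.sum((x - x.mean()) ** 2)
syy = np.sum((y - y.mean()) ** 2)
sxy = np.sum((x - x.mean()) * (y - y.mean()))

b1_hat = sxy / sxx
b0_hat = y.mean() - b1_hat * x.mean()

sse_min = np.sum((y - b0_hat - b1_hat * x) ** 2)
print(np.isclose(sse_min, syy * (1 - sxy ** 2 / (sxx * syy))))  # True
```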

This lower bound is the minimum possible value for the SSE. We actually achieve this minimum value when the first two terms of the equation above are zero. Setting each of these two terms equal to zero gives us two equations in two unknowns, so we can solve for $\beta_0$ and $\beta_1$. (In the second equation we drop the positive factor $\sum_{i=1}^{n}(X_i - \bar{X})^2$, which does not affect where the term is zero.)

$$
\begin{aligned}
0 &= n(\bar{Y} - \beta_0 - \beta_1\bar{X})^2 \\
0 &= \left(\beta_1 - \frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n}(X_i - \bar{X})^2}\right)^2
\end{aligned}
$$

From the first equation we get

$$
\begin{aligned}
0 &= n(\bar{Y} - \beta_0 - \beta_1\bar{X})^2 \\
\Rightarrow\quad 0 &= \bar{Y} - \beta_0 - \beta_1\bar{X} \\
\Rightarrow\quad \beta_0 &= \bar{Y} - \beta_1\bar{X}
\end{aligned}
$$

From the second equation we get

$$
\begin{aligned}
0 &= \left(\beta_1 - \frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n}(X_i - \bar{X})^2}\right)^2 \\
\Rightarrow\quad 0 &= \beta_1 - \frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n}(X_i - \bar{X})^2} \\
\Rightarrow\quad \beta_1 &= \frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n}(X_i - \bar{X})^2}
\end{aligned}
$$

As these are estimates, we put hats on them. We are done! We've now shown that

$$
\hat{\beta}_1 = \frac{\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n}(X_i - \bar{X})^2}, \qquad
\hat{\beta}_0 = \bar{Y} - \hat{\beta}_1\bar{X}.
$$
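As a closing check, here is a brief sketch (assuming SciPy is available; the data are invented) verifying that the derived formulas agree with `scipy.stats.linregress`.

```python
# A closing check, assuming SciPy is available: the derived formulas
# agree with scipy.stats.linregress on made-up data.
import numpy as np
from scipy.stats import linregress

rng = np.random.default_rng(1)
x = rng.uniform(0.0, 10.0, size=30)
y = 1.5 - 0.7 * x + rng.normal(size=30)

b1_hat = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b0_hat = y.mean() - b1_hat * x.mean()

res = linregress(x, y)
print(np.isclose(b1_hat, res.slope), np.isclose(b0_hat, res.intercept))
```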
