ECON4150 - Introductory Econometrics
Lecture 7: OLS with Multiple Regressors –Hypotheses tests–
Monique de Haan ([email protected])
Stock and Watson Chapter 7 2 Lecture outline
• Hypothesis test for single coefficient in multiple regression analysis
• Confidence interval for single coefficient in multiple regression
• Testing hypotheses on 2 or more coefficients
• The F-statistic • The overall regression F-statistic
• Testing single restrictions involving multiple coefficients
• Measures of fit in multiple regression model
2 • SER, R2 and R • Relation between (homoskedasticity-only) F-statistic and the R2 • Interpreting measures of fit
• Interpreting “stars” in a table with regression output 3 Hypothesis test for single coefficient in multiple regression analysis Thursday February 2 14:18:25 2017 Page 1 ______(R) /__ / ____/ / ____/ ___/ / /___/ / /___/ Statistics/Data Analysis
1 . regress test_score class_size el_pct, robust
Linear regression Number of obs = 420 F(2, 417) = 223.82 Prob > F = 0.0000 R-squared = 0.4264 Root MSE = 14.464
Robust test_score Coef. Std. Err. t P>|t| [95% Conf. Interval]
class_size -1.101296 .4328472 -2.54 0.011 -1.95213 -.2504616 el_pct -.6497768 .0310318 -20.94 0.000 -.710775 -.5887786 _cons 686.0322 8.728224 78.60 0.000 668.8754 703.189
Does changing class size,while holding the percentage of English learners constant, have a statistically significant effect on test scores? (using a 5% significance level). 4 Hypothesis test for single coefficient in multiple regression analysis
Under the 4 Least Squares assumptions of the multiple regression model:
Assumption 1: E (ui |X1i , ..., Xki ) = 0
Assumption 2: (Yi , X1i , ..., Xki ) for i = 1, ..., n are (i.i.d) Assumption 3: Large outliers are unlikely Assumption 4: No perfect multicollinearity
The OLS estimators βbj for j = 1, .., k are approximately normally distributed in large samples
In addition βbj − βj0 t = ∼ N (0, 1) SE βbj
We can thus perform, hypothesis tests in same way as in regression model with 1 regressor. 5 Hypothesis test for single coefficient in multiple regression analysis
H0 : βj = βj,0 H1 : βj 6= βj,0
Step 1: Estimate Yi = β0 + β1X1i + ... + βj Xji + ... + β1Xki + ui by OLS to obtain βbj
Step 2: Compute the standard error of βbj (requires matrix algebra) Step 3: Compute the t-statistic
act βbj − βj,0 t = SE βbj
Step 4: Reject the null hypothesis if • |t act | > critical value • or if p − value < significance level 6 Hypothesis test for single coefficient in multiple regression analysis
Does changing class size, while holding the percentage of English learners constant, have a statistically significant effect on test scores? (using a 5% significance level)
H0 : βClassSize = 0 H1 : βClassSize 6= 0
Step 1: βbClassSize = −1.10 Step 2: SE βbClassSize = 0.43 Step 3: Compute the t-statistic −1.10 − 0 t act = = −2.54 0.43 Step 4: Reject the null hypothesis at 5% significance level • | − 2.54| > 1.96 and p − value = 0.011 < 0.05
Do we reject H0 at a 1% significance level? Thursday February 2 14:18:25 2017 Page 1
______(R)7 /__ / ____/ / ____/ Confidence interval for single coefficient ___/ in / multiple /___/ / regression /___/ Statistics/Data Analysis
1 . regress test_score class_size el_pct, robust
Linear regression Number of obs = 420 F(2, 417) = 223.82 Prob > F = 0.0000 R-squared = 0.4264 Root MSE = 14.464
Robust test_score Coef. Std. Err. t P>|t| [95% Conf. Interval]
class_size -1.101296 .4328472 -2.54 0.011 -1.95213 -.2504616 el_pct -.6497768 .0310318 -20.94 0.000 -.710775 -.5887786 _cons 686.0322 8.728224 78.60 0.000 668.8754 703.189
• 95% confidence interval for βClassSize given in regression output
• 99% confidence interval for βClassSize: βbClassSize ± 2.58 · SE βbClassSize
−1.10 ± 2.58 × 0.43
(−2.21 ; 0.01) 8 Hypothesis tests on 2 or more coefficients
We add two more variables, both measuring what % of students come from families with low income
• meal pcti measures % of students in district eligible for a free lunch Tuesday February 7 11:33:07 2017 Page 1 • calw pcti measures % of students in district eligible for social assistance ______(R) under a program called “CALWORKS” /__ / ____/ / ____/ ___/ / /___/ / /___/ Statistics/Data Analysis
1 . regress test_score class_size el_pct meal_pct calw_pct, robust
Linear regression Number of obs = 420 F(4, 415) = 361.68 Prob > F = 0.0000 R-squared = 0.7749 Root MSE = 9.0843
Robust test_score Coef. Std. Err. t P>|t| [95% Conf. Interval]
class_size -1.014353 .2688613 -3.77 0.000 -1.542853 -.4858534 el_pct -.1298219 .0362579 -3.58 0.000 -.201094 -.0585498 meal_pct -.5286191 .0381167 -13.87 0.000 -.6035449 -.4536932 calw_pct -.0478537 .0586541 -0.82 0.415 -.1631498 .0674424 _cons 700.3918 5.537418 126.48 0.000 689.507 711.2767 9 Testing 2 hypotheses on 2 or more coefficients
H0 : βmeal pct = 0 H1 : βmeal pct 6= 0
Step 1: βbmeal pct = −0.529 Step 2: SE βbmeal pct = 0.038
−0.529−0 Step 3: tmeal pct = 0.038 = −13.87 Step 4: Reject the null hypothesis at 5% significance level (| − 13.87| > 1.96 )
H0 : βcalw pct = 0 H1 : βcalw pct 6= 0
Step 1: βbcalw pct = −0.048 Step 2: SE βbcalw pct = 0.059
−0.048−0 Step 3: tcalw pct = 0.059 = −0.82 Step 4: Don’t reject the null hypothesis at 5% significance level (| − 0.82| < 1.96 ) 10 Testing 1 hypothesis on 2 or more coefficients
Suppose we want to test hypothesis that both the coefficient on % eligible for a free lunch and the coefficient on % eligible for calworks are zero?
H0 : βmeal pct = 0 & βcalw pct = 0 H1 : βmeal pct 6= 0 and/or βcalw pct 6= 0
What if we reject H0 if either tmeal pct or tcalw pct exceeds 1.96 (5% sign. level)?
• If tmeal pct and tcalw pct are uncorrelated: