Editorial Manager(tm) for Journal of Productivity Analysis Manuscript Draft

Manuscript Number:

Title: A Stochastic Frontier Model with Correction for Sample Selection

Article Type: Original Research

Keywords: Stochastic Frontier; Sample Selection; Simulation;

Corresponding Author: Dr William Greene, PhD

Corresponding Author's Institution: New York University, Stern School of Business

First Author: William Greene, PhD

Order of Authors: William Greene, PhD

Manuscript Click here to download Manuscript: StochasticFrontier-Selection-JPA-Revised.doc Click here to view linked References 1 2 3 4 A Stochastic Frontier Model with Correction for Sample Selection 5 * 6 William Greene 7 Department of , Stern School of Business, 8 New York University, 9 March, 2008 10 Revised April, 2009 11 12 ______13 14 Abstract 15 16 Heckman’s (1976, 1979) sample selection model has been employed in three decades of 17 18 applications of studies. This paper builds on this framework to obtain a 19 20 sample selection correction for the stochastic frontier model. We first show a 21 22 surprisingly simple way to estimate the familiar normal-half normal stochastic frontier 23 24 model using maximum simulated likelihood. We then extend the technique to a 25 stochastic frontier model with sample selection. In an application that seems 26 27 superficially obvious, the method is used to revisit the World Health Organization data 28 29 [WHO (2000), Tandon et al. (2000)] where the sample partitioning is based on OECD 30 31 membership. The original study pooled all 191 countries. The OECD members appear 32 33 to be discretely different from the rest of the sample. We examine the difference in a 34 35 sample selection framework. 36 37 38 39 JEL classification: C13; C15; C21 40 41 Keywords: Stochastic Frontier, Sample Selection, Simulation, Efficiency 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 * 44 West 4 th St., Rm. 7-78, New York, NY 10012, USA, Telephone: 001-212-998-0876; e-mail: 60 [email protected] , URL pages.stern.nyu.edu/~wgreene . 61 62 63 1 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 5 1 Introduction 6 7 Heckman’s (1976, 1979) sample selection model has been employed in three decades of 8 9 applications of linear regression studies. Numerous applications have extended 10 Heckman’s approach to nonlinear settings such as the binary and Poisson 11 12 regression models. The first is Wynand and van Praag’s (1981) development of a probit 13 14 model for insurance purchase. Among a number of other recent applications, Bradford et 15 16 al. (2001) extended Heckman’s method to a stochastic frontier model for hospital costs. 17 18 The familiar approach in which a sample selection correction term is simply added to the 19 20 model of interest (see (7) and (8)) is not appropriate for nonlinear models such as the 21 stochastic frontier. In this study, we build on the maximum likelihood estimator of 22 23 Heckman’s sample selection corrected linear model and the extension to nonlinear 24 25 models by Terza (1996, 2009) to obtain a counterpart for the stochastic frontier model. 26 27 We first show a surprisingly simple way to estimate the familiar normal-half normal 28 29 stochastic frontier model using maximum simulated likelihood. The next step is to 30 31 extend the technique to a stochastic frontier model in the presence of sample selection. 32 The method is used to revisit the World Health Organization (2000) data [see also 33 34 Tandon et al. (2000)] where the sample partitioning is based on OECD membership. The 35 36 original study pooled all 191 countries (in a panel, albeit one with negligible within 37 38 groups variation). The OECD members appear to be discretely different from the rest of 39 40 the sample. We examine the difference in a sample selection framework. 41 42 43 2. A Selection Corrected Stochastic Frontier Model 44 45 The stochastic frontier model of Aigner, Lovell and Schmidt (1977) (ALS) is 46 47 specified with 48 49 β 50 yi = ββ′xi + vi - ui 51 where ui = | σuUi| = σu |Ui|, Ui ~ N[0,1], (1) 52 vi = σvVi , Vi ~ N[0,1]. 53 54 55 A vast literature has explored variations in the specification to accommodate, e.g., 56 , panel data formulations, etc. 1 It will suffice for present purposes to 57 58 59 60 1 See Greene (2008a) for further development of the model and a survey of extensions and applications. 61 62 63 2 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 5 work with the simplest form. Extensions will be considered later. The model can be 6 estimated by modifications of ordinary [e.g., Greene (2008a)], the 7 8 generalized method of moments [Kopp and Mullahy (1990)] or, as is conventional in the 9 10 recent literature, by maximum likelihood (ALS). [A spate of Bayesian applications has 11 12 also appeared in the recent literature, e.g., Koop and Steel (2001).] In this study, we will 13 14 suggest, a fourth estimator, maximum simulated likelihood (MSL). The simulation based 15 16 estimator merely replicates the conventional estimator for the base case, in which the 17 closed form is already available. The log for the sample selection 18 19 model does not exist in closed form, so some approximation method, such as MSL is 20 21 necessary. 22 23 24 2.1 Maximum Likelihood Estimation of the Stochastic Frontier Model 25 26 The log likelihood for the normal-half normal model for a sample of N 27 28 observations is 29 30 βββ σ λ N  1( 2) − σ− 1 ε σ2 + Φ−γε σ  log L( , , ) = ∑ logπ log (/)i log( i /) (2) 31 i=1  2 2  32 where εi = yi - βββ′xi = vi – ui, 33 γ = σ /σ , 34 u v σ σ2 + σ 2 35 = v u 36 37 Φ 38 and (.) denotes the standard normal cdf. The density satisfies the standard regularity 39 40 conditions, and maximum likelihood estimation of the model is a conventional problem 41 handled with familiar methods. Estimation is straightforward and has been installed in 42 43 the menu of supported techniques in a variety of programs including LIMDEP , and 44 45 TSP .2 46 47 Conditioned on ui, the central equation of the model in (2.1) would be a classical 48 49 linear regression model with normally distributed disturbances. Thus, 50 51 −1 −βββ′ +σ2 σ 2 52 exp[2 (yix iui |U |) / v ] f(yi|xi,|Ui|) = . (3) 53 σ2 π 54 v 55 56 57 58 59 2 Details on maximum likelihood estimation of the model can be found in ALS and elsewhere, e.g., Greene 60 (2008b, Ch. 16). 61 62 63 3 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 5 The unconditional log likelihood for the model is obtained by integrating the unobserved 6 random variable, |U |, out of the conditional density. Thus, 7 i 8 9 −1 −βββ′ +σ2 σ 2 10 exp[2 (yix iui |U |) / v ] f(yi|xi) = ∫ p(| Ui |) dU | i | , 11 |Ui | σ π v 2 12 φ 13 (|Ui |) 1 2 2 where p(| Ui|) = =exp[ − |U |] , | Ui| > 0, (4) 14 Φ(0) 2 i π 15 N then log L(βββ,σ ,σ ) = logf ( y |x ), 16 u v ∑i=1 i i 17 18 φ Φ 19 where is the standard normal density and is the standard normal cdf . The closed 20 form of the integral appears in (2). 3 Consider using simulation to approximate the 21 22 integrals; 23 24 −1 −βββ′ +σ2 σ 2 1 exp[2 (yix i uir |U |) / v ] 25 f(yi|xi) ≈ ∑ , (5) r=1 σ π 26 R v 2 27 28 where Uir is R random draws from the standard normal population. (There is no closed 29 30 form for the extension of the model that appears below.) The simulated log likelihood is 31 32 33 1 2 2 34 N 1 R exp[− (y −βββ′x +σ |U |) / σ ]  βββ σ σ 2 i i uir v 35 logLS (, u , v ) = ∑ log  ∑ =  . (6) i=1 R r 1 σ2 π  36  v  37 38 39 The maximum simulated likelihood estimators of the model parameters are obtained by 40 4 41 maximizing this function with respect to the unknown parameters. 42 43 44 2.2 Sample Selection in the Linear Model 45 46 Heckman’s (1979) sample selection model for the linear regression case is 47 48 specified as 49 50 51 di = 1[ ααα′zi + wi > 0], wi ~ N[0,1] 52 2 yi = βββ′xi + εi, εi ~ N[0, σε ] 53 ε ρσ σ 2 54 (wi, i) ~ N 2[(0,1), (1, ε, ε )] (7) 55 (yi,xi) observed only when di = 1. 56 57 58 3 See Weinstein (1964). 59 4 See Gourieroux and Monfort (1996), Train (2003), Econometric Software (2007), Greene (2008b) and 60 Greene and Misra (2004). 61 62 63 4 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 5 Two familiar methods have been developed for estimation of the model parameters. 6 Heckman’s (1979) two step, limited information method builds on the result 7 8 9 E[yi|xi,di=1] = βββ′xi + E[ εi|di=1] 10 β ρσ φ α Φ α 11 = ββ′xi + ε (αα′zi)/ (αα′zi) (8) 12 = βββ′xi + θλi. 13 14 In the first step, ααα in the probit equation is estimated by unconstrained single equation 15 16 maximum likelihood and the inverse (IMR), λ=φˆ (αˆ′z )/( Φ α ˆ ′ z ) is computed 17 i i i 18 for each observation. The second step in Heckman’s procedure involves linear regression 19 20 of y on the augmented regressor vector, x * = (x , λˆ ), using the observed subsample, with 21 i i i i 22 ααα 23 a correction of the OLS standard errors to account for the fact that an estimate of is 24 used in the constructed regressor. 25 26 The full information maximum likelihood estimator for the model is developed in 27 28 Heckman (1976) and Maddala (1983). The log likelihood function for the sample 29 30 selection model is 31 32 33 1 ′ 2 2  exp(− ((y −βββ x ) / σ ε ) )  34 2 i i ×   σ π 35 ε 2   N d   +  36 β σ αρ = i log(,L ββ ε ,,) log ρ−β′ σ+ α ′   37 ∑ i=1 ((y x )/)ε z  Φi i i   38 2   1− ρ    (9) 39   − Φ − ααα′ 40 (1di ) (z i )  41     42 N 1 ε  ( ρε / σ + ααα′z = φi Φ  iε i  + − Φ− ααα′  43 ∑ = log di   (1di ) (z i ) . i 1  σ σ − ρ 2   44  ε ε  1   45 46 This has become a conventional, if relatively less frequently used estimator that is built 47 48 into most contemporary software. 49 50 51 2.3 Estimating a Stochastic Frontier Model with Sample Selection. 52 53 54 The received literature contains many studies in which authors, have extended 55 Heckman’s selectivity model to nonlinear settings, such as (e.g., Poisson 56 57 regression – Greene (1994)), , and binary choice models. The first 58 59 application of the sample selection treatment in a nonlinear setting was Wynand and van 60 61 62 63 5 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 5 Praag’s (1981) development of a probit model for binary choice. The typical approach 6 taken to control for selection , motivated by (8), is to fit the probit model in (7), as in 7 8 the first step of Heckman’s two step estimator, then append λˆ (from (8)) to the linear 9 i 10 index part of the nonlinear model wherever it happens to appear. The approach is 11 12 inappropriate. The term λˆ in (8) arises as E[ ε |d =1] in a linear model. The expectation 13 i i i 14 βββ ε 15 of some nonlinear g( ′xi + i) subject to selection will generally not produce the form 16 βββ ε βββ θλ 17 E[ g( ′xi + i)| di=1] = g( ′xi + i) which can then be carried back into the otherwise 18 unchanged nonlinear model. See, e.g., Terza (1994, 1996, 1998) who develops the result 19 20 in detail for nonlinear regressions such as the exponential conditional case. 21 22 Indeed, in some cases, such as the probit and count data models, the εi for which the 23 24 expectation given di = 1 is taken does not even appear in the original model; it is unclear 25 26 as such what the correction is correcting. 27 28 The distribution of the observed random variable conditioned on the selection will 29 generally not be what it was without the selection (with or without the addition of the 30 31 inverse Mills ratio, λ to the index function). Thus, the addition of λ to the original 32 i i 33 likelihood function generally does not produce the appropriate log likelihood in the 34 35 presence of the sample selection. This can be seen even for the linear case in (9). The 36 37 least squares estimator of βββ (with λi added to the equation) is not the MLE in (9); it is 38 39 merely a feasible consistent estimator. Two well worked out specific cases do appear in 40 41 the literature. Maddala (1983) and Boyes, Hoffman and Lowe (1989) obtained the 42 appropriate closed form log likelihood for a probit model subject to sample selection. 43 44 The resulting formulation is a type of bivariate probit model, not a univariate probit 45 46 model based on ( xi,λi). Another well known example is the open form result for the 47 48 model obtained by Terza (1996,1998). 5 49 50 The combination of efficiency estimation and sample selection appears in several 51 52 studies. Bradford, et al. (2001) studied patient specific costs for cardiac revascularization 53 54 in a large hospital. They state “... the patients in this sample were not randomly assigned 55 to each treatment group. Statistically, this implies that the data are subject to sample 56 57 . Therefore, we utilize a standard Heckman two-stage sample-selection 58 59 60 5 See, also, Winkelman (1998). 61 62 63 6 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 5 process, creating an IMR from a first-stage probit estimator of the likelihood of CABG or 6 PTCA. This correction variable is included in the frontier estimate....” (page 306). 6 7 8 Sipiläinen and Oude Lansink (2005) have utilized a stochastic frontier, translog 9 10 model to analyze technical efficiency for organic and conventional farms. They state 11 12 “Possible selection bias between organic and conventional production can be taken into 13 14 account [by] applying Heckman’s (1979) two step procedure.” (Page 169.) In this case, 15 16 the inefficiency component in the stochastic frontier translog distance function is 17 distributed as the truncation at zero of a U with a heterogeneous mean. 7 The IMR is 18 i 19 added to the deterministic (production function) part of the frontier function. 20 21 Other authors have acknowledged the sample selection issue in stochastic frontier 22 23 studies. Kaparakis, Miller and Noulas (1994) in an analysis of commercial banks and 24 25 Collins and Harris (2005) in their study of UK chemical plants both suggested that 26 27 “sample selection” was a potential issue in their analysis. Neither of these formally 28 modified their stochastic frontier models to accommodate the result, however. 29 30 If, to motivate the sample selection treatment, we specify that the unobservables 31 32 in the selection model are correlated with the noise in the stochastic frontier model, then 33 34 the stochastic frontier model with sample selection can be cast as an extension of 35 36 Heckman’s specification for the linear regression model. The combination of the models 37 in (1) and (7) is 38 39 40 ααα 41 di = 1[ ′zi + wi > 0], wi ~ N[0,1] 2 42 yi = βββ′xi + εi, εi ~ N[0, σε ] 43 (yi,xi) observed only when di = 1. 44 εi = vi - ui 45 σ σ 46 ui = | uUi| = u |Ui| where Ui ~ N[0,1] (10) 47 vi = σvVi where Vi ~ N[0,1]. 48 2 (wi,v i) ~ N2[(0,1), (1, ρσv, σv )] 49 50 51 The conditional density for an observation in this specification is 52 53 6 The authors opt for a GMM estimator based on Kopp and Mullahy’s (1990) (KM) relaxation of the 54 distributional assumptions in the standard frontier model. It is suggested, that KM “find that the traditional 55 maximum likelihood estimators tend to overestimate the average inefficiency.” (Page 304.) KM did not, in 56 fact, make the latter argument, and we can find no evidence to support it in the since received literature. 57 KM’s support for the GMM estimator is based on its more general, distribution free specification. We do 58 note Newhouse (1994), whom Bradford et al cite, has stridently argued against the stochastic frontier model 59 as well, but not based on the properties of the MLE. 60 7 See Battese and Coelli (1995). 61 62 63 7 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4

5  −1 −βββ′ +σ2 σ 2   6 exp( 2 (yix iui | U |) / v ) ) 7  ×    σ π  8 v 2   + 9 di   f (yi|xi,|Ui|, zi,di,) =  ρ−(yβ′x +σ | U |) / σ+ α ′ z    (11) 10  Φi iuiε i   2   11  1− ρ    12   − Φ − ααα′ 13  (1di ) (z i )  14 15 σ 16 Save for the appearance of the unobserved inefficiency term, u|Ui|, (11) is the same as 17 (9). Terza (1996, 2009) develops the log likelihood function for a generic extension of 18 19 Heckman’s result in (9) to nonlinear models. The result in (11) shows an application to 20 21 the stochastic frontier case – see (34:SS) in Terza (2009). 22 23 Sample selection arises as a consequence of the correlation of the unobservables 24 25 in the production or cost equation, vi, with those in the sample selection equation, wi. 26 27 Two other applications of this general approach to modeling sample selection or 28 endogenous switching in the stochastic frontier model have appeared in the recent 29 30 literature. In Kumbhakar, Tsionas and Similainen (2009), the model framework is very 31 32 similar to that in (10), but the selection mechanism is assumed to operate through ui 33 34 rather than vi. In particular, the disturbance in their counterpart to the equation for di is 35 36 wi + δui; in essence, the inefficiency in the production process produces an “inclination” 37 38 towards, in their case, organic farming. In Lai, Polachek and Wang’s (2009) application 39 40 to a wage equation, the wi in the selection mechanism is correlated (through a copula 41 function) with ε , not specifically with v or u . In both of these cases, the log likelihood is 42 i i i 43 substantially more complicated than the one used here. More importantly, the difference 44 45 in the assumption of the impact of the selection effect is substantive. 46 47 The log likelihood for the model in (10) is formed by integrating out the 48 49 unobserved | Ui| then maximizing with respect to the unknown parameters. Thus, as in (4) 50 51 and (5), 52 53 N 54 log L(βββ,σu,σv,ααα,ρ) = logfy ( |x ,, z dUpUdU ,| |)(| |)| | . (12) ∑i=1 ∫ |U | iiiii i i 55 i 56 57 The integral in (12) is not known; it must be approximated. The simulated log likelihood 58 59 function is 60 61 62 63 8 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 5  exp(−1 (y −βββ′x +σ | U |)/)2 σ 2 )    2 i i uir v × 6 di   σ2 π  7   v  8 N1 R  ρ−β′ +σ σ+ α ′   9 βββ σ σ ααα ρ (yβx ||)/ U ε α z log LS( , u, v, , )= ∑=log ∑ =  Φi i u ir i   . (13) i1R r 1 10  1− ρ 2   11     12   13   +(1 −d ) Φ− (ααα′z ) 14  i i  15 16 To simplify the estimation, we will use a two step approach. The single equation 17 18 MLE of ααα in the probit equation in (7) is consistent, albeit inefficient. For purposes of 19 20 estimation of the parameters of the stochastic frontier model, however, ααα need not be 21 22 reestimated. We take the estimates of ααα as given in the simulated log likelihood in (13), 23 24 then use the Murphy and Topel (2002) correction to adjust the standard errors in 25 26 essentially the same fashion as Heckman’s correction of the canonical selection model in 27 28 (8). Thus, the conditional simulated log likelihood function is 29 30  −1 −βββ′ +σ2 σ 2   31 exp( 2 (yix i uir | U |) / v ) ) 32  ×    σ π   33 v 2  N1 R d    34 log L (βββ,σ ,σ , ρ)= log i   . (14) S,C u v ∑i=1 ∑ r = 1  ρ−(yβββ′x +σ | U |) / σ+ a   35 R  Φi iuirε i   2   36  1− ρ    37  + − Φ−  38  (1di ) ( a i )  39 40 where a = αααˆ ′z . With this simplification, the nonselected observations (those with d = 0) 41 i i i 42 do not contribute information about the parameters to the simulated log likelihood. Thus, 43 44 the function we maximize becomes 45 46 47 exp(−1 (y −βββ′x +σ | U |)2 / σ 2 ) )  48 2 i i uir v ×  49 σ π 1 R v 2  50 βββ σ σ ρ log LS,C ( , u, v, ) = ∑=log ∑ =   . (15) 51 di 1R r 1 ρ−βββ′ +σ σ+   (yix i u ||)/ U irε a i 52 Φ   − ρ 2  53 1   54 55 The parameters of the model are estimated using a conventional gradient based approach, 56 57 the BFGS method. We use the BHHH estimator to estimate the asymptotic standard 58 59 errors for the parameter estimators. When ρ equals zero, the maximand reduces to that of 60 61 62 63 9 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 5 the maximum simulated likelihood estimator of the basic frontier model shown earlier. 6 This provides us with a method of testing the specification of the selectivity model 7 8 against the simpler model using a (simulated) likelihood ratio test. 9 10 11 2.4 Estimating Observation Specific Inefficiency 12 13 14 The end objective of the estimation process is to characterize the inefficiency in 15 the sample, u or the efficiency , exp(-u ). Aggregate summary measures, such as the 16 i i 17 sample mean and are often provided (e.g., Bradford, et al. (2001) for hospital 18 19 costs). Researchers also compute individual specific estimates of the conditional 20 21 based on the Jondrow et al. (1982) (JLMS) result, 22 23 σλ φµ  −λε 24 (i ) i E[|] u ε= µ+  , µ= , εi = yi - βββ′xi. (16) 25 ii1+λ2 i Φµ ( ) i σ 26 i  27 28 The standard approach computes this function after estimation based on the maximum 29 30 likelihood estimates. In principle, we could repeat this computation with the maximum 31 32 simulated likelihood estimates. An alternative approach takes advantage of the 33 34 simulation of the values of ui during estimation. Using Bayes theorem, we can write 35 36 pu(,)ε p (|)() ε upu 37 p(|) u ε = ii = ii i . (17) i i p(ε ) ε 38 i ∫ p(ii | u )() pudu i i 39 ui 40 41 Recall ui = σu|Ui|. Thus, equivalently, 42 43 pU[(σε | |),] p [|( εσ | UpU |)]( σ | |) 44 p[(σ | U |) |] ε = uii = iui ui . (18) u i i ε εσ σ σ 45 p(i ) p[ |( | UpUd |)]( | |)( | U |) ∫ u iui ui ui 46 i 47 48 The desired expectation is, then 49 50 51 σ εσ σ σ (ui |Up |)[ iui |( | Up |)]( ui | Ud |)( ui | U |) 52 ∫ σ |U | σ ε = u i E[(u | U i |) | i ] . (19) 53 p([εσ |( | UpUdU |)]( σ | |)( σ | |) ∫ σ iui ui ui 54 u |U i | 55 56 These are the terms that enter the simulated log likelihood for each observation. The 57 58 simulated denominator would be 59 60 61 62 63 10 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 −1 −βββˆ′ +σ2 σ 2  5 exp( 2 (yix iˆ uir | U |)/) ˆ v ) 6 ×  σ π  7 1 R ˆ v 2 1 R Bˆ =   = fˆ (20) 8 i ∑r=1 ∑ r = 1 ir R ρ−ˆ(yβββˆ′x +σˆ ||)/ U σ+ ˆ a   R 9 Φi i u irε i   2  10 1− ρˆ   11 12 13 ˆ =1 R σ ˆ ε while the numerator is simulated with Ai∑ (ˆ u | U ir |) f ir . The estimate of E[ ui| i] is 14 R r=1 15 then 16 17 R fˆ ˆ ˆ =σ =

5 Table 1 Descriptive Statistics for WHO Variables, 1997 Observations* 6 7 Non -OECD OECD All 8 Mean Std. Dev. Mean Std. Dev. Mean Std. Dev 9 COMP 70.30 10.96 89.42 3.97 73.30 12.34 10 HEXP 249.17 315.11 1498.27 762.01 445.37 616.36 11 EDUC 5.44 2.38 9.04 1.53 6.00 2.62 12 GINI 0.399 0.0777 0.299 0.0636 0.383 0.0836 13 VOICE -0.195 0.794 1.259 0.534 0.0331 0.926 14 GEFF -0.312 0.643 1.166 0.625 -0.0799 0.835 15 TROPICS 0.596 0.492 0.0333 0.183 0.508 0.501 16 POPDEN 757.9 2816.3 454.56 1006.7 710.2 2616.5 17 PUBFIN 56.89 21.14 72.89 14.10 59.40 20.99 18 GDPC 4449.8 4717.7 18199.07 6978.0 6609.4 7614.8 19 Efficiency 0.5904 0.2012 0.8831 0.0783 0.6364 0.2155 20 161 30 191 21 Sample 22 * Variables in the data set are as follows: 23 COMP = WHO health care attainment measure. 24 HEXP = Per capita health expenditure in PPP units. 25 EDUC = Average years of formal education. 26 GINI = World bank measure of income inequality. 27 VOICE = World bank measure of democratization. 28 GEFF = World bank measure of government effectiveness. 29 TROPICS = Dummy variable for tropical location. 30 POPDEN = Population density in persons per square kilometer. 31 PUBFIN = Proportion of health expenditure paid by government. 32 GDPC = Per capita GDP in PPP units. 33 Efficiency = TMLE estimated efficiency from fixed effects model. 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 18 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 Table 2 Estimated Stochastic Frontier Models a (Estimated standard errors in parentheses) 5 Non -OECD Co untries OECD Countries 6 7 Stochastic Sample Selection Stochastic Sample Selection Frontier Frontier 8 3.76162 3.74915 3.10994 3.38244 9 Constant (0.05429) (0.05213) (1.15519) (1.42161) 10 0.08388 0.08842 0.04765 0.04340 11 LogHexp (0.01023) (0.010228) (0.006426) (0.008805) 12 0.09096 0.09053 1.00667 0.77422 13 LogEduc (0.075150) (0.073367) (1.06222) (1.2535) 14 0.00649 0.00564 -0.23710 -0.18202 15 2 Log Educ (0.02834) (0.02776) (0.24441) (0.28421) 16 0.12300 0.12859 0.02649 0.01509 17 σσσu 0.05075 0.04735 0.00547 0.01354 18 σσσv 19 λλλ 2.42388 2.71549 4.84042 1.11413 20 σσσ 0.13306 0.13703 0.02705 0.02027 21 0.63967 -0.73001 22 0.0000 0.0000 ρρρ (1.4626) (0.56945) 23 24 logL 160.2753 161.0141 62.96128 65.44358 25 LR test 1.4776 4.9646 26 N 161 30 27 aThe estimated probit model for OECD membership (with estimated standard errors in parentheses) is 28 OECD = -8.2404 (3.369) + 0.7388LogPerCapitaGDP (0.3820) 29 + 0.6098GovernmentEffectiveness (0.4388) 30 + 0.7291Voice (0.3171) 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 19 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4

5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 Figure 1 Efficiency Scores Related to Per Capita GDP. 27 Larger points indicate OECD members 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 20 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4

5 6 7 8 Efficiencies from Selection SF Model vs. WHO Estimates 9 1.00 10 11 12 13 .80 14 15 16 17 18 .60 19 20 21 WHOEFF WHOEFF 22 .40 23 24 25 26 .20 27 28 29 30 .00 31 .650 .700 .750 .800 .850 .900 .950 1.000 32 33 EFFSEL 34 35 36 37 Figure 2 Estimated Efficiency Scores 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 21 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 5 6 Simulation vs. Plug-in Efficiency Estimates 7 1.000 8 9 .950 10 11 .900 12 13 .850 14 15 .800 16 17 .750 EFFJLMS EFFJLMS 18 19 .700 20 21 .650 22 23 .600 24 25 .550 26 .650 .700 .750 .800 .850 .900 .950 1.000 27 EFFSIM 28 29 30 31 Figure 3 Alternative Estimators of Efficiency Scores 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 22 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4

5 6 7 8 9 175 10 11 12 13 140 14 15 16 105 17 18 19 RANKW RANKW 20 70 21 22 23 24 35 25 26 27 28 0 29 0 35 70 105 140 175 30 RANKS 31 32 33 34 Figure 4 Ranks of Countries Based on WHO and 35 Simulation Efficiency Estimates 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 23 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 5 References 6 7 Aigner, D., K. Lovell, and P. Schmidt, 1977, “Formulation and Estimation of Stochastic 8 Frontier Production Function Models,” Journal of , 6, pp. 21-37. 9 Battese, G. and T. Coelli, 1995, “A Model for Technical Inefficiency Effects in a 10 11 Stochastic Frontier Production for Panel Data,” Empirical Economics , 20, pp. 325-332. 12 Bradford, D., Kleit, A., Krousel-Wood, M. and Re, R., “Stochastic Frontier Estimation of 13 Cost Models within the Hospital,” Review of Economics and Statistics , 83, 2, 2001, 14 pp. 302-309. 15 Econometric Software, Inc., LIMDEP Version 9.0 , Plainview, New York, 2007. 16 17 Collins, A. and R. Harris, “The Impact of Foreign Ownership and Efficiency on Pollution 18 Abatement Expenditures by Chemical Plants: Some UK Evidence,” Scottish Journal 19 of Political Economy , 52, 5, 2005, pp. 757-768. 20 Gourieroux, C. and A. Monfort, Simulation Based Econometric Methods , Oxford: Oxford 21 22 University Press, 1996. 23 Gravelle H, Jacobs R, Jones A, Street, “Comparing the Efficiency of National Health 24 Systems: Econometric Analysis Should Be Handled with Care,” University of York, 25 Health Economics Unit, UK. Manuscript , 2002a. 26 Gravelle H, Jacobs R, Jones A, Street, “Comparing the Efficiency of National Health 27 28 Systems: A Sensitivity Approach,” University of York, Health Economics Unit, 29 Manuscript, UK, 2002b. 30 Greene, W., 1994, "Accounting for Excess Zeros and Sample Selection in Poisson and 31 Negative Models," Stern School of Business, NYU, Working 32 Paper EC-94-10. 33 34 Greene, W., 2004, “Distinguishing Between Heterogeneity and Inefficiency: Stochastic 35 Frontier Analysis of the World Health Organization’s Panel Data on National Health 36 Care Systems,” Health Economics , 13, pp. 959-980. 37 Greene, W., “The Econometric Approach to Efficiency Analysis,” in K Lovell and S. 38 Schmidt, eds. The Measurement of Efficiency , H Fried, Oxford University Press, 2008a. 39 th 40 Greene, W., Econometric Analysis, 6 ed ., Prentice Hall, Englewood Cliffs, 2008b. 41 Greene, W. and S. Misra, “Simulated Maximum Likelihood Estimation of the Stochastic 42 Frontier Model,” Manuscript, Department of Marketing, University of Rochester, 2004. 43 Heckman J., “Discrete, Qualitative and Limited Dependent Variables” Annals of 44 45 Economic and Social Measurement, 4, 5, 1976, pp. 475-492. 46 Heckman, J. “Sample Selection Bias as a Specification Error.” , 47, 1979, 47 pp. 153–161. 48 Hollingsworth J, Wildman B., 2002, The Efficiency of Health Production: Re-estimating the 49 50 WHO Panel Data Using Parametric and Nonparametric Approaches to Provide 51 Additional Information. Health Economics , 11, pp. 1-11. 52 Jondrow, J., K. Lovell, I. Materov, and P. Schmidt, 1982, “On the Estimation of 53 Technical Inefficiency in the Stochastic Frontier Production Function Model,” 54 Journal of Econometrics , 19, pp. 233-238. 55 56 Kaparakis, E., S. Miller and A. Noulas, “Short Run Cost Inefficiency of Commercial 57 Banks: A Flexible Stochastic Frontier Approach,” Journal of Money, Credit and 58 Banking , 26, 1994, pp. 21-28. 59 60 61 62 63 24 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 5 Kopp, R. and J. Mullahy, “-based Estimation and Testing of Stochastic Frontier 6 Models,” Journal of Econometrics , 46, 1/2, 1990, pp. 165-184. 7 Koop, G. and M. Steel, “Bayesian Analysis of Stochastic Frontier Models,” in B. Baltagi, 8 ed., Companion to Theoretical Econometrics , Blackwell Publishers, Oxford, 2001. 9 Kumbhakar, S., M. Tsionas and T. Sipilainen, “Joint Estimation of Technology Choice and 10 11 Technical Efficiency: An Application to Organic and Conventional Dairy Farming,” 12 Journal of Productivity Analysis , 31, 3, 2009, pp. 151-162. 13 Lai, H., S. Polachek and H. Wang, “Estimation of a Stochastic Frontier Model with a 14 Sample Selection Problem,” Working Paper, Department of Economics, National 15 16 Chung Cheng University, Taiwan, 2009. 17 Maddala, G., Limited Dependent and Qualitative Variables in Econometrics, Cambridge: 18 Cambridge University Press, 1983. 19 New York Times , Editorial: “World’s Best Medical Care?” August 12, 2007. 20 Newhouse, J., “Frontier Estimation: How Useful a Tool for Health Economics?” Journal of 21 22 Health Economics , 13, 1994, pp. 317-322. 23 Pitt, M., and L. Lee, 1981, “The Measurement and Sources of Technical Inefficiency in the 24 Indonesian Weaving Industry,” Journal of Development Economics , 9, pp. 43-64. 25 Rahman, S., A. Wiboonpongse, S. Sriboonchitta and Y. Chaovanapoonphol, 2009, 26 “Production Efficiency of Jasmine Rice Producers in Northern and North-eastern 27 28 Thailand, Journal of Agricultural Economics , online, pp. 1-17 (forthcoming). 29 Sipiläinen, T. and A. Oude Lansink, “Learning in Switching to Organic Farming,” Nordic 30 Association of Agricultural Scientists, NJF Report Volume 1, Number 1, 2005. 31 http://orgprints.org/5767/01/N369.pdf 32 33 Smith, M., “Modeling Sample Selection Using Archimedean Copulas,” Econometrics 34 Journal , 6, 2003, pp. 99-123. 35 Stevenson, R., 1980, “Likelihood Functions for Generalized Stochastic Frontier 36 Estimation,” Journal of Econometrics , 13, pp. 58-66. 37 Tandon, A., C. Murray, J. Lauer and D. Evans, “Measuring the Overall Health System 38 39 Performance for 191 Countries,” World Health Organization, GPE Discussion Paper, 40 EIP/GPE/EQC Number 30, 2000. http://www.who.int/entity/healthinfo/paper30.pdf 41 Terza, J. 1994. "Dummy Endogenous Variables and Endogenous Switching in 42 Exponential Conditional Mean Regression Models," Manuscript, Department of 43 44 Economics, Penn State University. 45 Terza, J., “FIML, Method of Moments and Two Stage Method of Moments Estimators 46 for Nonlinear Regression Models with Endogenous Switching and Sample Selection,” 47 Working Paper, Department of Economics, Penn State University, 1996. 48 Terza, J. “Estimating Count Data Models with Endogenous Switching: Sample Selection 49 50 and Endogenous Treatment Effects.” Journal of Econometrics, 84, 1, 1998, pp. 129– 51 154. 52 Terza, J.V. "Parametric Nonlinear Regression with Endogenous Switching," Econometric 53 Reviews , 28, 2009, pp. 555-580. 54 55 Train, K., Discrete Choice Methods with Simulation , Cambridge: Cambridge University 56 Press, 2003. 57 Tsionas, E., S. Kumbhakar and W. Greene, “Non-Gaussian Stochastic Frontier Models,” 58 Manuscript, Department of Economics, University of Binghamton, 2008. 59 60 61 62 63 25 64 65 1 A Stochastic Frontier Model with Correction for Sample Selection 2 3 4 5 Weinstein, M., 1964, `The Sum of Values from a Normal and a Truncated Normal 6 Distribution,' Technometrics , 6, pp. 104-105, 469-470. 7 Winkelmann, R. “Count Data Models with Selectivity,” Econometric Reviews , 4, 17, 8 1998, pp. 339-359. 9 World Health Organization, The World Health Report , WHO, Geneva, 2000 10 11 Wynand, P., and B. van Praag. “The Demand for Deductibles in Private Health 12 Insurance: A Probit Model with Sample Selection.” Journal of Econometrics, 17, 13 1981, pp. 229–252. 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 26 64 65 *Title Page with Author(s)' Contact Information

1 2 3 4 A Stochastic Frontier Model with Correction for Sample Selection 5 * 6 William Greene 7 Department of Economics, Stern School of Business, 8 New York University, 9 March, 2008 10 ______11 12 13 Abstract 14 15 Heckman’s (1979) sample selection model has been employed in three decades of 16 17 applications of linear regression studies. The formal extension of the method to nonlinear 18 19 models, however, is of more recent vintage. A generic solution for nonlinear models is 20 21 proposed in Terza (1998). We have developed simulation based approach in Greene 22 (2006). This paper builds on this framework to obtain a sample selection correction for 23 24 the stochastic frontier model. We first show a surprisingly simple way to estimate the 25 26 familiar normal-half normal stochastic frontier model (which has a closed form log 27 28 likelihood) using maximum simulated likelihood. The next step is to extend the 29 30 technique to a stochastic frontier model with sample selection. Here, the log likelihood 31 does not exist in closed form, and has not previously been analyzed. We develop a 32 33 simulation based estimation method for the stochastic frontier model. In an application 34 35 that seems superficially obvious, the method is used to revisit the World Health 36 37 Organization data [WHO (2000), Tandon et al. (2000)] where the sample partitioning is 38 39 based on OECD membership. The original study pooled all 191 countries. The OECD 40 41 members appear to be discretely different from the rest of the sample. We examine the 42 difference in a sample selection framework. 43 44 JEL classification: C13; C15; C21 45 46 47 Keywords: Stochastic Frontier, Sample Selection, Simulation, Efficiency 48 49 50 51 52 53 54 55 56 57 58 59 * 44 West 4th St., Rm. 7-78, New York, NY 10012, USA, Telephone: 001-212-998-0876; e-mail: 60 [email protected], URL www.stern.nyu.edu/~wgreene. 61 62 63 1 64 65