Econometrica, Vol. 73, No. 6 (November, 2005), 1723–1770

ESTIMATING THE EFFECTS OF A TIME-LIMITED EARNINGS SUBSIDY FOR WELFARE-LEAVERS

BY DAVID CARD AND DEAN R. HYSLOP1 In the Self Sufficiency Project (SSP) welfare demonstration, members of a randomly assigned treatment group could receive a subsidy for full-time work. The subsidy was available for 3 years, but only to people who began working full time within 12 months of random assignment. A simple optimizing model suggests that the eligibility rules created an “establishment” incentive to find a job and leave welfare within a year of random assignment, and an “entitlement” incentive to choose work over welfare once eligibility was established. Building on this insight, we develop an econometric model of welfare participation that allows us to separate the two effects and estimate the im- pact of the earnings subsidy on welfare entry and exit rates among those who achieved eligibility. The combination of the two incentives explains the time profile of the experi- mental impacts, which peaked 15 months after random assignment and faded relatively quickly. Our findings suggest that about half of the peak impact of SSP was attribut- able to the establishment incentive. Despite the extra work effort generated by SSP, the program had no lasting impact on wages and little or no long-run effect on welfare participation.

KEYWORDS: Welfare reform, state dependence, Self Sufficiency Project.

OVER THE PAST DECADE the United States, Great Britain, and other countries have reformed their income support systems to enhance the financial incen- tives for work (see, e.g., Blundell and Hoynes (2004)). Traditional means-tested welfare programs impose high tax rates on program participants, reducing or even eliminating the payoff to work. Many analysts have argued that the re- sult is a dynamic welfare trap. Once in the system, welfare recipients have little incentive to work, and the subsequent erosion of their skills and work habits makes it less likely they can leave in the future.2 In the early 1990s the Canadian government funded an innovative demon- stration project—the Self Sufficiency Project (SSP)—designed to test whether a time-limited earnings subsidy could help long-term welfare recipients make a permanent break from program dependency. Unlike the earnings subsidy

1We are grateful to SDRC for research support and for making the SSP data available, and to three anonymous referees, Costas Meghir, Douglas Tattrie, Ken Chay, Guido Imbens, David Lee, Jack Porter, James Powell, Charles Michalopoulos, and seminar participants at UC Berke- ley, University of Melbourne, Princeton University, and University College London for helpful discussions. All conclusions in this paper are solely the responsibility of the authors and do not represent the opinions or conclusions of SDRC or the sponsors of the Self Sufficiency Project. An appendix with our computer programs and some additional results is available from the Sup- plemental Material website. 2See Plant (1984) for an explicit model of a dynamic welfare trap. The idea that welfare partic- ipation creates a dependency trap is very old. For example, Hexter (1917) analyzed the duration of relief spells at a private charity and found that people on relief longer had a lower likelihood of leaving. High implicit tax rates also create incentives to participate in the underground sector: see Fortin, Fechette, and Lemieux (1994).

1723 1724 D. CARD AND D. R. HYSLOP programs of the United States and United Kingdom, SSP was only available for full-time work.3 Moreover, participants had to begin receiving SSP within a year of being informed of their potential eligibility—otherwise they lost all future eligibility. Those who met the deadline were entitled to 3 years of eligi- bility and could move back and forth between welfare and work, receiving the subsidy whenever they were working full time. At the end of the 3-year period they returned to the regular welfare environment. The SSP evaluation used a randomized design. One-half of a group of long-term welfare recipients was offered the supplement, while the other half remained in the regular welfare system. Data were collected for 6 years to mea- sure the short-term impacts of the subsidy and any lasting effects once pay- ments ended. Comparisons between the treatment and control groups show that SSP had significant short-term impacts on welfare participation and work, raising the full-time employment rate and lowering welfare participation by 14 percentage points within the first 18 months of the experiment.4 The effects of SSP faded over time, however. By the third year after random assignment, the difference in welfare participation between the treatment and control groups had fallen to 7.5%, and by 69 months (a year and a half after all subsidy payments ended) the welfare participation rates of the two groups were equal. The key contribution of this paper is to identify, both theoretically and em- pirically, the combination of incentives created by the time-limited eligibility rules of SSP and to investigate the impact of SSP on welfare participation in light of these issues.5 We first develop a simple optimizing model that sug- gests that the rules of SSP generated both an “establishment” incentive to find a full-time job and leave welfare within a year of random assignment, and an “entitlement” incentive to choose work over welfare once eligibility was achieved. Simple experimental comparisons, while valid, confound these two

3The U.S. subsidy program, the Earned Credit (EITC), has no hours limitation, while the U.K. subsidy, currently known as the Working Tax Credit (WTC), requires only part- time work (16 hours per week). These programs are mainly available to low income families with children, although recent changes have extended entitlement to other individuals. Phelps (1994) has argued for a general wage subsidy program for all low wage workers. 4A complete final report on the experiment is available in Michalopoulos et al. (2002). This is a larger impact than was observed in other recent welfare reforms experiments in the United States For example, Hamilton et al. (2001) review the 5-year impacts of 11 alternative welfare-to-work experiments on employment and welfare outcomes, and report typical impacts on employment and welfare participation of less than half that reported for SSP. 5To the best of our knowledge, the implications for the dynamic nature of the incentives cre- ated by the time-limited eligibility rule has received little, if any, attention. For example, there is no discussion of this issue in the background paper by Greenberg, Meyer, Michalopoulos, and Robins (1992), which uses a static labor supply model to evaluate alternative subsidy parameters. Similarly, the SSP Implementation Report by Mijanovich and Long (1995, pp. 28–29) mentions six key questions that were addressed in the design of SSP, but the issue of time-limited eligibility is not discussed. The unusual nature of the incentives created by the time-limited eligibility is not discussed in the final SSP Recipient Report (Michalopoulos et al. (2002)) or any of the earlier reports. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1725 effects. Given the insights of this model, we then develop a dynamic economet- ric framework that combines the experimental randomization associated with SSP with parametric modeling to identify the impact of the selection and treat- ment effects of SSP on welfare transitions. (As noted by Ham and LaLonde (1996), even with a randomly assigned intervention the estimation of dynamic impacts requires a full specification of the process that generates individual welfare histories.) We begin by developing a suitable model of welfare entry and exit behav- ior among the SSP control group. We show that the control group’s outcomes are reasonably well described by a dynamic logistic model with second order state dependence and unobserved heterogeneity. We then augment this base- line model with treatment effects that represent the establishment and entitle- ment incentives of SSP, and a model of the eligibility process that accounts for the potential correlation between the probability of entering or leaving welfare and the probability of attaining SSP eligibility. Our empirical results suggest that the unusual time profile of the experi- mental impacts observed in the SSP demonstration arose from a combination of the short-term eligibility incentive and the longer-term entitlement incen- tives of the program. Indeed, our estimates attribute over one-half of the peak impact of SSP to the incentives created by the time limit on eligibility. This decomposition helps to reconcile the large but short-lived peak impacts of the SSP demonstration with the impacts observed in other experimental welfare reforms. Our estimates also suggest that members of the program group with a higher probability of leaving welfare were more likely to establish entitle- ment for SSP. As a result, differences between the observed transition rates of the SSP-eligible subgroup and the control group overstate the causal effect of the supplement, even in the later years of the experiment. Finally, our analysis of wage outcomes shows that SSP had no lasting effect on wages, despite the extra work effort engendered by the program’s incentives. Thus, while the pro- gram generated a short-term reduction in welfare dependency, it had little or no permanent effect on long-term self-sufficiency.

1. THE SSP DEMONSTRATION—DESCRIPTION AND OVERVIEW OF IMPACTS A. Income Assistance Programs and the SSP Recipient Experiment Under the regular welfare system available to low income families in Canada, known as Income Assistance (IA), payments are reduced dollar-for- dollar for any earnings beyond a modest set-aside amount.6 The implicit 100% tax rate on earnings, coupled with the availability of other benefits (e.g., free

6The IA program is operated at the provincial level, but all the provincial programs share several important features, including a dollar-for-dollar benefit reduction rate. See Human Re- sources and Development Canada (1993) for a detailed inventory and description of income support programs in Canada in the early 1990s. 1726 D. CARD AND D. R. HYSLOP dental services), reduces the incentives for welfare recipients to work more than a few hours per week. Rising case loads in the 1980s led to concerns that the Canadian welfare system was promoting long-term dependency. Against this backdrop the Self Sufficiency Project was conceived as a test of a gener- ous time-limited earnings subsidy for long-term welfare recipients. The over- all project consisted of three separate demonstrations: the SSP “Recipient” study, conducted on long-term welfare recipients; the SSP “Plus” study, also conducted on long-term recipients, but including both financial incentives and program services; and the SSP “Applicant” study, conducted on new welfare applicants. We focus here on the Recipient study and, henceforth, simply refer to it as SSP.7 Table I summarizes the main features of the study, including the eligibility criteria for the experimental sample and details of the subsidy formula. The

TABLE I KEY FEATURES OF THE SSP RECIPIENT DEMONSTRATION

A. Program eligibility • Eligibility limited to single parents who have received Income Assistance (IA) for at least 12 months. • Sample members drawn from IA registers in British Columbia and New Brunswick, with random assignment between November 1992 and February 1995. • 2,858 single parents assigned to the program group; 2,826 assigned to the control group.

B. Program features • Subsidy payments available to program group members who work at least 30 hours per week (over a 4-week or monthly accounting period) and earn at least the . • Subsidy recipients become ineligible for IA. • Subsidy equals one-half of the difference between actual earnings and an earnings bench- mark, set at $2,500 per month ($30,000 per year) in New Brunswick and $3,083 per month ($37,000 per year) in British Columbia in 1993, and adjusted for inflation in subsequent years. • Subsidy payments are unaffected by unearned income or the earnings of a spouse/partner, and are treated as regular income for income tax purposes. • Subsidy payments are available for 36 months from time of first payment. Payments are only available to program group members who successfully initiate their first supplement payment within 1 year of random assignment. • Once eligible, program group members can return to IA at any time. Subsidy is re- established when an eligible person begins working full time again. • Employers are not informed of SSP status. Program group members apply for subsidy payments by mailing copies of payroll forms.

7See Lin et al. (1998) for a comprehensive description of the program and results from the first 18 months of the experiment, Michalopoulos et al. (2000) for a summary of results in the first 36 months, and Michalopoulos et al. (2002) for the final report on the experiment. These reports also provide summary information on the SSP Plus and Applicant studies. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1727 demonstration was conducted in two provinces—British Columbia and New Brunswick—with random assignment between late 1992 and early 1995. Sam- ple members were drawn from the pool of single-parent IA recipients who were over 18 years of age and had received welfare in at least 11 of the previ- ous 12 months.8 These requirements meant that nearly everyone in the sample had been on welfare continuously for at least a year. The SSP subsidy formula is equivalent to a with a 50% tax rate, a “guarantee level” somewhat above average welfare benefits (but independent of family size), and a minimum hours requirement of at least 30 hours per week.9 The formula was designed to provide much stronger work incentives than regular IA. For example, in 1994 a single mother with one child in New Brunswick was eligible for a maximum IA grant of $712 per month. If she were to leave welfare and take a job paying the minimum wage ($5.00 per hour) for 30 hours per week, her gross earnings would be only $650 per month, so her welfare grant would be reduced by the full amount of her earnings. Under SSP, however, the same person would receive a subsidy of $925 (= 05 × (2500 − 650)), raising the relative payoff for work versus wel- fare to $863 per month, or $6.64 per hour. Since SSP payments were taxable, and also affected subsidized daycare costs, the payoff net of taxes and transfers is about 30% smaller, but still large (see Lin et al. (1998, Table G.1)). A distinctive feature of SSP is the one-year time limit for establishing sub- sidy entitlement. People in the treatment group who failed to initiate a subsidy payment by the end of the time limit lost all future eligibility, creating a strong financial incentive to find a full-time job within a year of entering the demon- stration. For example, a single mother with one child in New Brunswick in 1994 faced an SSP entitlement of up to $33,300 in additional income over 36 months (from working 30 hours per week at the minimum wage). Since some of the behavioral response to the program was arguably attribut- able to this “establishment” incentive, the restricted eligibility window acts to confound the interpretation of simple experimental comparisons of the out- comes of the treatment and control groups. It also makes it difficult to compare the experimental impacts of SSP to the effects of other welfare reform pro- grams. The key goal of our econometric analysis is to disentangle the effects of the establishment incentive from the longer-term entitlement incentive among those who achieved eligibility. Before turning to a more explicit consideration

8No further limitations were placed on the sample. Thus, the experimental sample is in prin- ciple representative of the population of IA recipients who had been receiving welfare for a year or more in the two provinces. Roughly 90% of people who were contacted to participate in the experiment signed an informed consent decree and completed the baseline survey, and were then randomly assigned (Lin et al. (1998, p. 8)). 9In a conventional negative income tax with constant tax rate t and guaranteed (or minimum) income G, an individual with earnings y receives a subsidy of G − ty. This is equivalent to an earnings supplement equal to t times the difference between actual earnings and the “break- even” level B = G/t. 1728 D. CARD AND D. R. HYSLOP of the incentive effects of SSP, however, we summarize the key experimental findings from the demonstration.

B. The SSP Sample Characteristics and Basic Impacts on Welfare The data associated with the SSP experiment were assembled from three separate sources. Income Assistance data were obtained from provincial wel- fare records. The SSP participation and receipt data were collected from SSP administrative records. Demographic data and labor market outcomes were obtained from surveys conducted at 18-month intervals, starting with a base- line survey just prior to random assignment. Table II gives an overview of the characteristics of the SSP sample. Columns 1 and 2 of the table show the mean characteristics of the control and program groups of the experiment, while the

TABLE II CHARACTERISTICS OF SSP EXPERIMENTAL SAMPLEa

Program Group, by SSP Eligibility Status

Controls Programs Eligible Ineligible In British Columbia (%) 526532509544 Male (%) 47524655 Mean age 319319311324 Age25orless(%) 178171185163 Never married (%) 481483480485 Average number kids <607070707 Average number kids 6–15 08080808 Immigrant (%) 138133122139 Grew up with two parents (%) 597594621581 High school graduate (%) 446457569399 Means years work exp. 74738667 Working at random assignment (%) 190182315114 Months on IA last 3 years 296301292306 IA continuously last 3 years (%) 415438363477 Percent on IA by months since random assignment Month 6 908831628935 Month 12 837724391894 Month 18 779659272856 Month 24 730633265821 Month 36 654588276748 Month 48 567535293659 Month 60 506484285585 Month 69 450450254550 Number of observations 2,786 2,831 957 1,874

aSample includes observations in the SSP Recipient Experiment who were on IA in the month of random assign- ment and the previous month. Eligible program group is the subset who received at least one SSP subsidy payment. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1729 third and fourth columns distinguish between individuals in the program group who were either successful or unsuccessful in establishing eligibility. Although the design of the experiment called for an intake group of single parents who had been on welfare continuously for at least a year, a small number of peo- ple selected for the sample left IA prior to the date of random assignment. To simplify our empirical analysis, we ignore these people and focus on the 5,617 observations who were still on IA at the time of random assignment.10 Income assistance records are available for 69 months following random assignment, 18 months after the final SSP subsidy payment was made. Random assignment of the treatment and control groups ensures that the “preassignment” characteristics of the two groups are statistically indistin- guishable. The sample mainly comprises single mothers, with a mean age of 32 and an average of 1.5 children. Sample members show many of the charac- teristics associated with poor labor market outcomes, including a low rate of high school graduation (45% versus roughly 70% in the adult population of Canada) and a high probability of being raised by a single parent. Nevertheless, average work experience is relatively high (7.3 years), and about 20% of the sample were working at random assignment. Overall, one-third of the program group managed to establish eligibility for SSP payments. The two right-hand columns of the table report the characteristics of the eligible and ineligible subgroups. The eligible subgroup was younger and, as one might expect, better educated and more likely to be working just prior to random assignment. The lower panel of Table II reports welfare participation rates at 6-month intervals after random assignment.11 The control group (column 1) shows a steady decline in welfare participation, ending up with a 45% participation rate at the end of the sample period. Relative to this counterfactual trend, the program group shows a faster initial drop. The decline is especially rapid for the SSP-eligible subgroup (column 3), but much slower for those who failed to establish eligibility (column 4), presumably reflecting both the incentive ef- fects of SSP and the selective nature of the eligibility process. The impact of selectivity is particularly clear in the last month of the sample. At this point the average IA participation rates of program and control groups are equal, sug- gesting that SSP had no permanent impact,12 but the welfare participation rate

10As we discuss below, this restriction eliminates any “initial conditions” problems associated with the second order dynamic models we consider in the subsequent analysis. A total of 40 pro- gram group members and 27 treatment group members are excluded by this requirement. The difference in probabilities between the groups has a p-value of 10%. Since people did not know their program status (treatment or control) until after random assignment, we believe that the difference is coincidental. 11Some individuals may have left their original province and entered welfare in another province. These individuals would be coded as having 0 welfare benefits. SSP payments were available to people who left their original province. 12It is possible that SSP had offsetting long-run impacts on the eligible and ineligible subgroups of the program group. Based on the nature of the program, however, we believe this is extremely unlikely. 1730 D. CARD AND D. R. HYSLOP of the eligible subgroup is far below the average of the control group, while the rate of the ineligible subgroup is far above. Evidently, eligibility was more likely for those with a lower long-run probability of remaining on welfare. Further insight into the impacts of the program and the behavior of the eli- gible subgroup are provided in Figures 1a–1c, 2a and 2b. Figure 1a shows aver- age IA participation rates in each month after random assignment, along with the estimated program impact (the difference in means between the program and control groups). The impact of SSP peaked at −14 percentage points in month 15, declined steadily to −7 percentage points in month 36, and contin- ued to decline further as people who were eligible for subsidy payments came to the end of their 3-year eligibility period.13 By month 52 all SSP payments had ended: at that point the gap in welfare participation between the program and control groups was 2.5% (standard error 1.3%). The gap continued to close for the remainder of the sample period, converging to 0 by month 69. Figures 1b and 1c show the welfare exit and entry rates of the control and program groups in each month after random assignment. Because of the se- lective risk sets for these conditional probabilities, differences in the entry and exit rates between the program and control group do not necessarily rep- resent the causal effects of the program on transition rates. That said, the exit rate of the program group was 1–2 percentage points higher in the first

FIGURE 1a.—Monthly income assistance participation rates.

13There is some slippage in the measurement of the date of SSP eligibility, discussed below, although most of those who became eligible did so between 2 and 15 months after random as- signment. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1731

FIGURE 1b.—Exit rates from income assistance.

15 months of the experiment, then about half a point higher over the period from 15–48 months after random assignment (while program group members could receive SSP payments), and finally about equal to the rate for the con-

FIGURE 1c.—Entry rates into income assistance. 1732 D. CARD AND D. R. HYSLOP

FIGURE 2a.—Program participation and full-time work around first month of SSP receipt. trols in the period after the end of SSP eligibility.14 Conversely, welfare entry rates of the program group were 2–4 percentage points below those of the con-

FIGURE 2b.—Supplement receipt and IA participation around the end of SSP eligibility.

14The initial peak in the difference in IA exit rates at months 3–4 corresponds with the exit of program group members who were working full time at random assignment. The later peak (months 14–16) corresponds with the exit of program group members near the end of the eligi- bility window. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1733 trols in the first 15 months after random assignment, about half a point lower in the period from 18 to 48 months after random assignment, and about equal to those of the control group after month 50. The program group also had rela- tively high welfare entry rates 15–18 months after random assignment, perhaps reflecting the decision of some program group members to take an unattrac- tive job near the end of the eligibility window, establish an entitlement for SSP, and then quit and return to welfare. Figures 2a and 2b focus on the behavior of the eligible program subgroup at the start and end of their eligibility period. Figure 2a shows monthly IA partic- ipation rates, full-time employment rates, and the fraction of people receiving SSP payments around the date of the first SSP check (month 0 on the graph). Following the jump associated with the first subsidy check, the SSP recipiency rate gradually declines to about 60%. As expected given the eligibility rules, the rate of full-time employment rises prior to the date of the first SSP check, reaching a maximum of about 80% in the month before the check. (We at- tribute the fact that the rate never reaches 100% to recall and measurement errors in the labor market data—see below.) Assuming that people in the pro- gram group became eligible once they started working full time, there is about a 1-month delay between eligibility and the dating of the supplement check.15 The SSP rules required supplement takers to leave IA, creating a mechanical link between the initiation of SSP eligibility and subsequent IA participation.16 However, IA eligibility is based on retrospective income flows, leading to a delay between the start of SSP recipiency and the end of IA. Figure 2b shows welfare behavior and supplement recipiency rates near the close of the eligibility period. Again, we have aligned the data relative to the month of the first SSP check. Just before the end of eligibility about 50% of the eligible group were still receiving subsidy checks. The rate drops sharply at 37 months, reflecting the 3-year maximum eligibility rule.17 The end of sub- sidy eligibility was associated with a spike in IA entry rates and a roughly 4 per- centage point rise in IA participation, suggesting that some people who were SSP eligible returned to IA as soon as their eligibility ended.

C. ImpactsonLaborMarketOutcomes The SSP study included surveys at approximately 18, 36, and 54 months af- ter random assignment that collected labor market outcomes of the treatment

15The SSP recipients were required to mail their pay stubs to an administrative office to verify their employment. Mailing and processing would be expected to generate at least a month delay between the actual commencement of full-time work and the issuance of the first SSP check. 16This was implemented by having SSP staff notify the appropriate Income Assistance office that an individual was about to begin receiving subsidy payments. 17There is a small number of cases that received checks 37 or even 38 months after the first check date. We attribute this to errors in the dating of the checks and other measurement prob- lems. 1734 D. CARD AND D. R. HYSLOP and control groups. Unfortunately, these data have some critical limitations relative to the administratively based Income Assistance data. Most impor- tantly, they are only available for 52 months after random assignment. Since some program group members were still receiving subsidy payments as late as month 52, this time window is too short to assess the long-run effects of the program. Indeed, looking at Figure 1a, there is still an impact on IA partici- pation in month 52 that does not fully dissipate until month 69. Second, be- cause of nonresponses and refusals, labor market information is only available for 85% of the experimental sample (4,757 people).18 Third, there appear to be relatively large recall errors and seam biases in the earnings and wage data.19 Nevertheless, the labor market outcomes provide a valuable complement to the administratively based welfare participation data. Figure 3 shows the average monthly employment rates of the program and control groups, along with the associated experimental impacts. After ran- dom assignment the employment rate of the control group shows a steady

FIGURE 3.—Monthly employment rates.

18The distribution of response patterns to the 18-, 36-, and 54-month surveys is fairly simi- lar for the program and control groups (chi-squared statistic = 11.4 with 7 degrees of freedom, p-value = 0.12). However, a slightly larger fraction of the program group have complete labor market data for 52 months—85.4% versus 84.0% for the controls. Moreover, the difference in mean IA participation between the treatment and control groups in month 52 is a little different in the overall sample (2.5%) than in the subset with complete labor market histories (3.3%). 19Each of the three post-random-assignment surveys asked people about their labor market outcomes in the 18 months since the previous survey. Many people report constant earnings over the recall period, leading to a pattern of measured pay increases that are concentrated at the seams, rather than occurring more smoothly over the recall period. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1735 upward trend. Relative to this trend the program group shows a faster rise in the first year of the experiment, reaching 40% by month 13 and stabiliz- ing thereafter. The estimated program impact peaks at about 14 percentage points in months 12–13, declines to about 6 percentage points by month 36, and falls to 0 by month 52. Data for average monthly earnings (presented in Card and Hyslop (2005)) show a similar time profile, although there are notable “jumps” for both experimental groups at months 18 and 36, attributable to the seams between surveys. The experimental earnings impact peaks at about $125 per month in months 12–13 and declines to 0 by month 52. A key issue for interpreting the observed impact of SSP is the quality of the jobs taken by members of the program group who would not have been work- ing in the absence of the program. Consistent with the fact that most long-term welfare recipients have limited job market skills, and evidence that minimum wages were set at relatively high levels in British Columbia and New Brunswick in the mid-1990s (Fortin and Lemieux (2001)), Figure 4 suggests that most of these jobs paid close to the minimum wage. The upper line in the graph is the difference in the fractions of the program and control groups with a re- ported wage in each month. This is approximately equal to the difference in monthly employment rates. The dotted line in the figure represents the differ- ence in the fraction of people who report an hourly wage within 25 cents of the province-specific minimum wage for the appropriate calendar month. (Note that the denominator of this fraction includes everyone in the program or con- trol group, not just those who report a wage.) Because of measurement errors

FIGURE 4.—Distribution of added employment in the program group. 1736 D. CARD AND D. R. HYSLOP in wages this is arguably an underestimate of the fraction of extra workers in the program group that earned close to the minimum.20 Themiddleline(with open square markers) shows the excess fraction of the program group earn- ing between +$1 and −$0.25 of the minimum wage. Again, this is probably an underestimate of the true fraction. Even with potential attenuation biases, however, 60–80% of the extra wage earners in the program group were paid within $1 per hour of the minimum wage. Under two key assumptions—that SSP had no effect on wages for people who would have been working in the absence of the program and only positive effects on labor supply—the ratio of the differences in earnings and hours of the program and control groups provides a consistent estimate of the average rate of pay earned during the extra hours of work attributable to SSP. To see 0 this, let hit represent the hours of work of individual i in month t in the ab- 1 sence of SSP, let hit represent hours of work of i in month t if she is assigned = 1 − 0 to the program, let hit hit hit denote the treatment effect on hours for in- 0 1 dividual i,andletwit and wit represent average hourly earnings in the absence or presence of SSP. Because of random assignment, the difference in average monthly hours of the program and control groups in month t is a consistent estimate of E[hit ]. Likewise, the difference in average monthly earnings is a consistent estimate of

[ 1 1 − 0 0 ]= [ 1 + 0 ] E withit withit E withit hitwit

= 1 − 0 where wit wit wit . Thus, the ratio of the difference in mean earnings of the program and control groups to the corresponding difference in mean hours is a consistent estimate of

≡ [ 1 + 0 ] [ ] mt E withit hitwit /E hit

If wages for people who would have worked in the absence of SSP were unaf- [ 0 ]= fected by the presence of the program, then E hitwit 0and

= [ 1 ] [ ] mt E withit /E hit

20The wage data appear to be quite noisy, but the density of reported wages is highest right around the minimum wage. Assuming this is also true of the density of true wages and that misclassification errors are symmetric, the observed fraction of workers earning near the mini- mum wage understates the true fraction. Formally, let p(j) represent the true fraction of workers earning wages in interval j,letjm denote the interval that includes the minimum, assume that p(jm)>p(j)for all other intervals j, and assume that an individual with a wage in interval j has probability 1 − q of being correctly classified in that interval, probability q/2 of being classified in interval j + 1, and probability q/2 of being classified in interval j − 1. Then the observed fraction of people in interval jm is (1 − q)p(jm) + (p(jm − 1) + p(jm + 1))q/2

FIGURE 5.—Average wages associated with excess earnings of the program group.

Assuming that hit ≥ 0foralli, mt is a weighted average of the wages earned by the people in the program group in month t, with weights proportional to the increase in hours caused by the SSP program.21 Figure 5 plots estimates of this ratio, along with a 95% confidence inter- val (estimated by the delta method). To account for differences in the min- imum wage over time and across the two provinces, we have divided the measured wage of each individual in each month by the prevailing minimum wage. The wage measure is therefore expressed in “minimum wage units,” with a value of 1 implying that the average marginal wage is equal to the minimum wage. Inspection of the graph suggests that the average marginal wage is very close to the minimum wage, with no systematic trend, although the confidence intervals are rather wide after about month 30 (reflecting the small denomi- nator of the ratio). This reinforces the conclusion from Figure 4 that the extra hours of the SSP program group were paid at wages very close to the minimum wage.22

21 The assumption that hit ≥ 0 is identical to the monotonicity assumption required for the interpretation of local average treatment effects in an instrumental variables context (Imbens and Angrist (1994)). The full time hours limit for SSP was imposed to prevent the kinds of hours reductions often attributed to negative income tax programs. In light of this rule and the fact that sample members are all single parents, we believe that the assumption of positive hours effects is reasonable. 22If SSP causes people who would have worked anyway to select jobs with different wage rates, the interpretation of the average marginal wage is more complicated. Arguably, SSP provides an incentive to take a more stable job or one with higher hours. If these jobs pay lower wages, the estimated average marginal wage will be negatively related to the fraction of people in the program group who are choosing different jobs (but would have worked anyway). The fact that 1738 D. CARD AND D. R. HYSLOP

The absence of a trend in the average marginal wage relative to the minimum wage is important because it suggests that the SSP program group experienced little or no relative gain in potential wages over the course of the experiment. This is confirmed by the analysis in Table III of labor market outcomes in the last available month (month 52). Recognizing the higher average level of wages in one of the two provinces (British Columbia), we present data for the overall sample and separately by province.23 By month 52 there is no significant gap

TABLE III SUMMARY OF LABOR MARKET OUTCOMES 52 MONTHS AFTER RANDOM ASSIGNMENTa

Both British New Provinces Columbia Brunswick Control group outcomes in month 52 Percent employed 4156 3919 4408 (102)(141)(148) Percent with reported wage 3826 3563 4108 (101)(138)(146) Mean log hourly wage 217 236 199 (001)(002)(002) Cumulative employment since 141 133 149 random assignment (in years) (003)(004)(005) Program group outcomes in month 52 Percent employed 4169 3773 4605 (100)(136)(147) Percent with reported wage 3945 3504 4431 (099)(134)(147) Mean log hourly wage 215 234 199 (001)(002)(002) Cumulative employment since 168 155 182 random assignment (in years) (003)(004)(005) Difference: Program group − Control group Percent employed 013 −146 197 (143)(196)(208) Percent with reported wage 119 −058 323 (141)(192)(207) Mean log hourly wage −002 −002 −001 (002)(003)(002) Cumulative employment since 028 022 033 random assignment (in years) (004)(006)(007)

Continues the estimated marginal wage is roughly constant over the entire 52-month period suggests that any impact on wages of those who would have worked regardless of SSP is small. 23Wages for the labor market as a whole are 20–30% higher in Vancouver than in the areas included in the New Brunswick sample. The minimum wage varies by province and is typically 25% higher in British Columbia than New Brunswick; e.g., $5.00 per hour in New Brunswick in 1993 versus $6.00 per hour in British Columbia. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1739

TABLE III—Continued

Both British New Provinces Columbia Brunswick Regression models for outcomes in month 52 Reduced form equations (a) Program group effect in model for log wage −001 −002 −0002 (002)(003)(002) (b) Program group effect in model for cumulative work 037 028 046 (fitted to subsample with reported wage) (005)(008)(007) Effect of cumulative work on wage in month 52 (c) Estimated by OLS 0049 0046 0051 (0007)(0012)(0009) (d) Estimated by IV, using program group status −0032 −0088 −0004 as instrument (0045)(0099)(0046)

aStandard errors in parentheses. The sample includes 2,339 in the control group and 2,418 in the program group with complete employment data for 52 months after random assignment. Regression models in the bottom panel are fitted to subgroups of 895 control group members and 954 program group members with reported wage in month 52. Other covariates in regression models include year dummies, education, experience, high school completion dummy, immigrant status, age, indicators for working or looking for work at random assignment, and indicators for physical or emotional problems that limit work (measured at random assignment). See text. between the program and control groups in the fraction of people working or reporting a wage. Indeed, in one province the program group has a slightly lower employment rate than the control group, while in the other the pattern is reversed, although in neither case is the difference significant. Mean wages are also very similar in the program and control groups. This may seem a little surprising given the extra work effort by the program group over the previ- ous 51 months. Indeed, as shown in the table, we estimate that program group members worked a total of 0.28 years more than control group members be- tween random assignment and month 52. Recall, however, that the sample had about 7 years of work experience at random assignment. Evidence on the re- turns to experience for less skilled female workers suggests the marginal impact of 0.2–0.3 years of work experience for such a group is small—on the order of 1–2% (Gladden and Taber (2000))—and probably undetectable. The bottom panel of Table III presents results from a series of regression models that evaluate the impacts of being in the SSP program group on wages and cumulative work experience in the 52nd month of the experiment. These models are fitted to the subsamples of control and program group members with wage data in month 52, and include time dummies, province dummies (in the models that pool the two provinces), and a set of covariates that repre- sent pre-random-assignment characteristics. A possible concern with the mod- els is selectivity bias, since the sample is conditioned on reporting a wage in month 52. However, given equality in the fractions of the program and control groups with a wage, and the similar characteristics of the employed subgroups, a conventional control function for selectivity bias would have the same mean 1740 D. CARD AND D. R. HYSLOP value in the two groups and is, therefore, orthogonal to the program group dummy.24 The estimates in row (a) of the lower panel show that the difference in mean log wages between the program and control groups is small and statis- tically insignificant. By comparison, the estimates in row (b) show that among those working in month 52, members of the program group have significantly greater cumulative work experience than members of the control group. In rows (c) and (d) we present models in which cumulative work experience is included as an additional explanatory variable for wages in month 52. As shown in row (c), a model that ignores the potential endogeneity of cumulative experience yields a relatively large and precisely estimated effect of work expe- rience, on the order of 5% per year. When program group status is used as an instrument for cumulative work experience, however, the estimated effect be- comes slightly negative but insignificant. Since the IV estimate is numerically equal to the ratio of the coefficients in rows (a) and (b), this is just a restate- ment of the fact that although the program group had greater cumulative work experience, they had marginally lower wages. We have also examined the entire distributions of wages in month 52 for the program and control groups, and found no significant differences between them. The 10th, 25th, 50th, 75th, and 90th percentiles of the two distributions are quite similar and statistically indistinguishable. Likewise, a nonparametric rank test for equality of the distributions is insignificant. Overall, the work ex- perience attributable to SSP appears to have had no detectable effect on wage opportunities. As we noted earlier, this is not very surprising given the mean level of experience among the single mothers in the experimental sample and existing evidence on the modest returns to added experience for lower skilled women.

2. INTERPRETING THE IMPACT OF SSP ON WELFARE PARTICIPATION As noted earlier, the time limit to establish eligibility for subsidy payments suggests that SSP had different incentive effects before and after the estab- lishment of eligibility. Furthermore, for those who established eligibility, the date of entitlement potentially varies nonrandomly. Consequently, although standard experimental comparisons between the treatment and control groups remain valid, the interpretation of such impacts is confounded by the different treatment effects associated with these two sets of incentives. In this section we

24As noted in Ahn and Powell (1993), conventional single index models imply that the selec- tion bias in the observed mean for a censored outcome is a monotonic function of the degree of censoring, conditional on the exogenous covariates. We evaluated the similarity of the char- acteristics of program and control group members who reported a wage in month 52 by running a regression model among wage reporters to predict program group status. The model included 24 pre-random-assignment characteristics and interactions. The model has insignificant explana- tory power (probability value of F-test = 0.94), suggesting there are no differences in the observed characteristics of the two groups. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1741 focus on modeling the impact of SSP on welfare participation with the objec- tive of disentangling these effects. Our approach has three components. First, to help clarify the incentive ef- fects of SSP and guide the formulation of our empirical model, we outline a simple dynamic model of work and welfare participation in the presence of SSP. Second, to provide a baseline model for welfare participation in the ab- sence of SSP, we estimate a set of dynamic logistic models for the SSP control group that include unobserved heterogeneity and state dependence. Third, we combine the insights from the theoretical model with the baseline empirical model for welfare participation to develop an econometric model for estimat- ing and interpreting the impacts of SSP. To account for possible selectivity of the SSP-eligible subgroup, we model welfare participation and the determi- nation of eligibility for SSP payments jointly. Crucial to this formulation is the maintained assumption, based on randomization of SSP participants into treat- ment and control groups, that the distribution of unobserved heterogeneity is the same across these groups. This assumption allows us to identify the treat- ment effects of SSP by comparing the welfare participation rates of the treat- ment group and the control group, conditional on the date of SSP eligibility and the unobserved heterogeneity distribution.

A. A Simple Benchmark Model for the Behavioral Impacts of SSP We begin by presenting a simple dynamic model of work and welfare partic- ipation in the presence of SSP.25 The model is a standard discrete time search model (e.g., Mortensen (1977, 1986)) in which a single parent has two options: full-time employment or welfare participation. Welfare pays a monthly bene- fit $b and yields a flow payoff of b. Full-time employment at a monthly wage of $w yields a flow payoff of w − c,wherec reflects the disutility of work rela- tive to welfare (including child care costs, work expenses, the value of foregone leisure, and potential stigma effects). Individuals maximize expected future in- come using a monthly discount rate of r. To keep the model as simple as pos- sible, we assume that each month an individual receives a single job offer with probability λ, and that the arrival rate of offers is the same for workers and nonworkers. Wage offers are drawn from a stationary distribution with density f(w) and cumulative distribution F(w). Finally, we assume a constant rate of job destruction δ, which applies to new as well as existing jobs. A key simplifying assumption is that wage opportunities do not depend on previous work effort. Based on the results in Table III we believe this assump- tion is reasonable. In fact, the evidence in Figures 4 and 5 suggests that, for most people who were offered SSP, the key issue was whether to accept a min- imumwagejobornot.

25A more complete description of the model is provided on the Supplemental Material website. 1742 D. CARD AND D. R. HYSLOP

Optimal behavior in the absence of a wage subsidy program is characterized by a stationary value function U(w) that gives the discounted expected value associated with a job paying wage w and a value V 0 of nonwork (i.e., welfare participation). People who are employed at a wage w accept any offer paying more than w. People who are on welfare follow a reservation-wage strategy and accept any job paying more than R, the (fixed) reservation wage satisfying U(R) = V 0. Under the assumptions of the model it is readily shown that the optimal reservation wage is R = b + c.26 This model predicts that welfare transitions in the absence of SSP are de- termined by a combination of the arrival rate of job offers, the rate of job destruction, the level of welfare benefits, the distribution of wages, and the pe- cuniary and nonpecuniary costs of work. Specifically, the exit rate from welfare is λ(1 − δ) × (1 − F(b+ c)), while the entry rate is δ. Individual heterogeneity in welfare exits arises from variation in λ, δ, c, and in the location of the wage offer distribution relative to the welfare benefit level. Individual differences in welfare entry rates arise from heterogeneity in δ. If an SSP subsidy is made available at time 0, an individual currently on wel- fare has to evaluate three separate value functions: Vi(t), the value of not work- ing in month t, conditional on not yet having established eligibility; Ue(w d), the value of a job paying a wage w conditional on SSP eligibility with d months of elapsed eligibility; and Ve(d), the value of not working conditional on eligi- bility and d months of elapsed eligibility. The rules of SSP provide a link be- tween these functions and the value functions in the absence of the program. 0 In particular, Vi(t) = V for t ≥ 13, since those who fail to find full-time work within 12 months of being offered the subsidy lose all future eligibility. In addi- tion, Ue(w d) = U(w) for all d>36, since subsidy payments are only available 0 for 3 years, while Ve(d) = V for all d ≥ 36. A revealed preference argument establishes that Ue(w d) > U(w) for all w and any d ≤ 36, since the subsidy paid to a worker earning a wage w is strictly positive. From this it follows that Vi(t) is decreasing in t (since the passage of time leaves less time to estab- lish eligibility), and that Ue(w d) and Ve(d) are both decreasing in months of elapsed eligibility (since the entitlement period is finite). As is true in the absence of the subsidy, people who are working and eligible for the supplement accept any job offer that pays more than their current wage, while those who are on welfare with d months of elapsed eli- gibility follow a reservation-wage strategy with a reservation wage Re(d),with Ve(d) = Ue(Re(d) d). Since people can quit jobs that are no longer acceptable once their SSP eligibility ends, it is straightforward to show that the optimal reservation wage for an SSP-eligible nonworker equates the net income from a reservation-wage job to the flow value of welfare, b + c.Sinceb and c are

26If on-the-job and off-the-job search are equally productive, there is no reason to turn down a job yielding flow value (w − c) greater than the flow value of welfare (b). Hence the reservation wage is the income equivalent of welfare, b + c. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1743

fixed, Re is independent of d and is defined by the equality Re + s(Re) = b + c, where s(w) is the subsidy for working at a wage rate w.27 Individuals who are still on welfare in month t and not yet SSP-eligible have a reservation wage R(t) that satisfies the condition Vi(t) = Ue(R(t) 1).From this equality and the fact that Vi(t) is decreasing in t, it follows that the reser- vation wage R(t) is decreasing in t: individuals with fewer months of potential eligibility left will accept lower wage jobs. Moreover, the reservation wage in the first month of potential eligibility, R(1), is strictly less than the reservation wage once eligible, since a full-time job for someone who is not yet eligible provides the same flow benefits as for someone who is eligible, and in addition guarantees future eligibility. Thus, Re >R(1) ≥ R(2) ≥···≥R(12). The effects of SSP on the welfare/work decision can be summarized by the difference between the reservation-wage profiles of a representative welfare recipient in the presence or absence of SSP. Figure 6a shows the sequence of reservation wages for a person who is offered SSP but fails to establish eligi- bility, along with the (constant) reservation wage R = b + c in the absence of the program. During the 12-month window that individuals have to establish eligibility, the reservation wage is below R and declining. At the close of the window those who failed to find a job revert to the reservation wage in the ab- sence of the program. Figure 6b shows the sequence of reservation wages for

FIGURE 6a.—Reservation wage of ineligible program group member.

27Since s(w) ≥ 0, the reservation wage for SSP eligibles is below the reservation wage R in the absence of the program. Indeed since R = b + c,wehavethatRe + s(Re) = R. Note that Re could be below the minimum wage. However, SSP rules required participants to earn at least the minimum wage. 1744 D. CARD AND D. R. HYSLOP

FIGURE 6b.—Reservation wage of eligible program group member.

a person who is offered SSP and establishes eligibility in month te ≤ 12. Prior to te the reservation wage is declining. At te the reservation wage jumps up and remains constant until the end of the entitlement period at a value that satisfies the condition Re + s(Re) = b + c. After SSP entitlement ends in month te + 36, the reservation wage reverts to b + c. The path of the optimal reservation wage illustrates the two different incen- tive regimes experienced by those exposed to SSP. During the pre-eligibility period (up to the establishment of eligibility in month te, or 12 months af- ter random assignment for those who do not establish eligibility), members of the treatment group have a low and declining reservation wage, leading to a faster rate of transition from welfare to work than would be expected in the absence of SSP. Those who establish eligibility then adopt a somewhat higher reservation wage, but still lower than the one in the absence of the program, implying that they are more likely to leave welfare and reenter work than oth- erwise similar members of the control group. The jump in the reservation wage at te implies that some people who accepted low-paying jobs to gain eligibility would be expected to quit and return to welfare almost immediately. Once SSP eligibility ends (36 months after establishing eligibility, or starting in month 12 for those who do not establish eligibility), the reservation wage returns to its level in the absence of the program and the behavioral effects of SSP disap- pear. Again, as a result of the jump in the reservation wage at the close of eligibility, people holding jobs that pay less than the reservation wage in the absence of SSP would be expected to quit and reenter welfare (consistent with the patterns in Figure 2b). While this stylized model provides a guide to the potential effects of SSP, it is clearly oversimplified. For example, the model assumes that the cost of work (c) is constant. More realistically, work costs evolve over time (e.g., if EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1745 a child becomes sick), leading people to revise their reservation wages and quit some jobs that were previously acceptable. An earnings subsidy widens the range of cost fluctuations that can be tolerated at any wage, leading to a reduction in the flow from work back to welfare. Another limitation is the assumption that people either work full time or receive welfare. In fact, some people leave welfare without entering full-time work. In our empirical model, we therefore have to distinguish between leaving welfare and becom- ing SSP-eligible. The model also ignores the possibility that the cost of work is affected by previous work experience. A habit persistence model, for example, might imply that individuals who work more when SSP is available eventually lower their reservation wages. While endogenous taste formation is sometimes mentioned as a welfare trap mechanism, the evidence from the SSP experiment is not particularly favorable to this story, since by month 52, just a few months after the end of subsidies, the employment rate of the program group had con- verged to the rate of the control group. The model also ignores wealth effects and intertemporal substitution effects, which could lead to a negative impact on the probability of work in the period immediately after the end of SSP.

B. Models of Welfare Participation in the Absence of SSP We begin our empirical analysis by estimating a series of models of welfare participation for the SSP control group. Our objective here is to formulate a statistical model for welfare participation in the absence of SSP that will pro- vide a suitable baseline counterfactual for comparing welfare outcomes under the SSP program. We adopt a panel data approach rather than a hazard model- ing approach because of the high incidence of multiple spells in our data (over half the sample have two or more spells on welfare) and the need to specify a tractable (low dimensional) model of the effects of unobserved heterogeneity on welfare transition rates and SSP eligibility.

Let yit representanindicatorthatequals1ifpersoni receives IA in month t (where t runs from 1, the first month after random assignment, to T = 69) and

let xi1xiT represent a sequence of observed covariates for individual i.We consider models in the class

(1) P(yi1yiT |xi1xiT ) = L αi + xit β + (γ10 + γ11αi)yit−1 + (γ20 + γ21αi)yit−2 t + (γ30 + γ31αi)yit−1yit−2 f(αi)dαi 1746 D. CARD AND D. R. HYSLOP where L(·) represents the logistic distribution function and f(αi) represents the density of unobserved heterogeneity among the experimental population.28 We consider two alternative specifications for f(·). In the first case we assume that f(αi) = φ(·; σα), the normal density with mean 0 and standard devia- tion σα. The integral on the right-hand side of equation (1) can then be ap- proximated by the method of Gaussian quadrature.29 As an alternative, we assume that f(·) is a discrete distribution with a small number of mass points, and estimate the location of the mass points and their relative probabilities. Equation (1) describes a logistic regression model with second order state dependence and a random effect. Although our benchmark theoretical model suggests a first order state dependence specification, we show below that a second order specification leads to a considerable improvement in model fit. There are a surprising number of single-month spells on or off welfare, and the key assumption of a first order model—that exit or entry rates are independent of the length of the current spell—is clearly violated. A second order specifi- cation allows the transition rate in the first month of a spell to differ from the transition rate in subsequent months and is more consistent with the data.30 Note also that because the experimental sample is drawn from the population of long-term welfare participants (and we drop the small number of individ- uals who left IA between the time they were selected for the sample and the time they were contacted for random assignment), there is no initial conditions 31 problem in estimation of (1). At the same time, the distribution f(αi) repre- sents the distribution of heterogeneity in the population of long-term welfare recipients, rather than the distribution for a more general population. The first three columns of Table IV present estimation results and diagnos- tic statistics for versions of equation (1) with normal heterogeneity. The only covariates are a fourth order polynomial in time since random assignment.32 The model in column 1 assumes first order state dependence, while the model

28Chay and Hyslop (2001) have found that logistic models with state dependence and unob- served heterogeneity fit welfare behavior about as well as more computationally demanding mul- tivariate Probit models that allow for serial correlation in the transitory component of welfare participation. 29As noted by Butler and Moffitt (1982), the likelihood for models in the class of equation (1) when f(·) is the normal density have the form g(x)exp(−x2)dx, which can be approximated by the sum i wig(xi),whereg is evaluated at a fixed set of N points (xi) and the sum is formed with a fixed set of weights (wi). We use N = 10 points; see Abramowitz and Stegun (1965, p. 924). 30An alternative would be to ignore 1-month spells by “smoothing” over such spells. 31Everyone in the sample has identical IA status in the “presample” months. On the general importance of modeling the distribution of initial conditions in dynamic models, see Heckman (1981). 32We have fitted a variety of models that include fixed baseline covariates such as province, education, and gender, as well as time varying indicators for calendar time. In these models sev- eral of the covariates are statistically significant and absorb some of the variance attributed to the random effect. However, the ability of models with controls for observable heterogeneity to fit the distribution of observed welfare histories is insignificantly different from models that treat all heterogeneity as unobserved. One of the few covariates that has much explanatory power is EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1747

TABLE IV ESTIMATED DYNAMIC MODELS FOR IA PARTICIPATION OF CONTROL GROUPa

Models with Normally Distributed Model with Mass-Point Random Effect Distribution of Random Effect

(1) (2) (3) (4) State dependence parameters γ10 522 519 476 465 (003)(007)(006)(010) γ20 —219 203 184 (005)(005)(007) γ30 — −139 −089 −087 (008)(008)(009) γ11 ——−070 −093 (007)(002) γ21 ——−028 −058 (004)(003) γ31 ——081 061 (008)(004) Standard deviation of 164 132 157 4 mass points random effect, σα (003)(003)(006) Log likelihood −28,276.0 −27,225.6 −27,202.6 −27,067.4 Goodness of fit 752.6 260.3 253.0 175.8 Generalized residuals Mean −002 000 000 000 Variance 111 095 097 098 1st order correlation −007 001 000 000 2nd order correlation 003 −002 −002 −003 3rd order correlation 003 000 000 000 4th order correlation 004 001 001 000 5th order correlation 003 001 001 000

aStandard errors in parentheses. See text for model specifications. All models include fourth order trend. Models in columns 1–3 are estimated by maximum likelihood using Gaussian quadrature with 10 points. The model in column 4 has 4 mass points. Goodness of fit and diagnostic tests for generalized residuals are explained in text. in column 2 allows second order state dependence; both restrict the degree of state dependence to be invariant to the unobserved heterogeneity (i.e., γk1 = 0, for k = 1 2 3). The second order terms are highly significant, and their pattern implies that, after controlling for the permanent component of welfare partici- pation, welfare transition rates are higher for those who have only been in their current state for 1 month than for those who have been in the state for 2 or more months. The model in column 3 includes interaction terms between the

an indicator for having at least a high school education. However, as shown in Figure A in our appendix of additional results on the Supplemental Material website, the actual impacts of SSP on IA behavior are not much different between high school graduates and dropouts. 1748 D. CARD AND D. R. HYSLOP state dependence parameters and the random effects (i.e., the γk1 terms in equation (1)). This relaxes the “linear in log odds” assumption of the logistic functional form and permits the degree of state dependence to vary by whether individuals have a higher or lower long-run propensity to participate in welfare. The interaction terms are statistically significant and their addition leads to a noticeable improvement in the likelihood of the model. The sign pattern of the interactions implies that the state dependence effects are larger for those who are less likely to be on welfare in the long run. How well do these models explain observed welfare outcomes? As a descrip- tion of average IA participation rates, the answer is very well. The time path of welfare participation predicted by any of the models is fairly close to the actual path. This is not surprising, however, given that the models include a fourth order polynomial trend and that the control group’s welfare profile is fairly smooth. A more difficult challenge is to predict the distribution of welfare his- tories among the control group.33 To evaluate the models on this dimension we compare the predicted and actual fractions of the control group in a set of mutually exclusive cells defined by the total months on IA since random assign- ment, the number of welfare transitions, and whether the number of transitions is odd (in which case the individual ends up off IA) or even (in which case she ends up on IA). The cells used in our comparisons, along with the actual and predicted numbers of observations from the SSP control group in each cell, are shown in Table V. We selected the cells to yield reasonable cell sizes: thus, we grouped welfare histories with 0–2, 3–8, 9–14, ...totalmonths on IA, with separate cells for 68 or 69 months on IA. Overall, we collapsed the 269 possible welfare histories into 50 cells. For each of the models in Table IV we constructed a chi-squared statistic based on the deviation between the predicted and actual number of observa- tions in each cell.34 Allowance for second order state dependence leads to a considerable improvement in the ability of the model to predict the distribu- tion of welfare histories (compare fit statistics in column 1 and column 2). By comparison, the addition of the interaction terms in column 3 leads to only a modest additional improvement in fit. The upper panel of Table V compares the actual (in bold) and predicted (in italic) distributions of the SSP control group across the 50 cells, using the model from column 3 of Table IV. A prominent feature of the data is the large

33The idea of comparing predicted and actual frequencies from multinomial probability mod- els is discussed in Moore (1977), and is used in Card and Sullivan (1988) and Chay and Hyslop (2001). We construct predicted cell fractions by simulating each model with 10 replications per sample member. 34 − 2 We constructed the standard Pearson statistic: j (Oj Ej ) /Ej ,whereOj is the number of observed cases in cell j = 1J and Ej is the expected number. Since the expected number is based on a model fit to the same data, the statistic does not necessarily have a chi-squared distribution with J − 1 = 49 degrees of freedom. We interpret the goodness of fit statistics as informal summary measures of fit. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1749 4 1 6 6 0 3 2 0 5 6 7 9 0 3 8 64 147 145 137 145 139 153 156 168 201 231 324 786 224 546 , 2 Continues 41 a 167 147 144 150 169 174 182 185 166 200 257 200 604 2,786 2 1 0 7 7 5 2 7 8 9 2 0 0 0 1 3 5 9 0 0 15 20 32 45 75 REDICTIONS 135 345 P + 4 ODEL 0 0 3 6 7 0 0 13 18 29 29 33 53 M 113 304 9 0 1 1 6 2 1 9 2 3 9 0 3 0 0 5 0 0 45 71 83 93 92 99 97 96 97 77 49 908 + 3 OMPARISONS TO C 3 0 0 40 52 64 72 83 90 87 83 82 77 40 773 3 6 7 0 7 5 4 4 5 5 5 6 7 4 ROUP WITH 0 1 1 2 2 4 6 8 0 G 11 23 42 95 211 412 Number of Transitions TABLE V ONTROL 0 2 5 2 5 3 7 8 0 C 18 10 30 67 189 346 2 3 7 5 0 9 2 5 1 0 4 1 6 5 0 58 71 49 43 32 32 29 28 35 35 45 12 ATTERNS OF 100 573 P 0 38 87 72 66 70 59 58 55 41 40 37 11 125 759 ARTICIPATION 8 8 IA P 0 0 0 0 0 0 0 0 0 0 0 0 0 546 546 0 1 2 Even Sum Odd Sum Total 0 0 0 0 0 0 0 0 0 0 0 0 0 604 604 UMMARY OF S Months on IAActual and predicted cell fractions Actual0–2 from model in Table IV, column 3 (normal heterogeneity) Predict. Actual Predict. Actual Predict. Actual Predict. Actual Predict. Actual Predict. 3–8 9–14 15–20 21–26 27–32 33–38 39–44 45–50 51–56 57–62 63–67 68 69 Total 1750 D. CARD AND D. R. HYSLOP 2 9 8 8 7 2 1 1 7 2 3 8 0 7 4 49 123 139 150 160 171 176 198 191 190 219 203 786 190 620 , 2 he column 41 167 147 144 150 169 174 182 185 166 200 257 200 604 2,786 8 2 1 0 2 9 9 4 3 7 3 9 0 0 1 3 8 0 0 12 19 25 34 47 94 116 363 + 4 0 0 3 6 7 0 0 13 18 29 29 33 53 113 304 5 7 5 6 1 7 7 6 0 7 8 1 0 0 0 2 0 0 23 41 58 73 86 97 95 76 40 111 106 814 + 3 3 0 0 40 52 64 72 83 90 87 83 82 77 40 773 1 8 5 2 2 6 1 7 2 8 5 3 1 1 0 1 2 3 4 5 7 8 9 0 Continued 11 18 28 184 285 Number of Transitions 0 2 5 2 5 3 7 8 0 18 10 30 67 189 346 TABLE V— 6 6 6 9 4 7 4 9 1 4 3 1 6 6 6 0 46 97 94 85 75 66 51 51 42 35 29 19 702 0 38 87 72 66 70 59 58 55 41 40 37 11 125 759 4 4 0 0 0 0 0 0 0 0 0 0 0 0 0 620 620 0 1 2 Even Sum Odd Sum Total 0 0 0 0 0 0 0 0 0 0 0 0 0 604 604 Bold entries represent number of observations with the number of months on IA given in the row heading and the number of transitions off or on IA given in t a Months on IAActual and predicted cell fractions Actual0–2 from model in Table IV, Column 4 (mass point Predict. heterogeneity) Actual Predict. Actual Predict. Actual Predict. Actual Predict. Actual Predict. 3–8 9–14 15–20 21–26 27–32 33–38 39–44 45–50 51–56 57–62 63–67 68 69 Total heading. Italic entries represent the predicted number of observations with the same IA participation history. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1751 number of people (604 = 217% of the group) who were on IA continuously. The model underpredicts the size of this group (predicted number = 546.8). The control group also includes a relatively large number of people who left welfare for 1 month and then returned (these are the 189 = 68% of the sam- ple with 68 months on IA and 2 transitions). The model actually overpre- dicts the size of this group (predicted number = 211.7). Looking down the two right-hand columns, a second order model with normal heterogeneity provides reasonably accurate predictions for the distribution of total months on IA, with the exception of the last three groups (63–67 months on IA, 68 months on IA, and 69 months on IA). The model overpredicts the fractions with 63–67 or 68 months on IA and underpredicts the fraction with 69 months on wel- fare. A computationally feasible alternative to normally distributed heterogene- ity is the assumption that the random effects have a point mass distribution (Heckman and Singer (1984)). Column 4 of Table IV shows estimation results for one such model, with 4 mass points. We also computed models with 5 and 6 mass points, but found relatively little improvement in either the log likelihood or the goodness of fit statistics relative to the 4 mass-point model. Interest- ingly, the estimates of the state dependence coefficients are relatively similar in columns 3 and 4, although the mass-point model has a somewhat higher log likelihood. In comparing different mass-point models, we observed that once we allow for three or more points of support, one of the mass points tends to infinity (i.e., a value of 16 or more). Taken literally, this means there is a subgroup of “pure stayers” in the data who never leave welfare. This feature leads to an im- provement in the ability of the model to fit the distribution of welfare histories, as shown by the chi-squared statistics in Table IV and by the comparison of the actual and predicted distributions of welfare histories from this model in the lower panel of Table V. Another set of diagnostic statistics for the different models is presented in the bottom rows of Table IV. These are the estimated means, variances, and 1st–5th order autocorrelations of the generalized residuals from the different specifications. The generalized residual for person i in month t, evaluated at a given value of the random effect, is

1/2 rit(α) = (yit − pit)/[pit(1 − pit)] where pit = pit(α; β γ xityit−1yit−2) is the predicted probability of welfare participation, conditional on xit yit−1yit−2, the parameters β and γ, and the value of random effect. Note that if the model is correctly specified, at the 2 true value of the random effect for person i E[rit(αi)]=0, E[rit(αi) ]=1, and E[rit(αi)rit−j(αi)]=0. Thus a potential specification check is to compute the sample analogue of one of these statistics (averaging over i and t) and com- 1752 D. CARD AND D. R. HYSLOP pare the result to the expected value under the null hypothesis of a correctly specified model. The true value of the random effect for any given person is unknown. However, given the likelihood function, the marginal distribution of the random effects, and the observed sequence of data for individual i,wecan compute the posterior distribution for the random effect for that individual.35 We therefore evaluate the expectations using this posterior, and average the values of the resulting statistic across the entire sample. For the mass-point heterogeneity model, the posterior has only four points of support and the calculation is straightforward. For the normal heterogeneity models, we use a simulation approach, drawing 20 values of the random effect for each person and computing the posterior distribution for a given person over this set. The residual statistics for the first order model in column 1 show clear evi- dence of misspecification: the average variance is 1.11, rather than 1, and the 1st order autocorrelation is −0.07. The statistics for the other models are con- siderably better, although all three show a small negative value for the sec- ond order autocorrelation. These results suggest that models with unobserved heterogeneity and second order state dependence provide a reasonably good description of the welfare participation data, with relatively little serial corre- lation in the prediction errors from the models.

C. Models of Welfare Participation for the SSP Program and Control Groups We now turn to the specification of a model for the SSP program group. Building on the insights of the simple theoretical model and the rules of the SSP program, we include two separate treatment effects in the dynamic wel- fare participation model. The first is the essentially mechanical effect on wel- fare participation associated with the rules for establishing eligibility for SSP that required recipients to leave IA. The second effect is caused by the fact that members of the eligible subgroup have a greater incentive to choose work over welfare in any given month of their entitlement period. The key to distin- e 36 guishing these effects is the date that SSP eligibility is established, ti . For any member of the program group we identify up to four distinct phases: e − (1) the pre-entitlement period, ending at ti 1 for those who establish eligibility e in month ti and ending at the close of the eligibility window for those who do not; (2) the transitional period for those who establish eligibility, lasting

35Let (y|α) denote the likelihood for the sequence of welfare outcomes, conditional on a value of the random effect, the covariates, and the other parameters, and let f(α) denote the marginal distribution of the random effects. The posterior distribution of the random effects for apersonwithoutcomesy is f(α|y) = (y|α)f (α)/ (y|α)f (α)dα 36We discuss the actual measurement of this date in the Appendix. Also, we allow the eligi- bility window to last for 14 months rather than the 12 months specified in the SSP rules, due to processing delays, dating errors, and/or leniency in the application of the rules. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1753 for 3 months after the establishment of eligibility,37 when SSP program rules required newly eligible members of the program group to leave IA; (3) the e + entitlement period, lasting from the end of the transitional period to ti 36; and (4) the post-entitlement period, when supplement payments were no longer e + available, beginning at ti 37 for those who became eligible for SSP and at the close of the eligibility window for those who did not achieve eligibility. Let Eit represent an indicator for the event that individual i is eligible for SSP as of the start of month t. Note that the sequence {Eit } makesatmosta e single transition from 0 to 1 and that this occurs in the eligibility month ti (i.e., e = { = } ti mint Eit 1 ). We assume that IA participation and the SSP-eligibility indicators are correlated through their joint dependence on the unobserved heterogeneity component αi: (2) P(y y E E |x x ) i1 iT i1 iT i1 iT

= P(yitEit|yit−1yit−2Eit−1Eit−2xitαi) t

× f(αi)dαi

Using the fact that treatment status is randomly assigned, we also assume that the distribution of unobserved heterogeneity is the same for the program and the control groups. Conditional on αi and the covariates xit , we assume that Eit is determined independently of current or lagged IA status, while yit depends on current el- igibility, how long an individual has been eligible, and on two lags of previous IA status.38 Specifically, we assume

(3) P(yitEit|yit−1yit−2Eit−1Eit−2xitαi)

= P(Eit |Eit−1Eit−2xitαi)

× P(yit|yit−1yit−2EitEit−1xitαi) We adopt the specifications in columns 3 and 4 of Table IV as the baseline models for the control group, and assume that IA participation of the pro- gram group follows this model with the addition of a set of treatment effects

37The Appendix discusses the rationale for choosing 3 months for the transition period dura- tion. 38If people always stayed on IA until finding a full-time job and if all full-time jobs satisfied the SSP eligibility conditions (as assumed in our theoretical model), then the transition to eligibility would correspond with the time of first exit from IA. In fact, some people leave IA without moving to full-time work—for example, those who move in with a partner. A more complete model might deal with partnering as a “competing risk” that absorbs some welfare-leavers. On average, the hazard of becoming eligible for SSP is lower for people who are off IA or have previously left IA than for those still on IA. 1754 D. CARD AND D. R. HYSLOP that depend on which of the four phases the individual is currently occupying. Specifically, we assume that

e (4) P(y |y − y − E t x α) it it 1 it 2 it i it i = L αi + xitβ + (γ10 + γ11αi)yit−1 + (γ20 + γ21αi)yit−2 + + + e (γ30 γ31αi)yit−1yit−2 τ(tEitti yit−1) · e where L( ) represents the logistic distribution function and τ(tEitti yit−1) is the behavioral impact of SSP. We assume that the SSP establishment and enti- tlement treatment effects are confined to the transitional and the entitlement periods, respectively,39 and are constant within each of these periods, but allow separate effects depending on whether the individual was on or off IA in the previous month. We begin with specifications that assume the treatment effects are constant across individuals:

e τ(tEitti yit−1) = E × 1(te ≤ t ≤ te + J − 1) it i i × (ψ00 + ψ01αi)1(yit−1 = 0) + (ψ10 + ψ11αi)1(yit−1 = 1) + E × 1(te + J ≤ t ≤ te + 35) it i i × (λ00 + λ01αi)1(yit−1 = 0) + (λ10 + λ11αi)1(yit−1 = 1) where J (= 3) is the duration of the transition period, the ψ parameters mea- sure the “establishment” effects of SSP eligibility during the transitional period on individuals who were off or on IA in the previous month, respectively, and the λ parameters measure the corresponding incentive effects during the en- titlement period. Note that these specifications allow the treatment effects to vary linearly with the random effect. Given the nature of the eligibility process, a natural model for Eit is a hazard model for the event of establishing eligibility in month t, conditional on not establishing it earlier. We assume that the hazard of eligibility depends on the individual heterogeneity effect αi and on the time since random assignment,

(5) P(Eit|Eit−1Eit−2xitαi)   Φ[d(t) − k(αi)] if Eit−1 = 0andt ≤ Te, = =  1 if Eit−1 1, 0 if Eit−1 = 0andt>Te, where Φ is the standard normal distribution function, d(t) is a function of time, Te is the duration of the establishment period (14 months), and k(αi) is

39This assumption is consistent with the convergence of the IA participation rates of the pro- gram and controls groups after SSP payments ended, and is implied by the search model. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1755 a simple function of the random effect. For the case of normally distributed heterogeneity, we assume that k(αi) is linear (i.e., k(αi) = k0αi). For the case of mass-point heterogeneity, we adopt a more flexible specification and as- sume that k(αi) takes on a different value for each mass point (with one value normalized to 0). Note that if the probability of establishing eligibility is inde- pendent of the individual-specific determinants of welfare participation, then k(αi) will be constant for all values of the random effect. Based on the com- parisons of welfare outcomes near the end of the sample period in Table II, however, we expect k(αi) to be positively correlated with αi. Although the causal effects of the entitlement incentives on welfare transi- tion rates are directly identified in this model through the λ parameters, the effects of the establishment incentives are embedded in the eligibility model and the establishment treatment effects, and are not directly identified. Since we observe the IA outcomes of the control group, however, we can infer the net effect of these incentives. In essence the eligibility model and the establishment-period treatment effects provide a joint model of the selection process leading to the entitlement-period data for members of the treatment group (including SSP-eligibility status, the date of entitlement te, and the IA participation state in month te + 3), and of the deviation between the wel- fare outcomes of the treatment and control groups in the early months of the demonstration.

D. Estimation Results for Combined Models Table VI presents estimates of alternative specifications of equations (1)–(5). All the models adopt the baseline specification used in Table IV, columns 3 (for normally-distributed random effects) and 4 (for mass point heterogene- ity), including second order state dependence, interactions between the state dependence effects and the random effects, and a fourth order trend in the IA participation model. The specifications in columns 1–4 assume normally dis- tributed random effects (corresponding to column 3 of Table IV), while the specification in column 5 uses a 4 mass-point specification (corresponding to column 4 of Table IV). As a reference point, the model in column 1 ignores any correlation between SSP eligibility and the random effect, while the other specifications include an eligibility model based on equation (5), with a trend 40 function d(t) = d0 + d1(t − 1) + d2/(t − 2). The specifications in columns 1 and 2 assume that the treatment effects are constant across individuals (i.e., ψk1 = 0λk1 = 0, k = 0 1), while the specifications in columns 3–5 allow the treatment effects to vary linearly with the random effects. Finally, the models

40The 1/(t − 2) term is included to capture the fact that the hazard of eligibility falls from 6% in month 3 to around 2.5% in months 4–10. 1756 D. CARD AND D. R. HYSLOP

TABLE VI ESTIMATED DYNAMIC MODELS FOR IA PARTICIPATION FOR CONTROL AND PROGRAM GROUPSa

Model with Mass- Point Distribution Models with Normally Distributed Random Effect of Random Effect

(1) (2) (3) (4) (5) State dependence parameters γ10 478 472 468 469 460 (004)(004)(004)(004)(006) γ20 190 186 184 184 169 (003)(003)(003)(003)(004) γ30 −087 −080 −079 −082 −077 (005)(005)(005)(005)(006) γ11 −074 −090 −077 −111 −093 (004)(004)(004)(007)(003) γ21 −040 −035 −033 −040 −055 (003)(003)(003)(004)(002) γ31 076 079 074 109 061 (005)(005)(004)(008)(003) Treatment parameters Transitional period ψ10 (exit) −302 −276 −326 −310 −282 (006)(006)(007)(007)(006) ψ00 (entry) −192 −163 −184 −175 −168 (013)(013)(014)(014)(014) ψ11 ——−053 −061 −010 (006)(007)(002) ψ01 ——−027 −022 −026 (009)(015)(005) Eligibility period λ10 (exit) −135 −110 −108 −111 −086 (004)(005)(004)(004)(005) λ00 (entry) −086 −051 −069 −072 −045 (005)(005)(005)(005)(006) λ11 ——−003 −007 035 (004)(006)(005) λ01 ——−026 −035 −006 (004)(007)(005) Selection parameters d0 — −223 −223 −222 −201 (006)(006)(006)(007) d1 —019 018 018 018 (004)(006)(006)(006) d2 —055 055 055 057 (008)(008)(008)(008) k0 —021 021 029 Mass-point (001)(002)(002) specific

Continues EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1757

TABLE VI—Continued

Model with Mass- Point Distribution Models with Normally Distributed Random Effect of Random Effect

(1) (2) (3) (4) (5) Interaction of random effect and trend Linear trend × α ———030 001 (004)(001) Quadratic trend × α ———−033 −001 (006)(001) Standard deviation of 175 176 186 118 4 mass points random effect, σα (004)(004)(004)(006) Log likelihood −57,018 −61,116 −61,032 −60,960 −60,779 Goodness of fit Controls 277.1 285.8 283.3 262.5 158.1 Programs 194.1 232.6 233.8 233.7 209.8

aStandard errors in parentheses. See text for model specifications. All models include fourth order polynomial trend. Models in columns 1–4 are estimated by maximum likelihood using Gaussian quadrature with 10 points. The model in column 5 has 4 mass points, with unrestricted mass points in the selection model. See text for further de- scription of model.

in columns 4 and 5 allow for individual heterogeneity in the trend in IA par- ticipation by including interactions of the random effect with a quadratic in months since random assignment. The models in Table VI yield estimates of the state dependence parame- ters that are similar to the estimates obtained for the control group alone. The estimated treatment effects are roughly similar across specifications, with large negative estimates of the transition-period establishment effects and smaller but significantly negative entitlement-period effects. A comparison of the treatment effects in columns 1 and 2, however, shows that the implied entitlement-period effects are about 30% larger when eligibility is treated as exogenous (column 1) than when it is modeled as endogenous (column 2). This is the pattern that would be expected if people with a lower probabil- ity of IA participation are more likely to become SSP-eligible. That is, in the model in column 2 some of the differential in entitlement-period transition rates between the eligible and ineligible program subgroups is attributed to the selectivity of eligibility status, whereas in the model in column 1 all of the dif- ference is assigned to a causal effect of SSP.Consistent with this interpretation, the estimates of the parameter k0 from the eligibility model are positive and highly significant in columns 2–4. The implication is that the distribution of the random effects among those who became eligible is much different than the distribution among the ineligibles. For example, simulations from the model in column 2 of Table V show that the median of the αi for the eligible program group is −0.98, while the median for the ineligible group is 0.36. (By assump- tion the mean and median of the αi is 0 for the overall population.) 1758 D. CARD AND D. R. HYSLOP

The bottom rows of Table VI show goodness of fit statistics that summarize each model’s ability to predict the distributions of the control and program groups across the 50 cells used in Table V. Perhaps unsurprisingly, the spec- ification in column 1, which treats eligibility status as exogenous, provides a slightly better fit than the specification in column 2, which treats it as endoge- nously determined, although the eligibility model is flexible enough to provide an accurate prediction of the fraction who achieved eligibility, and a reason- able fit to the distribution of months to eligibility.41 The model in column 3 generalizes the specification in column 2 by allow- ing the SSP treatment effects to vary with the random effects. The interaction term is especially large for the transitional-period effect on IA exits, and im- plies that SSP eligibility raised the log odds of leaving welfare more for people with higher values of the individual effect αi (i.e., those who were less likely to leave in the absence of the program). Indeed, the predicted probabilities of leaving IA conditional on attaining SSP eligibility from this more general model are roughly constant across the distribution of αi. Since most people exited IA at about the same time as they became SSP-eligible (see Figure 2a), the generalized model gives a better description than one that assumes a ho- mogeneous effect on the log odds. The specification in column 4 introduces an additional degree of flexibility by including interactions of αi with a quadratic in months since random assign- ment. We developed this model out of concern that imposing a homogeneous trend might inadvertently bias our estimated treatment effects, since the eligi- ble program group has a nonrandom distribution of αi. As with the other inter- action terms, the trend interactions are statistically significant, although their introduction has little effect on the size of the estimated treatment effects. The specification with trend interactions provides a slightly better goodness of fit for the program group than a comparable model without these terms, but im- plies very similar treatment effects. Finally in column 5 we adopt the same specification as in column 4, but re- place the assumption of a normal distribution for the random effects with the assumption of a 4 mass-point distribution. For each mass point we estimate a value for αi, a value for the constant in the eligibility model, and the fraction of the population associated with the point. As was true for models fitted to the control group only, the normal heterogeneity and 4 mass-point models pro- vide relatively similar parameter estimates, although the goodness of fit statis- tics are somewhat better for the mass-point model. Again one of the estimated

41The models in columns 2–5 of Table VI all give very accurate predictions for the fraction of the program group who achieved eligibility. Plots of the actual and predicted eligibility hazards (included in the Appendix materials available from the Supplemental Material website) also show a good fit apart from the last 2 months. The models underpredict the hazard in month 14 and overpredict the hazard in month 15. The root-mean-squared prediction error from the model in column 4, for example, is 0.37% (relative to an average hazard rate of 2.89%). Excluding the last two months, the root-mean-squared error is 0.08%. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1759 mass points is essentially infinite, implying that some fraction of the popula- tion never leaves welfare. Interestingly, the fraction of “never leavers” is lower in the SSP program group than in the control group (17.1% versus 21.7%), suggesting that not all those who never left welfare in the control group were “pure stayers.” This creates something of a problem for the mass-point model, which overpredicts the fraction of never leavers in the program group and un- derpredicts the fraction in the control group. By including separate intercepts in the eligibility model for each mass point, the selection model in column 5 is considerably more general than the “one factor” model in columns 2–4. However, the estimated mass points in the welfare participation and eligibility models are very highly correlated (correlation = 094 across the 4 mass points), suggesting that the restriction embedded in our normal heterogeneity models may be relatively innocuous. Table VII compares the predictions from the models in columns 4 and 5 in Table VI for the distribution of welfare histories of the program group. Over- all, the fits are similar, although the goodness of fit statistic is a little better for the mass-point model. The two models also give very similar predictions for mean levels of IA participation in each month of the SSP experiment. In view of similarities between the estimates and predictions from the two models, we have reasonable confidence that our estimates are insensitive to the parame- terization of heterogeneity. Figure 7 shows predicted and actual IA participation rates for the program and control groups in the 69 months after random assignment, based on the normal heterogeneity model in column 4 of Table VI. (Predictions from the mass-point heterogeneity model are nearly identical.) Overall, the predictions are fairly accurate, although the model slightly overpredicts welfare partici- pation of the program group in the period around the close of the eligibility window (months 13–15) and also overpredicts IA participation rates of both groups in months 43–50. The model explains over 99% of the variance in av- erage monthly IA participation of both the program and control groups, with root-mean-squared prediction errors of 0.6 and 0.9%, respectively. (The cor- responding figures for the mass-point model in column 5 are 0.7 and 0.9%.) Further insight into the accuracy of the model is provided in Figure 8, which shows predicted and actual welfare participation rates for the eligible and ineligible program groups. The predictions for the ineligible group are rela- tively accurate (root-mean-squared error of 1.5%), while those for the eligi- ble group are a little less so (root-mean-squared error 2.6%), particularly in months 13–18. The model has particular difficulty reproducing the “dip” in welfare participation just after the close of the eligibility window. A closer look at the data around this point suggests that a relatively high fraction of those who achieved SSP eligibility near the end of the eligibility window returned to IA within a few months. Such behavior is consistent with our theoretical model, which predicts that some people will take a relatively unattractive job to gain eligibility and then quit immediately. Although our empirical model allows 1760 D. CARD AND D. R. HYSLOP 78.0 831.0 253.0 251.0 178.3 141.4 142.7 139.1 130.6 155.9 176.6 212.8 326.4 448.6 196.6 , Continues 2 94 230 260 168 141 147 157 156 159 178 214 263 483 181 a 2,831 0 0 0 5.5 1.3 10.3 13.4 21.5 25.6 30.6 44.7 59.2 90.5 REDICTIONS 458.5 155.9 P + 4 ODEL 2 1 0 0 0 14 21 25 26 25 37 62 90 M 444 141 7.1 0.0 0.0 87.2 97.9 95.0 88.1 77.3 82.0 80.5 63.4 42.0 134.8 119.4 974.7 + 3 OMPARISONS TO C 9 0 0 58 83 78 86 77 65 58 56 24 104 103 801 WITH , ROUP G 0 4.4 4.3 2.8 3.0 0.3 5.3 5.8 6.9 8.9 15.3 35.0 99.9 Number of Transitions 379.9 188.0 TABLE VII 4 3 3 4 0 5 7 5 0 ROGRAM 18 17 37 72 P 347 172 0 8.6 ATTERNS OF 44.3 27.1 70.6 20.9 19.6 15.8 20.3 21.6 23.9 28.6 P 108.3 159.7 569.3 0 9 48 33 85 39 38 49 39 41 31 26 120 198 756 ARTICIPATION 6 6 0 0 0 0 0 0 0 0 0 0 0 0 0 IA P 448 448 0 1 2 Even Sum Odd Sum Total 0 0 0 0 0 0 0 0 0 0 0 0 0 483 483 UMMARY OF S 9–14 15–20 21–26 3–8 Months on IAActual and predicted cell fractions Actual0–2 from model in Table VI, column 4 (normal heterogeneity) Predict. Actual Predict. Actual Predict. Actual Predict. Actual Predict. Actual Predict. 27–32 33–38 39–44 45–50 51–56 57–62 63–67 Total 68 69 EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1761 88.9 248.6 246.4 182.9 159.7 156.3 155.9 162.5 155.5 156.2 180.9 216.8 831.0 173.3 547.1 , 2 he column 94 230 260 168 141 147 157 156 159 178 214 263 181 483 2,831 0 0 0 4.3 0.8 8.5 15.2 23.8 30.7 38.3 44.7 52.8 86.0 138.0 443.1 + 4 2 1 0 0 0 14 21 25 26 25 37 62 90 141 444 9.4 0.0 0.0 97.6 96.5 65.5 90.7 90.8 87.8 89.8 78.7 70.1 62.6 30.5 870.0 + 3 9 0 0 58 83 78 86 77 65 58 56 24 104 103 801 Continued 0 4.7 6.4 2.5 6.5 0.1 5.7 5.6 6.6 9.0 10.7 14.8 35.5 Number of Transitions 274.5 166.4 4 3 3 4 0 5 7 5 0 18 17 37 72 347 172 TABLE VII— 0 6.9 71.5 47.3 79.4 36.0 31.8 27.8 23.1 22.6 17.5 12.8 142.0 177.6 696.3 0 9 48 33 85 39 38 49 39 41 31 26 120 198 756 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 547 547 0 1 2 Even Sum Odd Sum Total 0 0 0 0 0 0 0 0 0 0 0 0 0 483 483 Bold entries represent number of observations with the number of months on IA given in the row heading and the number of transitions off or on IA given in t a 9–14 15–20 3–8 21–26 Months on IAActual and predicted cell fractions Actual0–2 from model in Table VI, column 5 (mass-point heterogeneity) Predict. Actual Predict. Actual Predict. Actual Predict. Actual Predict. Actual Predict. 27–32 33–38 39–44 45–50 51–56 57–62 63–67 68 69 Total heading. Italic entries represent the predicted number of observations with the same IA participation history. 1762 D. CARD AND D. R. HYSLOP

FIGURE 7.—Actual and predicted IA rates for control and program groups. a greater effect on welfare participation in the first 3 months after initial eligi- bility than in the subsequent entitlement period, it is evidently too restrictive to fully capture the phenomenon. Another problem for the model is the trend in welfare participation of the eligible program group 18–42 months after random assignment. During this period the participation rate of the eligible group is very stable, whereas the

FIGURE 8.—Actual and predicted IA rates for eligible and ineligible program groups. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1763 model predicts a decline, particularly after month 30. The predicted trend essentially tracks the trends in the control group and the ineligible program group: both of these groups show steady declines in IA participation through- out the experimental period. Even allowing for heterogeneity in the trends for different values of the random effect, the best fitting model cannot explain the absence of a parallel trend for the eligible program group. The same problem is evident in the predictions from the model with mass-point mixing. Finally, it is interesting to examine the fit of the model in months 54–69, when the treatment effects are all assumed to be zero. In this interval, the aver- age predicted welfare participation rate for the program group is a little below the actual rate, although the predicted and actual levels are nearly identical at month 69. To probe this further, we fitted a model that allowed a fraction θ of the entitlement-period treatment effects to persist after the expiration of SSP. For a specification parallel to the one in column 4 of Table VI, the estimate of θ is 0.43 (with a standard error 005), suggesting that an important frac- tion of the treatment effect persisted. Simulations of this model show that it does a better job of predicting IA participation of the eligible program group in months 55–64, but a worse job in months 65–69, underpredicting the rise in IA participation of the eligible program subgroup at the end of the follow-up period. Based on this poor fit and the evidence in Figure 1a of convergence in welfare participation, we believe that models that set the postexpiration effects to zero provide a more robust description of the data.

E. Decomposing SSP’s Effects By simulating the models in Table VI with the various treatment effects turned on or off it is possible to gain some additional insights into the be- havioral responses of the program group and, in particular, into the “hump- shaped” pattern of SSP impacts on IA participation rates shown in Figure 1a. Figure 9 uses the model in column 4 of Table VI to decompose the predicted monthly welfare participation rates of the eligible program group into selection effects, establishment effects during the transitional period, and entitlement effects during the entitlement period, while Figure 10 shows the predicted and actual SSP impacts on IA participation, along with 95% confidence intervals around the actual impacts. Beginning with Figure 9, we first describe the selection effect associated with the SSP-eligible program group. As a point of reference, the solid line represents the predicted path of IA participation for the control group. The dotted line shows the predicted welfare participation rate of the eligible pro- gram group in the absence of SSP. The divergence of this path from the solid line reflects the selective nature of the eligible program group. For example, in month 36 the model predicts a 46% IA participation rate for the eligible program group in the absence of any treatment effects, versus a 66% rate for the control group. Next, we plot the path of the eligible program group, tak- ing account only of the establishment treatment effects associated with finding 1764 D. CARD AND D. R. HYSLOP

FIGURE 9.—Decomposition of predicted IA of eligible treatments. work and leaving IA during the eligibility window (plotted as a dotted line with squares). Although the establishment treatment effects peak just after

FIGURE 10.—Actual and predicted treatment effects. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1765 the close of the eligibility window, they persist far longer because of the high degree of state dependence in welfare participation. Finally, the fourth line in Figure 9 (with solid squares) represents the predicted IA participation rate, taking account of selection and the transitional- and entitlement-period effects of SSP on IA behavior. Comparisons of the various paths in Figure 9 show that in the first year and a half of the experiment (months 6–18) most of the overall treatment effect for the eligible program group derived from the establishment effects. Over the period from the 18th to 36th month this effect gradually dissipated and the entitlement effects dominated. Starting in month 36 and continuing through month 52 members of the eligible program group gradually exhausted their 3 years of supplement eligibility and the treatment effect faded out. Finally, af- ter month 52 all treatment effects ended and the eligible program group grad- ually returned to their path in the absence of any treatment. Although not shown in Figure 9, we have also decomposed the entitlement-period effects on IA participation into a component due to faster IA exits and a component due to slower IA entry. Roughly three-quarters of the overall entitlement-period effect is attributable to faster welfare exit rates, while one-quarter is attribut- able to reduced welfare entry rates. We have conducted simulations of the other models in Table VI and de- composed the predicted treatment effects from these models using the same approach as in Figure 9. The results are fairly similar across specifications. In particular, the models in columns 4 and 5 lead to very similar predictions for the various combinations of treatment effects. All the models suggest that the time profile of the SSP impact on IA participation on the eligible program group was driven by the combination of the one-time establishment incentive associated with the eligibility rules, and the longer-run entitlement effect on welfare entry and exit rates that ended once individuals’ SSP eligibility expired. The establishment-incentive impact reached a peak of about −20 percentage points at 15 months after random assignment, accounting for 55% of the over- all impact on the eligible program group at that point. By 3 years after random assignment, this effect had faded and accounted for 15% or less of the total impact on welfare participation. The impact of the entitlement-period effects peaks at about −20 percentage points by 2 years after random assignment and is fairly stable over the next year before dissipating as people come to the end of their 3-year supplement entitlement window. Figure 10 shows the predicted impact of SSP on the overall behavior of the program group relative to the control group (based on the model in column 4 of Table VI), along with a comparison to the actual experimental impact. Our model provides a simple and reasonably successful explanation for the distinc- tive “V-shaped” profile of the SSP impacts. Overall, the predicted and actual impacts are fairly close (and always well inside the 95% confidence bands). On close examination there are two intervals where the predicted and actual im- pacts depart systematically. One is during the period from months 30 to 42, 1766 D. CARD AND D. R. HYSLOP when the impacts declined faster than our model can predict. The other is dur- ing the period from months 54 to 69. In the first part of this interval our model tends to underpredict SSP’s impact on IA participation of the program group relative to the controls, while by month 63 the predictions are very close. On average in the post-eligibility period, then, the predicted treatment effects are slightly too small. This explains why a specification that allows a posteligibility treatment effect shows some evidence of persistence.

F. Evaluating Alternative Programs Although our model of SSP is not structural, it can be used to help evaluate the impacts of alternative subsidy programs. For example, Figure 11 compares the time profile of simulated treatment effects for the actual SSP program and two variants. The first alternative has 48 months of subsidy eligibility. In our simulation, we assume that the extended entitlement-period has no effect on the eligibility process: thus, we simply “turn on” the entitlement-period treat- ment effects for 12 extra months. Arguably, this is a lower bound on the impact of the alternative program, since a longer entitlement period would, presum- ably, involve a larger option value and encourage more people to become eligi- ble for SSP.Even ignoring any effect on eligibility, however, our results suggest that an extended entitlement period would have led to a treatment effect on the order of 7% in months 48–54, roughly double the observed impact. The second alternative has a slightly relaxed eligibility rule: in particular, the establishment period is extended by 3 months. We simulate this alternative by

FIGURE 11.—Simulated impacts of SSP program and two alternatives. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1767 modifying the time-limit parameter Te in the eligibility model (equation (5)). This has no effect on the timing of eligibility for people who would have been eligible under the original program and simply continues the eligibility process for 3 more months, raising the predicted fraction of the treatment group who achieve eligibility from 33.8 to 39.2%. This simulation presumably overstates the impact of the extension, since some people who actually achieved eligibil- ity near the close of the establishment window might have waited longer if the deadline was extended.42 That said, the simulation suggests that allowing more time for people to establish eligibility would have led to at most a 20% larger program impact in months 18–45 and also would have shifted the peak pro- gram impact to the right somewhat. While these results show that our model can provide some insights into alternative subsidy programs, they also under- score the fact that a complete structural model is needed to fully evaluate even minor variants of the actual SSP program.

3. CONCLUSIONS The SSP experiment produced one of the largest impacts on welfare partic- ipation ever recorded in the experimental evaluation literature. At peak, SSP generated a 14 percentage point reduction in welfare participation. The im- pact of the program faded relatively quickly, however. Within 18 months of the peak impact, the gap in welfare participation between the treatment and con- trol groups of the experiment had closed by 50%; by the end of the follow-up period, the welfare participation rates of the two groups were equal. In this paper we offer an explanation for this pattern of impacts. Unlike most other experimental incentive programs, the SSP treatment group was not au- tomatically eligible for the financial treatment. Instead, eligibility was limited to those who initiated subsidy payments within a year of random assignment. Program group members faced a powerful incentive to find a job within the time limit in order to establish their eligibility for up to 3 more years of subsidy payments. Since the program rules required subsidy recipients to leave welfare, this establishment incentive generated a transitory reduction in welfare recip- iency in the program group. Members of the program group who achieved eligibility faced a continuing incentive to choose work over welfare through- out their entitlement period. We conclude that the combination of these two incentives provides a parsimonious explanation for both the large size and dis- tinctive time profile of the SSP impact. A second and related finding is that the additional work effort by the pro- gram group had no lasting impact on wages. Most of the extra hours were at jobs paying close to the minimum wage, and there was no upward trend in

42In reality there is not much of a “spike” in eligibility near the end of the establishment period. The hazard rate is 2.7% in the last 2 months, compared with a rate of 2.8% in the preceding 3 months. 1768 D. CARD AND D. R. HYSLOP wages associated with the extra hours. By 52 months after random assignment, when subsidy payments had ended, the employment rates of the program and control groups were equal and the distributions of wages of the two groups were also essentially identical. Since the marginal gain in work experience was relatively small (less than one-third of a year, on average) and members of the experimental population had significant work experience before the exper- iment, the lack of wage growth is consistent with other evidence on the effects of work experience on wages of less skilled workers. Overall, the findings from SSP suggest that welfare recipients respond to dy- namic incentives in a manner remarkably consistent with the predictions from a simple optimizing model. On the other hand, the lack of effects on wages or long-run welfare participation offers little support for the idea that temporary wage subsidies can have a permanent effect on program dependency. Dept. of Economics, 549 Evans Hall, #3880, University of California, Berkeley, CA 94705-3880, U.S.A., and NBER; [email protected] and New Zealand Treasury, P.O. Box 3724, Wellington, New Zealand; dean.hyslop@ treasury.govt.nz.

Manuscript received September, 2002; final revision received April, 2005.

APPENDIX: DATING SSP ELIGIBILITY The actual date that different individuals achieved SSP eligibility is not recorded. Based on the patterns in Figure 2a, we estimate the date of SSP eligibility as the earliest of three possible dates: (1) the first month of full-time employment; (2) the first month of SSP receipt minus 1 month for the delay in processing; (3) 14 months after random assignment. The assumption that the duration of the eligibility window was 14 months, rather than 12 as stated in the SSP rules, reflects the presence of delays in processing and administrative laxity. There are only a handful of eligible program group members for whom the minimum of the first month of full-time employment and the first month of SSP receipt (minus 1) is greater than 14. Using these dates, about 18% of the eligible program subgroup achieved eli- gibility in the first month after random assignment, 9% became eligible in each of the second and third months, and roughly 6% became eligible in each of the next 10 months. Just under 3% became eligible in the last possible month (month 14). Recognizing the delay between the start of SSP eligibility and leav- ing IA (Figure 2a), we then add 2 months to these dates for our analysis of welfare dynamics. The resulting distribution of adjusted eligibility dates ranges from 3 to 16 months after random assignment. A final decision, also made with reference to the patterns in Figure 2a, was to set the duration of the transition period to 3 months. EFFECTS OF A SUBSIDY FOR WELFARE-LEAVERS 1769

REFERENCES

ABRAMOWITZ,M.,AND I. A. STEGUN (1965): Handbook of Mathematical Functions.NewYork: Dover. AHN,H.,AND J. L. POWELL (1993): “Semiparametric Estimation of Censored Selection Models with a Non-Parametric Selection Mechanism,” Journal of Econometrics, 58, 3–29. BLUNDELL,R.,AND H. HOYNES (2004): “Has ‘In-Work’ Benefit Reform Helped the Labour Market,” in Seeking a Premier Economy: The Economic Effects of British Economic Reforms, 1980–2000, ed. by D. Card, R. Blundell, and R. Freeman. Chicago: University of Chicago Press for NBER, 411–459. BUTLER,J.S.,AND R. MOFFITT (1982): “A Computationally Efficient Quadrature Procedure for the One-Factor Multinomial Probit Model,” Econometrica, 50, 761–764. CARD, D., AND D. R. HYSLOP (2005): “Estimating the Effects of a Time-Limited Earnings Sub- sidy for Welfare Leavers,” Working Paper 05-02, Social Research Demonstration Corporation, Ottawa. CARD, D., AND D. G. SULLIVAN (1988): “Measuring the Effect of Subsidized Training Programs on Movements In and Out of Employment,” Econometrica, 56, 497–530. CHAY,K.Y.,AND D. R. HYSLOP (2001): “Identification and Estimation of Dynamic Binary Re- sponse Panel Data Models: Empirical Evidence Using Alternative Approaches,” Unpublished Manuscript, Department of Economics, UC Berkeley. FORTIN,B.,P.FRECHETTE, AND T. L EMIEUX (1994): “The Effect of Taxes on Labor Supply in the Underground Economy,” American Economic Review, 84, 983–1014. FORTIN,N.,AND T. L EMIEUX (2001): “Income Redistribution in Canada: Minimum Wages versus Other Policy Instruments,” in Adapting Public Policy to a Labour Market in Transition,ed.by W. C. Riddell and F. St. Hilaire. Kingston and Montreal: McGill–Queens University Press, 211–247. GLADDEN,T.,AND C. TABER (2000): “Wage Progression Among Less Skilled Workers,” in Find- ing Work: Jobs and Welfare Reform, ed. by R. M. Blank and D. Card. New York: Russell Sage Foundation, 160–192. GREENBERG, D., D. MEYER,C.MICHALOPOULOS, AND P. R OBINS (1992): “Simulation Estimates of the Net Employment Impacts of an Employment Subsidy Program for Long Term Welfare Recipients in Canada,” Unpublished Manuscript, Manpower Demonstration Research Corpo- ration, New York. HAM,J.,AND R. J. LALONDE (1996): “The Effect of Sample Selection and Initial Conditions in Duration Models: Evidence from Experimental Data on Training,” Econometrica, 64, 175–206. HAMILTON,G.,S.FREEDMAN,L.GENETIAN,C.MICHALOPOULOS,J.WALTER, D. ADAMS-CIARDULLO,S.MCGRODER,M.ZASLOW,J.BROOKS,S.AHLUWALIA,E.SMALL, AND B. RICCHETTI (2001): How Effective Are Different Welfare-to-Work Approaches? Five-Year Adult and Child Impacts for Eleven Programs. New York: Manpower Demonstration Research Corporation. HECKMAN, J. J (1981): “The Incidental Parameters Problem and the Problem of Initial Condi- tions in Estimating Discrete Time–Discrete Data Stochastic Processes,” in Structural Analysis of Discrete Data with Econometric Applications, ed. by C. F. Manski and D. McFadden. Cambridge, MA: MIT Press, 179–195. HECKMAN,J.J.,AND B. SINGER (1984): “A Method for Minimizing the Impact of Distributional Assumptions in Econometric Models for Duration Data,” Econometrica, 52, 271–320. HEXTER, M. B. (1917): “Persistency of Dependency—A Study in Social Causation,” Publications of the American Statistical Association, 15, 860–867. HUMAN RESOURCES DEVELOPMENT CANADA (HRDC) (1993): Inventory of Income Security Pro- grams in Canada. Ottawa: HRDC. IMBENS,G.,AND J. D. ANGRIST (1994): “Identification and Estimation of Local Average Treat- ment Effects,” Econometrica, 62, 467–476. 1770 D. CARD AND D. R. HYSLOP

LIN,W.,P.K.ROBINS,D.CARD,K.HARKNETT, AND S. LUI-GURR (1998): When Financial In- centives Encourage Work: Complete 18 Month Findings from the Self Sufficiency Project. Ottawa: Social Research and Demonstration Corporation. MICHALOPOULOS, C., D. CARD,L.A.GENNETIAN,K.HARKNETT, AND P. K . R OBINS (2000): The Self Sufficiency Project at 36 Months: Effects of a Financial Work Incentive on Employment and Income. Ottawa: Social Research and Demonstration Corporation. MICHALOPOULOS, C., D. TATTRIE,C.MILLER,P.K.ROBINS,P.MORRIS,D.GYARMATI, C. REDCROSS,K.FOLEY, AND R. FORD (2002): Making Work Pay: Final Report of the Self Suf- ficiency Project for Long Term Welfare Recipients. Ottawa: Social Research and Demonstration Corporation. MIJANOVICH,T.,AND D. LONG (1995): Creating an Alternative to Welfare: First-Year Findings on the Implementation, Welfare Impacts, and Costs of the Self-Sufficiency Project. Vancouver: Social Research and Demonstration Project. MOORE, D. (1977): “Generalized Inverses, Wald’s Method, and the Construction of Chi-Squared Te s t s o f F i t , ” Journal of the American Statistical Society, 72, 131–137. MORTENSEN, D. T. (1997): “ Insurance and Job Search Decisions,” Industrial and Labor Relations Review, 30, 505–517. (1986): “Job Search and Labor Market Analysis,” in Handbook of Labor Economics, Vol. II, ed. by O. Ashenfelter and R. Layard. New York: North-Holland, 849–920. PHELPS, E. S. (1994): “Low-Wage Employment Subsidies versus the Welfare State,” American Economic Review, 84, 54–58. PLANT, M. W. (1984): “An Empirical Analysis of Welfare Dependence,” American Economic Re- view, 74, 673–694.