<<

A Service of

Leibniz-Informationszentrum econstor Wirtschaft Leibniz Information Centre Make Your Publications Visible. zbw for Economics

Jappelli, Tullio; Pistaferri, Luigi

Working Paper Intertemporal choice and mobility

CFS Working Paper, No. 2005/28

Provided in Cooperation with: Center for Financial Studies (CFS), Goethe University Frankfurt

Suggested Citation: Jappelli, Tullio; Pistaferri, Luigi (2005) : Intertemporal choice and consumption mobility, CFS Working Paper, No. 2005/28, Goethe University Frankfurt, Center for Financial Studies (CFS), Frankfurt a. M., http://nbn-resolving.de/urn:nbn:de:hebis:30-23408

This Version is available at: http://hdl.handle.net/10419/25464

Standard-Nutzungsbedingungen: Terms of use:

Die Dokumente auf EconStor dürfen zu eigenen wissenschaftlichen Documents in EconStor may be saved and copied for your Zwecken und zum Privatgebrauch gespeichert und kopiert werden. personal and scholarly purposes.

Sie dürfen die Dokumente nicht für öffentliche oder kommerzielle You are not to copy documents for public or commercial Zwecke vervielfältigen, öffentlich ausstellen, öffentlich zugänglich purposes, to exhibit the documents publicly, to make them machen, vertreiben oder anderweitig nutzen. publicly available on the internet, or to distribute or otherwise use the documents in public. Sofern die Verfasser die Dokumente unter Open-Content-Lizenzen (insbesondere CC-Lizenzen) zur Verfügung gestellt haben sollten, If the documents have been made available under an Open gelten abweichend von diesen Nutzungsbedingungen die in der dort Content Licence (especially Creative Commons Licences), you genannten Lizenz gewährten Nutzungsrechte. may exercise further usage rights as specified in the indicated licence. www.econstor.eu

No. 2005/28 Intertemporal Choice and Consumption Mobility

Tullio Jappelli and Luigi Pistaferri

Center for Financial Studies

The Center for Financial Studies is a nonprofit research organization, supported by an association of more than 120 banks, insurance companies, industrial corporations and public institutions. Established in 1968 and closely affiliated with the University of Frankfurt, it provides a strong link between the financial community and academia.

The CFS Working Paper Series presents the result of scientific research on selected top- ics in the field of money, banking and finance. The authors were either participants in the Center´s Research Fellow Program or members of one of the Center´s Research Pro- jects.

If you would like to know more about the Center for Financial Studies, please let us know of your interest.

Prof. Dr. Jan Pieter Krahnen Prof. Volker Wieland, Ph.D.

CFS Working Paper No. 2005/28

Intertemporal Choice and Consumption Mobility*

Tullio Jappelli1 and Luigi Pistaferri2

April 2005

Abstract: The theory of intertemporal consumption choice makes sharp predictions about the evolution of the entire distribution of household consumption, not just about its conditional mean. In the paper, we study the empirical transition matrix of consumption using a panel drawn from the Bank of Survey of Household Income and Wealth. We estimate the parameters that minimize the distance between the empirical and the theoretical transition matrix of the consumption distribution. The transition matrix generated by our estimates matches remarkably well the empirical matrix, both in the aggregate and in samples stratified by education. Our estimates strongly reject the consumption insurance model and suggest that households smooth income shocks to a lesser extent than implied by the permanent income hypothesis.

JEL Classification: D52, D91, I30

Keywords: Consumption Dynamics, Mobility

* Thanks are due to three anonymous referees, the Editor Roberto Perotti, and to Orazio Attanasio, Chris Carroll, Miles Kimball, Raffaele Miniaci, John Pencavel, and seminar participants at the 2004 Meeting of the Society of Economic Dynamics in Florence, Padua, Seattle, and Michigan for comments. Work supported in part by the European Community’s Human Potential Programme under contract HPRN-CT-2002-00235 [AGE: The Economics of Aging in Europe], the “Taube Faculty Research Fund” at the Stanford Institute for Economic and Policy Research and the Italian Ministry of Education, Universities and Research (MIUR).

1 University of Salerno, CSEF, and CEPR, email: [email protected]

2 and CEPR, email: [email protected] 1 Introduction

How much consumption responds to income shocks is a central question in macroe- conomics. As is well known, this response depends on the particular model that characterizes consumption behavior. For instance, the permanent income hypoth- esis (PIH) implies that households set consumption equal to permanent income, smoothing out transitory income fluctuations, but responding one-to-one to per- manent shocks. On the other hand, under perfect insurance markets the distribu- tion of marginal utilities is constant over time, and consumption does not respond to idiosyncratic income shocks. Deaton and Paxson (1994) have studied the im- plications of these theories for the dynamics of consumption inequality. They show that under the PIH or other models with incomplete markets, consump- tion inequality within a group of households with fixedmembershipshould,on average, increase with age. By contrast, the cross-sectional variance of marginal utilities should be constant over time under perfect insurance markets. The dy- namics of consumption inequality is therefore informative about the impact of income shocks and about the validity of models of consumption behavior.1

1Cutler and Katz (1992) provide descriptive analysis of consumption inequality. More re- cently, motivated by the increase in income inequality in the US, a strand of papers has analyzed patterns of consumption inequality. Heatchote, Storesletten and Violante (2003), for example, explore the implications of the recent sharp rise in US wage inequality for welfare and the cross-sectional distributions of hours worked, consumption and earnings.

3 Our point of departure from this literature is that measures of consumption inequality do not always provide an accurate measure of household behavior. Con- sumption inequality is a static concept, and as such it cannot handle violations of the life cycle-permanent income hypothesis (such as borrowing constraints or my- opic behavior), which would imply a role for transitory income fluctuations over and above permanent fluctuations, or buffer stock behavior, which would imply more smoothing of income shocks than predicted by the standard model. The handling of these issues, we argue, calls for an analysis of consumption mobility.

The distinction between consumption inequality and consumption mobility is, effectively, a distinction between static and dynamic features of a distribution.

Inequality refers to the dispersion of consumption at a point in time. Mobil- ity describes movements within the consumption distribution as time goes by.

Studies of consumption inequality may record no change in the dispersion of the underlying distribution even in the presence of intra-distributional movements, with direct implications for welfare analysis.2 Despite the importance of these issues, to the best of our knowledge the present paper represents the first at-

2In contrast to the analysis of consumption, there is a long tradition of studies of earnings and income mobility. Existing contributions can be divided into two broad groups. A first group analyzes transition probabilities across quantiles of the earnings distribution by Markov-chain models (e.g., Shorrocks, 1978). A second approach is to specify and estimate a process for the conditional mean of earnings (e.g., Lillard and Willis, 1978).

4 tempt to analyze consumption mobility, both theoretically and empirically.3 As we shall see, the analysis of consumption mobility delivers new implications of various theoretical models of intertemporal choice, generates new empirical tests and insights of those models and allows to estimate the impact of income shocks on consumption.

The paper attempts to understand which model of intertemporal consumption choice is capable of explaining the amount of consumption mobility we observe in the data. We focus on several consumption theories, among which the theory of consumption insurance, the rule-of-thumb model, and the PIH model have received the widest attention. We nest these popular consumption models and estimate the parameters that minimize the distance between the empirical and the theoretical transition matrix of the consumption distribution. The exercise is performed constructing a transition matrix for consumption and testing different hypotheses concerning consumption dynamics. Since to measure mobility one needs to follow households over time, the empirical analysis is conducted on a panel drawn from the Bank of Italy Survey of Household Income and Wealth

3Phelan (1994) construct transition matrices for consumption and leisure for the purpose of characterizing the properties of constrained-efficient allocations in dynamic economies with private information. Krueger and Perri (2003) estimate the persistence of consumption using the second largest eigenvalue of the transition matrix of consumption. They do not consider the implications of their theoretical models for consumption mobility.

5 for the years 1987 to 1995. The survey we use is representative of the Italian population, contains a measure of total non durable consumption and has good quality income data. Since there are virtually no panel datasets with broad consumption measures, a by-product of this paper is to bring the data set to the attention of empirical macroeconomists.

To see how the theory of intertemporal choice delivers implications for con- sumption mobility, consider first the extreme case of full consumption insurance.

According to this theory, the cross-sectional distribution of the marginal utility of consumption of any group of households is constant over time. Of course ag- gregate consumption can increase or decrease, so that the growth of the marginal utility for any household can be positive or negative, but the relative position of each household in the cross-sectional distribution of marginal utilities does not change over time. Consumption insurance makes therefore strong predictions about the entire distribution of marginal utilities, not just its mean or variance.

In particular, consumption insurance implies absence of mobility of the marginal utility of consumption between any two time periods, regardless of the nature of the individual income shocks and the time frame considered. If one observes people moving up and down in the distribution of marginal utilities one must therefore conclude that some people are not insulated from idiosyncratic shocks,

6 a contradiction of the consumption insurance hypothesis.4

A second case we consider is the rule-of-thumb model which predicts that households simply set consumption equal to income in each period. Given that any change in current income translates into an equivalent change in consumption, one should expect a relatively high degree of consumption mobility if shocks are not correlated with the rank position in the initial distribution of consumption.

In more realistic models with incomplete markets and insurance opportunities, individuals use saving as a self-insurance device and are able to smooth away at least some of the income variability. In these models the extent of consumption smoothing depends, among other things, on market imperfections arising from informational or enforceability problems and on the presence of liquidity con- straints. Within this class of models, the best known is the PIH, in which income shifts over time because of transitory (e.g., mean reverting) and permanent (e.g., persistent or non-mean reverting) shocks. If people behave according to the PIH, consumption reacts mostly to permanent unanticipated income shocks but is al- most insensitive to transitory ones. Households will therefore move up and down in the consumption distribution only in response to permanent shocks. Thus, in the absence of taste shocks or measurement error, one should expect a degree of mobility that is intermediate between the level predicted by the consumption

4Although this implication of consumption insurance was mentioned in a theoretical paper by Banerjee and Newman (1991), to our knowledge it has never been explored empirically.

7 insurance hypothesis and the rule-of-thumb model.

The rest of the paper is organized as follows. Section 2 presents the data and the empirical transition matrix of the marginal utility, measured with log consumption per capita . The empirical matrix turns out to be quite robust to various adjustments and controls for the effect of family size and labor supply on preferences. In Section 3 we review the implications for consumption dynamics of the theories of intertemporal consumption choice and consider how to account for measurement error in consumption and taste shocks in the utility function.

In Section 4 we estimate the parameters of the consumption rule by minimizing the distance between the empirical and the simulated transition matrix of the consumption distribution. The results, presented in Section 5, reject statistically each of the simple representations of the consumption decision rule, and reveal that households smooth income shocks to a lesser extent than implied by the

PIH. The estimated parameters are also able to reproduce remarkably well the difference in consumption mobility that we observe in samples stratified according to education. The results are robust to the presence of measurement error in income and remain unchanged if we estimate simultaneously the parameters of the income process and of the consumption rule. Section 6 summarizes our results.

8 2 Measuring consumption mobility

The first step of our analysis is to construct an empirical transition matrix of con- sumption. This requires longitudinal household data. For this purpose we use the

1987-1995 panel of the Italian Survey of Household Income and Wealth (SHIW).

This data set contains measures of consumption, income, and demographic char- acteristics of households. The SHIW provides a measure of total non-durable consumption, not just food, thus overcoming one of the main limitations of other panels, such as the PSID, that have been used to test for intertemporal consump- tion choice.

The SHIW is conducted by the Bank of Italy which surveys a representative sample of the Italian resident population. Sampling is in two stages, first munici- palities and then households. Municipalities are divided into 51 strata defined by

17 regions and 3 classes of population size (more than 40,000, 20,000 to 40,000, less than 20,000). Households are randomly selected from registry office records.

From 1987 through 1995 the survey was conducted every other year and covered about 8,000 households, defined as groups of individuals related by blood, mar- riage or adoption and sharing the same dwelling. Starting in 1989, each SHIW has re-interviewed some households from the previous surveys. The panel compo- nent has increased over time: 15% of the sample was re-interviewed in 1989, 27% in 1991, 43% in 1993, and 45% in 1995.5 The response rate (ratio of responses

5In the panel component, the sampling procedure is also determined in two stages: (i) selec-

9 to contacted households net of ineligible units) was 25% in 1989, 54% in 1991,

71% in 1993, and 78% in 1995.6 While these figures uncover considerable sample attrition especially in the early years of the survey, they are comparable to those obtained in other microeconomic data sets. For instance, in 1994 the net response rate in the US Consumer Expenditure Survey was 83% for the Interview sample and 81% for the Diary sample. Given the rotating sample structure, the number of repeated observations on households in our sample ranges from a minimum of twotoamaximumoffive. Ample details on sampling, response rates, processing of results and comparison of survey data with macroeconomic data are provided by Brandolini and Cannari (1994).

The total number of consumption transitions that is, observations for the − same household that are repeated over two adjacent years of data is 10,508. − To minimize measurement error we exclude cases in which the head changes over the sample period or gives inconsistent age figures. In most cases, the excluded households are those facing breaking-out events (widowhood, divorce, separation, etc.), leading to changes in household head. Inconsistent age figures can reflect tion of municipalities (among those sampled in the previous survey); (ii) selection of households reinterviewed. This implies that there is a fixed component in the panel (for instance, house- holds interviewed 5 times between 1987 to 1995, or 4 times from 1991 to 1995) and a new component every survey (for instance, households reinterviewed only in 1989). 6Response rates increase in 1991 because in that year households included in the panel were chosen among those that had previously expressed their willingness to being re-interviewed.

10 unrecorded change in household head or measurement error. After these exclu- sions, the sample has 9,214 consumption and income transitions. Consumption is the sum of all expenditure categories except durables. Income is defined as the sum of labor income and transfers of all household members, excluding income from assets. These are the standard consumption and income concepts used in studies that test the implications of the permanent income hypothesis.7

Table 1 reports sample statistics of log consumption, income and other house- hold characteristics. All statistics are computed using sample weights. The panel is relatively stable over the sample period. Consumption grows consid- erably between 1987 and 1989 and is roughly constant afterwards. Over time, family size declines while the number of income recipients increases. Other de- mographic characteristics remain roughly unchanged. Self-employment slightly falls over time. Income strongly declines in 1993, a recession year, while disper- sion increases. In all years, household disposable income is more variable than consumption. Note also the stability of the cross-sectional variance of log con- sumption as opposed to the wide fluctuations in the cross-sectional variance of log income. The pattern of the Gini coefficients for consumption and income con-

firms that the income distribution is less equal than the consumption distribution

(34 vs. 28%). Interestingly, the 1993 recession boosts income inequality while

7Adding back asset income or asset income net of imputed rents does not change the main results of the paper.

11 leaving consumption inequality unaffected. These descriptive statistics are con- sistent with models in which households are able to smooth away at least some of the income shocks.

The focus of the present analysis, however, is not consumption inequality but consumption mobility. In what follows, we focus on the transition matrix of the logarithm of non-durable per capita consumption (log (c)). We also check the sensitivity of the results using different consumption equivalent scales and the interactions between consumption and labor supply (see below). Note that the

γ transition matrix estimated for log (c) is identical to that estimated for c− ,where

1 1 γ γ is a constant. Thus if individual utility is isoelastic (u (c)=(1 γ)− c − ), − and in the absence of taste shocks and measurement error in consumption, the empirical transition matrix of log consumption can also be interpreted as the empirical transition matrix of marginal utilities, a convenient feature in the light of our empirical strategy below.

2.1 The empirical transition matrix

Assume that P is an unobservable q q stochastic transition matrix of log house- × hold consumption, q being the number of consumption classes in the distribution.

These classes could be determined exogenously or estimated from the quantiles of the empirical distribution. For notational simplicity we consider transition probabilities from period t to period t + 1; extending the argument to transition

12 probabilities in periods t +2,t + 3, and so on, is straightforward. The generic element of P is pij, the probability of moving from class i in period t to class j in period t + 1 conditioning on being in class i in period t.Define nij as the number of households that move from class i in period t to class j in period

q t +1,ni = i=1 nij as the total number of observations in each row i of P,and

q P n = i=1 ni the total number of observations. The maximum likelihood estima- P tor of the first-order Markov transition probabilities is pij = nij/ni (Anderson

8 and Goodman, 1957). b

Table 2 reports the transition matrix of log per capita consumption from 1987-

89 to 1993-95. Recall that the SHIW is conducted every two years, so we observe transitions from period t 2toperiodt. The elements of the main diagonal report − the proportion of households that did not change quartile. For instance, the entry in the top left cell of the 1993-95 panel indicates that 68% of the households in the first quartile in 1993 were still in that quartile two years later. Off-diagonal elements signal consumption mobility. For instance, the second entry in the first

8There are two methods for constructing the empirical counterpart of P. One is to keep the width of the consumption interval constant and let the number of observations within each interval vary. The alternative, more standard method, is to keep constant the marginal prob- abilities and let the interval width change, for instance dividing the distribution into discrete quantiles. We proceed using quartiles throughout; results with deciles are qualitatively similar and are not reported for brevity.

13 row indicates that 25% of households moved from the first quartile in 1993 to the second quartile in 1995. The transition matrices for other years are similar, displaying substantial amount of consumption mobility.9

Each of the transition matrices is based on the distribution of log per capita consumption, and do not take into account the fact that household expenditures might be affected by demographic variables and labor supply choice, breaking the link between the dynamics of log consumption and that of the marginal utility of consumption.

Changes in family size, for instance the arrival of children, alter family needs, hence consumption allocations. Similarly, if household expenditures are charac- terized by economies of scale, one would observe mobility in consumption per capita even if the distribution of consumption per adult equivalent is constant over time. We thus compute transitions using log consumption per adult equiv- alent rather than per capita.10 The pattern of the transition matrix and of the

9In the simulation analysis we will make the assumption that consumption mobility is gener- ated by a symmetric distribution of income shocks, which implies that our simulated transition matrix is also symmetric. It is therefore of interest to check if the transition matrix is symmetric using the maximum likelihood test suggested by Bishop, Fienberg and Holland (1975). The

2 2 statistic is of the form Ψ = i>j (pij pji) / (pij + pij ) χq(q 1)/2.Thep-value of the test is − ∼ − P close to 1 for all years, and does not reject the null hypothesis of symmetry. 10There is a large literature on the cost of children and on the economies of scale in consump- tion, see Deaton (1997) for a survey. Any particular choice of an equivalence scale is therefore

14 associated mobility index is unaffected. As a further check, we restrict attention to households whose demographic structure did not change over the sample pe- riod and find, again, similar consumption transitions.11 In the remaining of the paper we thus focus on log consumption per capita.

If leisure is an argument of the utility function, and if consumption and leisure are non-separable, consumption decisions are affected by predictable changes in households’ labor supply (Attanasio, 2000). This implies that the dynamics of consumers’ rank in the consumption distribution depends, among other things, on changes in hours of work. Failure to control for changes in labor supply might therefore induce consumption mobility even in the absence of income and other idiosyncratic shocks. The interaction between consumption and labor supply is unlikely to affect the transition matrix of log consumption, however. First of all, in our sample hours worked by individual household members and the proportion of spouses working do not change appreciably over the period considered. Sec- to a certain extent arbitrary, depending on the estimation method and assumptions about the utility function. We rely on a plausible equivalence scale that is consistent with current liter- ature, assigning a weight of 1 to the first adult, 0.8 to any additional adult and 0.25 to each household member less than 18 years old. We obtain similar results changing the parameters of the equivalence scale within a range of realistic estimates (0.1 to 0.5 for children, 0.6 to 1 for adults). 11For instance, excluding households with changes in family composition results in a mobility index of 0.576 in 1993-95.

15 ond, even if we exclude households reporting changes in labor force participation

(which induce the most dramatic shifts in labor supply), the consumption transi- tion matrix is almost identical to the full sample matrix. For instance, when we apply this restriction the main diagonal elements of the 1993-95 transition ma- trix are 0.67, 0.42, 0.43, 0.69 (instead of 0.67, 0.43, 0.43, 0.69). These robustness checks suggest that in our sample the transition matrix of marginal utilities is not affected by demographic changes or by the labor supply status of the household.

2.2 Inference

The transition matrices in Table 2 are informative about the amount of consump- tion mobility - or the mobility of marginal utility - that we observe in the data. In the next section we derive from theory the implications for consumption mobility of popular models of intertemporal choice (PIH, rule-of-thumb, and consumption insurance), nest the models in a convenient unifying framework, and confront them with the data. Then, in Section 4 we discriminate among these models by estimating the parameters that minimize the distance between the empirical and the theoretical transition matrix of the consumption distribution.

To make inference about the empirical transition matrix, we will rely on the modified χ2 goodness-of-fit statistic proposed by Anderson and Goodman (1957):

q q 0 2 pij pij 2 ni − χq(q 1) (1) p ∼ − i=1 j=1 ¡ ij ¢ X X b b 16 The statistic compares estimated and theoretical transition probabilities and can

0 be used to test the null hypothesis that pij = pij for all i, j. A similar statistic can be used to test if the transition matrix differs statistically over time or between population groups:

q q g0 g1 2 pij pij 2 Ni − χq(q 1) p ∼ − i=1 j=1 ¡ ij ¢ X X b b where pij is the estimate of pij obtainedb pooling data for two groups or time

1 g0 g1 periodsbg0 and g1,andNi− =(1/ni )+(1/ni ).

In the empirical application we will also be interested in matching the em- pirical transition matrix with a simulated matrix that depends on a vector of unknown parameters θ. Thisestimationproblemcanbeaddressedbyimple- menting a minimum χ2 method, i.e. minimizing the function:

q q 2 (pij pij (θ)) ni − (2) p (θ) i=1 j=1 ij X X b The properties of this estimator are discussed in Neyman (1949). In Appendix

2weshowthat(2)canberewrittenas:

1 (p p (θ)) Ω (θ)− (p p (θ))0 (3) − − where after deleting a columnb from the theoreticalb and empirical transition − matrices to avoid singularity p is the vector of estimated transition probabilities, − b 17 p (θ) the vector of theoretical transition probabilities, and Ω (θ) the covariance matrix of the distance vector (p p (θ)). The function (3) has therefore the − optimal minimum distance formb of Chamberlain (1982) that econometricians are familiar with.12

Neyman (1949) also proposed a modified minimum χ2 method, where the function to minimize is

q q 2 (pij pij (θ)) n − i p i=1 j=1 ij X X b 1 Appendix 2 proves that this function can beb rewritten as (p p (θ)) Ω− (p p (θ))0, − − where Ω uses the estimated pij to construct an estimate of theb covarianceb matrixb of (p bp (θ)). − b

Whenb the expression for pij (θ) is available in closed form, implementation of the minimum χ2 criterion is straightforward. When it is not, as in our case, one must rely on simulations to generate the transition probability conditional on θ,

2 and then apply the minimum χ method to the simulated pij (θ). Details are provided in Section 4 and in Appendix 1.

12Note that when dim (p) > dim (θ), the minimized value of (3) can also be interpreted as

b 2 the statistic for testing overidentifying restrictions. The statistic is distributed χdim(p) dim(θ). − b

18 3 Intertemporal choice and mobility

To explore the relation between the consumption and the income distributions, it is useful to start by presenting a fairly general characterization of the income process. Starting with Hall and Mishkin (1982), it has become quite standard in panel data studies of income and consumption dynamics to express log income of household h in period t as:

ln yh,t = βXh,t + ph,t + eh,t (4)

where Xh,t is a set of deterministic variables such as age and region of residence,

13 and ph,t and eh,t are permanent and transitory components, respectively. The latter is the sum of an idiosyncratic (εh,t) and an aggregate component (εt); both are assumed to be serially uncorrelated. Since the permanent component of income changes very slowly, the standard assumption is to model it as a random walk process of the form:

ph,t = ph,t 1 + zh,t (5) −

where zh,t is the permanent innovation, which is again the sum of an idiosyncratic

(ζh,t) and an aggregate shock (ζt); both components are serially uncorrelated.

13The logarithmic transformation eliminates heteroskedasticity in the distribution of income in levels.

19 We also assume that εh,t and ζh,t are mutually uncorrelated disturbances with

2 2 variances σε and σζ , respectively. Since we operate with a short panel, transitory and permanent aggregate shocks will be estimated by a vector of time dummies, dt.

The decomposition of income shocks into transitory and permanent compo- nents dates back to Friedman (1957). Some of the income shocks are transitory

(mean reverting) and their effect does not last long. Examples include fluctu- ations in overtime labor supply, bonuses, lottery prizes, and bequests. On the other hand, some of the innovations to earnings are highly persistent (non-mean reverting) and their effect cumulates over time. Examples of permanent innova- tions are generally associated with job mobility, promotions, lay-off,andsevere health shocks.

Given our assumptions, income growth can be written as:

∆ ln yh,t = ∆dt + β∆Xh,t + ζh,t + ∆εh,t (6)

As we shall see, this income process delivers different implications for con- sumption mobility for different models of intertemporal choice. We also consider how these implications change in the presence of taste shocks, measurement error in consumption, and measurement error in income.

20 3.1 The Permanent Income Hypothesis

The first model we consider is a version of the PIH with CRRA preferences

1 1 γ 1 u (c)=(1 γ)− c − ,whereγ− is the elasticity of intertemporal substitution. − We assume that infinitely lived households maximize expected utility under per- fect credit markets, subject to an intertemporal budget constraint. We assume that income follows the process (4)-(5) and that it is the only source of uncer- tainty of the model. As in Blundell and Preston (1998) and Blundell, Pistaferri and Preston (2003), we approximate the Euler equation for consumption with a second-order Taylor expansion and assume that r = δ, that consumption equals permanent income, and that the conditional variance of income shocks varies only in the aggregate. One can show that under such assumptions, individual

PIH consumption growth depends on an aggregate component mt and unantici- pated idiosyncratic income shocks:

r ∆ ln c = mPIH + ε + ζ (7) h,t t 1+r h,t h,t

Equation (7) indicates that the optimal rule is to respond one-to-one to per- manent shocks ζ and to revise consumption only by the annuity value of the income innovation in case of transitory shocks ε. Thisisinfactthebasicin- sight of the PIH, where people self-insure against high-frequency income shocks but adjust their consumption fully in response to low-frequency shocks. As we shall see, a convenient feature of equation (7) is that it can easily be nested with

21 consumption rules derived from different models.

Suppose now that we observe a given cross-sectional distribution of consump- tion at time t 1 and that the income shocks are not perfectly correlated with the − consumption rank of each household in the cross-section. Since aggregate shocks are by definition identical for all households, they do not change each consumer’s rank in the consumption distribution and therefore they will not induce any con- sumption mobility: if they were the sole source of consumption fluctuations the mobility index would be zero.14 However, other shocks are idiosyncratic, and will move people up and down in the consumption distribution, to an extent that de- pends on the variance of the two shocks. But since the impact of transitory shocks is scaled down by the factor r/ (1 + r), we expect the variance of the permanent shocks to have the greatest impact on mobility.

Simulation results produced by Carroll (2001) show that with constant rela- tive risk aversion, impatient consumers and an income process similar to the one we use, the implication of the PIH that transitory income shocks have a negligi- ble impact on consumption still holds true. Permanent shocks, however, have a somewhatlowerimpactinbuffer stock models. In fact, in such models permanent

14Suppose that income shocks were instead perfectly and positively correlated with the rank of household consumption in the cross-section. Then, the poorest households receive the largest negative shocks and the richest the largest positive shocks, implying no mobility as in the consumption insurance case.

22 income shocks reduce the ratio of wealth to permanent income, thus increasing also precautionary saving. Under a wide range of parameter values, Carroll shows that in this class of models the marginal propensity to consume of a permanent income shock is about 0.9, not far from that of the approximation in (7). There- fore, empirically it is difficult to distinguish the PIH from buffer stock models on the basis of the marginal propensity to consume out of permanent income shocks. But the main intuition is still valid: if individuals smooth consumption and understand the income generating process, transitory income shocks should have a negligible impact on consumption. To account for the effect remarked by

Carroll and others, in the empirical analysis we take into account the degree of consumption smoothing arising from precautionary savings estimating:

r ∆ ln c = mPIH + φ ε + ζ h,t t 1+r h,t h,t µ ¶ In buffer stock models (φ < 1), assets accumulated for precautionary purposes allow people to smooth income shocks to a larger extent than in the PIH model

(φ =1).

3.2 The rule-of-thumb model

Let’s assume that consumption equals income in each period, i.e.:

ln ch,t =lnyh,t

23 This model has been often proposed as a simple, yet extreme alternative to the PIH to describe the behavior of households that do not use savings to buffer income shocks but spend all they receive. Some authors rationalize this model by appealing to the presence of binding liquidity constraints in each period. Laibson

(1997) shows that it is the equilibrium outcome for hyperbolic (or golden-eggs) consumers.15 We term it rule-of-thumb model to indicate a situation in which consumption tracks income closely, even when individuals have accumulated as- sets in previous periods. The model is an interesting case to study because it approximates the behavior of consumers with short horizons, limited resources, or hyperbolic discount factors, giving an upper bound for the sensitivity of con- sumption to income shocks.

Using the income process (4)-(5), the dynamic of consumption is given by:

K ∆ ln ch,t = mt + εh,t εh,t 1 + ζh,t (8) − −

K where mt is the effect of the aggregate shocks on consumption in the rule-of- 15In the hyperbolic consumer model, individuals have preferences that change over time (there are different selves in different periods). In the model proposed by Laibson (1997) self t 1 − chooses assets at 1 to constrain the consumption of self t. This is done by keeping most assets − invested in an illiquid instrument. Hence, at any point in time, the consumer is effectively liquidity constrained, even though the constraint is self-imposed. Laibson (1997) shows that in equilibrium consumption is exactly equal to the current level of cash flow, or total income.

24 thumb model. According to the model the growth rate of consumption is there- fore equally affected by current and lagged transitory shocks and by permanent shocks. The main difference with the PIH is that in the rule-of-thumb model tran- sitory shocks impact one-to-one on consumption. It is precisely for this reason that in the rule-of-thumb model one should expect more consumption mobility than under the permanent income rule: there is another channel through which households move to a different quartile from one period to the next.

3.3 Consumption insurance

To illustrate the implications of the theory of intertemporal choice with complete insurance markets, let us keep the assumption that households have preferences of the CRRA type. The implications of the model are identical for any power utility function. As shown, among others, by Mas-Colell, Whinston and Greene

(1995), the optimal transition law for consumption with complete markets can be obtained by assuming that there is a social planner who maximizes a weighted sum of individual households’ utilities. The Lagrangian of this problem can be written as:

L = λh πs,tu(ch,s,t)+ µ Cs,t ch,s,t s,t − h s t s t à h ! X X X X X X where h, s and t are subscripts for household h in the state of nature s in period t,

λh is the social weight for household h, µs,t is the Lagrange multiplier associated

25 with the resource constraint, πs,t the probability of the realization of state s in period t,andCs,t aggregate consumption in state s and period t.

The first order condition can be written in logarithms as:

γ ln ch,s,t =lnµ ln λh ln πs,t − s,t − −

To obtain the growth rate of consumption, subtract side-by-side from the same expression at time t 1: −

1 1 CI ∆ ln ch,t = γ− ∆ ln µ + γ− ∆ ln πt m (9) − t ≡ t where we drop the subscript s because only one state is realized in each period.

The two terms on the right-hand-side of equation (9) represent genuine aggregate effects. The first term is the growth rate of the Lagrange multiplier, the second is the growth rate of the state probabilities. Note that first-differencing has eliminated all household fixed effects (µ and π in equation 9 are not indexed by h).

Equation (9) states that the growth rate of the marginal utility of consumption

(or of consumption, for that matter) of each household is the same. This implies that the initial cross-sectional distribution of consumption is a sufficient statis- tic to describe all future distributions. Since all households experience the same growth rate of marginal utility, and given our assumption of isoelastic utility, their

26 rank in the consumption distribution is stationary. Note that the stationarity of the cross-sectional distribution is directly implied by the assumption that insur- ance markets fully insulate households from idiosyncratic shocks. The statistical counterpart of consumption insurance is that the transition matrix for household consumption is an identity matrix. The extreme assumptions of this model are clearly unrealistic. However, the model provides a lower-bound for the impact of income shocks on consumption and is therefore a useful theoretical benchmark.

The discussion in Sections 3.1-3.3 suggests that under CRRA preferences (and no measurement error or taste shocks), consumption mobility should be zero in the consumption insurance model, intermediate in the permanent income model, and highest in the rule-of-thumb model. In practice, the presence of measurement error and/or taste shocks does not provide a clear-cut ordering of the models in terms of expected mobility. Nevertheless, it is still possible to distinguish between the different models by estimating the parameters that determine the extent of consumption mobility that we observe in the data.

3.4 Nesting the three models

The distinction between the three models is useful but too stylized for empirical applications. Consumption insurance is no less unrealistic than assuming that all income is consumed in each period, or that all households follow exactly the PIH.

In the empirical application we therefore nest the three models and estimate the

27 parameters of the following flexible consumption rule:

λ + r ln ch,t =lnch,t 1 + φ εh,t λεh,t 1 + ζh,t (10) − 1+r − − µ ¶

Since aggregate shocks do not affect consumption mobility under each of the models discussed above, for notational simplicity the equation above omits the

j aggregate component mt . However, when we estimate the income process we control for aggregate shocks by introducing time dummies in the regression (dt in equation 6).

The two parameters λ and φ allow to distinguish various forms of departure from the stylized models of intertemporal choice. Consider first the case in which

φ = 1. The parameter λ represents the extent to which consumption responds to income over and above the amount warranted by the PIH, i.e., the excess sensitivity of consumption to current and past income shocks. One way to in- terpret this parameter is that each household sets consumption equal to income with probability λ (perhaps because of binding liquidity constraints or hyper- bolic discounting) and follows the PIH with probability (1 λ). Note that with − λ = 0 the expression (15) reduces to the PIH, while with λ = 1 one obtains the rule-of-thumb model where consumption equals income each period.

Consider now the situation in which φ = 0. Income shocks play no role in the consumption insurance model. But intermediate cases in which 0 < φ < 1 are interesting and potentially informative, as discussed in Section 3.1. Some consumers have assets accumulated for precautionary reasons which allow them

28 to smooth income shocks to a larger extent than in the PIH model (where φ =1).

3.5 Measurement error and taste shocks

The consumption transition law is derived assuming that there is no measure- ment error in consumption. In practice consumption mobility, as estimated from the data, is potentially upward biased because of reporting errors. If respon- dents report their consumption with errors, one will find units moving up and down even if their true rank in the consumption distribution is unchanged; hence, measurement errors affect consumption dynamics and the extent of mobility we measure in the data. In the estimation it is therefore important to account ex- plicitly for measurement error. We will do so by assuming that true consumption is measured with a multiplicative error:

ln ch,t∗ =lnch,t + ωh,t (11)

e where ln c∗ is measured consumption and ω is an independently and identically distributed measurement error. e

As explained in Section 2, another source of consumption mobility are taste shocks. Even though in Section 2 the actual empirical matrix is quite stable across a wide range of robustness checks (such as defining consumption in terms of adult equivalents, or restricing the sample to minimize the potential impact of leisure on consumption transitions), the impact of unobserved taste shocks

29 cannot be ruled out. To see what impact they have on the empirical estimates,

1 1 γ we rewrite the utility function as u (c, ξ)=(1 γ)− c − exp θξ ,whereξ is an − ³ ´ i.i.d. shock to individual preferences. e e

The combined presence of measurement error and taste shocks leads to the following reformulation of the consumption dynamics in equation (10):

λ + r ln ch,t∗ =lnch,t∗ 1 + φ εh,t λεh,t 1 + ζh,t − 1+r − − µ ¶

+(ωh,t ωh,t 1)+ψ ξh,t ξh,t 1 (12) − − − − ³ ´ e e e e where ψ is a function of θ, γ,andλ.

Without loss of generality, assume that in each period the standard deviation of measurement error (σω) is a fraction ηω of the standard deviation of measured

e e consumption, σω = ηωσln c∗ , and that the standard deviation of taste shocks (σξ)

e e e is a fraction ηξ of the standard deviation of measured consumption, σξ = ηξσln c∗ .

e e e Since the variables are expressed in logs, ηω and ηξ can be interpreted as the percentage variability in observed consumptione duee to reporting error and taste shocks, respectively. Rewrite (12) as

λ + r ln ch,t∗ =lnch,t∗ 1 + φ εh,t λεh,t 1 + ζh,t − 1+r − − µ ¶

+ηω (ωh,t ωh,t 1)+ψηξ ξh,t ξh,t 1 (13) − − − − e e ¡ ¢ 1 where we have adopted the linear transformations ωh,τ = ηω− ωh,τ and ξh,τ =

e e 30 1 η− ξ ,sothatω i.i.d. (0, σln c )andξ i.i.d. (0, σln c ). It is clear from (13) ξ h,τ ∼ ∗ ∼ ∗ that,e e in the absence of additional information, the separate effects of taste shocks and measurement error are not identified. We thus can define a composite error

term vh,t = ηωωh,t + ψηξξh,t, and rewrite (13) as

e e e

λ + r ln ch,t∗ =lnch,t∗ 1 + φ εh,t λεh,t 1 + ζh,t + α (vh,t vh,t 1)(14) − 1+r − − − − µ ¶

1 where we have adopted the linear transformation vh,τ = α− vh,τ ,sothatv ∼ 2 1/2 2 i.i.d. (0, σln c∗ )andα = ηω + ψηξ is an unknown parametere to estimate. µ ³ ´ ¶ Equation (14) shows thate the compositee error term induces a further reason for consumption to vary. Clearly, not only consumption dynamics changes, but the implied consumption mobility as well.16 In what follows, we will therefore in- terpret the parameter α as the amount of consumption variability induced by measurement error and taste shocks.

4 Estimation method

We now discuss estimation of the parameters of interest. One complication with the panel we use is that while income and consumption refer to calendar years, data are collected every other year from 1987 to 1995. The simulated transition

16The clearest case in which this happens is in the model with consumption insurance (where

φ = 0) and no taste shocks: measurement error can induce consumption mobility even though the growth rate of marginal utilities is constant.

31 laws for consumption must therefore be slightly modified to tackle this problem.

One can use the recursive aspect of (14) to rewrite it as:

λ + r (1 λ) r ln ch,t∗ =lnch,t∗ 2 + φ εh,t + − εh,t 1 λεh,t 2 + ζh,t + ζh,t 1 − 1+r 1+r − − − − µ ¶

+α (vh,t vh,t 2)(15) − −

The parameters to be estimated are the variances of the permanent and tran- sitory income shocks, the fraction of measurement error and taste shocks, the degree of excess sensitivity, the degree of income smoothing and the real interest rate. As for the interest rate, we assume a value of 2% throughout.

The estimation of the income and consumption process can be performed in

2 2 two steps. In a first step, we estimate the income variances σε and σζ .Inasecond step we use the estimated income variances to generate the income shocks ε and

ζ and the composite error v that appear in the consumption rule, and estimate the parameters φ, λ,andα by simulated minimum χ2 method.

As explained in Section 4, we specify the income process as ln yh,t = dt +

βXh,t + ph,t + εh,t,whereyh,t is per capita household disposable income and dt a set of time dummies. Using the 1987-95 panel, we regress ln yh,t on a set of demographic variables (North, South, a dummy for gender, a fourth-order age polynomial, and education dummies) and time dummies, so to remove the deterministic component of income. We save the residuals uh,t = ph,t + εh,t and carefully examine their covariance properties. We estimate covariances using

32 equally weighted minimum distance methods, as suggested by Altonji and Segal

(1997).17

We find that the estimated covariances are consistent with the income process in equations (4) and (5), i.e. that there is a random-walk permanent component and a serially uncorrelated transitory shock. Recall that because of the sample design of the SHIW we can only construct the covariance matrix for two years apart income residuals, uh,t uh,t 2 = ζh,t + ζh,t 1 + εh,t εh,t 2.Tocheckthe − − − − − consistency of the estimated income process with the model in equations (4) and

(5), note that the income process implies the following testable restrictions on the covariance matrix of the first difference of the income residuals:

2 2 2 E (uh,τ uh,τ 2) =2σζ +2σε − −

£ ¤ 2 E [(uh,τ uh,τ 2)(uh,τ 2 uh,τ 4)] = σε − − − − − −

E [(uh,τ uh,τ 2)(uh,τ j uh,τ j 2)] = 0 for all j 4 − − − − − − ≥

Provided that the restrictions are met in the data, one can estimate the vari-

2 ance of the transitory shock σε from the first order autocovariance of income

2 residuals and the variance of the permanent shock σζ combining information on the variance and the first-order autocovariance of the residuals. We find that 17Covariances can be estimated by equally weighted minimum distance or optimal minimum distance. As shown by Altonji and Segal (1997), the latter can produce inconsistent estimates in small samples, so we adopt the former.

33 the estimated autocovariance at the second order is very small (-0.0056) and not statistically different from zero (a t-statistic of -1.1); the autocovariance at the third order is again small (-0.0178) and not statistically different from zero (a t-statistic of -1.1). In contrast, the first order autocovariance (which provides an estimate of σ2) is precisely estimated (a t-statistics of 6.4) at -0.0794. The − ε 2 2 estimate of the overall variance (2σζ +2σε) is 0.2122 (with a t-statistics of 19.4),

2 2 so we infer that σζ =0.0267 and σε =0.0794 (with standard errors 0.0135 and

0.0123, respectively).18 These parameter estimates are broadly consistent with the evidence available for the US, where researchers have found variances of sim- ilar magnitude.19

The remaining unknown parameters are φ, the degree to which consumers are unable to insure income shocks through precautionary savings, λ, the degree of excess sensitivity of consumption, and α, the composite amount of consumption variability induced by measurement error and taste shocks. We therefore estimate

φ, λ and α minimizing the distance between the empirical and the theoretical transition matrix using the modified χ2 criterion presented in Section 2.

Since theoretical transition probabilities do not have a closed form expression, we use a simulated minimum χ2 estimation method.20 A sketch of the estimation

18Unfortunately, with data collected every two years we cannot distinguish between this income process and one where the transitory component is an MA(1) process. 19 2 2 For instance, Carroll and Samwick (1997) using the PSID, estimate σζ =0.0217andσε =

0.0440. 20Alan and Browning (2003) estimate the parameters of the Euler equation (the elasticity of

34 method is the following. We start by generating, for each household, draws for the transitory and the permanent income shocks and for the measurement error in consumption.21 The income shocks are drawn from a normal distribution with mean zero and variances equal to the estimated variance from the income

2 2 process (σε =0.0794 and σζ =0.0267, respectively). The errors vh,t and vh,t 2 − are drawn from a normal distribution with mean zero and variance equal to the variance of measured log consumption at t and t 2, respectively. The − number of draws is S = 100 for each household, for a total of HS simulated observations (H being the number of households). We then choose a starting value for the parameter vector and, for each household, compute next period consumption, ln ch,t∗ using (15). In this way, the covariance structure between individual income and individual consumption bears directly on the extent of consumption mobility. Finally, we compute the theoretical transition probabilities

(averaging across the S simulations) and obtain the parameter estimates as those that minimize the (optimal) distance between empirical and theoretical transition probabilities. Since the number of transition probabilities that we fit exceeds the intertemporal substitution and the amount of measurement error) using simulated Euler equa- tion residuals. Gourinchas and Parker (2002) estimate the coefficient of relative risk aversion and the intertemporal discount rate using a method of simulated moments conditional on the assumption that the PIH is the true consumption model. 21In each year we choose a sample size identical to the number of actual sample transitions

(for instance, 2,982 in 1991-93 and 3,211 in 1993-95).

35 number of parameters we estimate, we have overidentifying information that can be used to assess the goodness of fit of the model. Appendix 1 reports technical details about the properties of this estimator and the minimization algorithm.

The two-step procedure described in this Section provides consistent, but not fully efficient parameter estimates. In Section 5.4 we therefore check the sensi- tivity of the results estimating simultaneously income and consumption mobility, as well as the rank correlation statistics between the two variables.

5 Estimation results

In this section we report full sample estimates of the parameters of the con- sumption rule and of the transition matrix for consumption. We also check the sensitivity of the estimates if we estimate simultaneously the parameters of the income process and of the consumption rule and when we consider measurement error in income. Finally, we split the sample by educational attainment of the head, estimate a separate income process for each education group and evaluate patterns of consumption mobility of households with different levels of educational attainment.

5.1 Full sample estimates

The results of the full sample estimates are similar across periods, so we focus on the most recent one (1993-95), that also features the largest number of tran-

36 sitions. The stability of the results across different sample periods suggests that the simulations are only marginally affected by the initial distribution of con- sumption (the income process and the associated variances of the shocks are in fact assumed to be the same across the different samples).

As a preliminary analysis, we constrain the parameter space in the simulated minimum χ2 estimation method and compare the empirical and theoretical tran- sition matrices in three benchmark models: PIH (φ =1,λ = 0), rule-of-thumb

(φ =1,λ = 1) and consumption insurance (φ = 0). In these experiments, we also rule out the effect of measurement error and taste shocks by setting α =0. These benchmark models illustrate our estimation strategy and provide a gateway to the results that follow.

We find that simulated consumption mobility, as measured by the Shorrocks mobility index, is highest in the rule-of-thumb model (65%), intermediate in the case of the permanent income hypothesis (44%), and zero under consumption insurance.22 The ranking of mobility agrees with the discussion in Section 3.3 because idiosyncratic income shocks translate into consumption changes entirely in the rule-of-thumb model, partially in the PIH via intertemporal smoothing, and are fully insured in the risk sharing model. However, from a statistical point of view, none of these models is able to match the amount of empirical mobility. Cell-

22 1 The Shorrocks index, defined as S (P)=(q 1)− (q trace(P)), is a standard way of − − summarizing the extent of mobility from a transition matrix. See Shorrocks (1978).

37 by-cell comparison of the theoretical and empirical transition matrices reveals that each of the three models is rejected according to the χ2 goodness-of-fit statistics.23

To bridge the gap between simulated and empirical mobility we therefore con- sider the effect of measurement error and taste shocks in consumption, and allow for a more flexible response of consumption to income shocks than predicted by either full insurance, rule-of-thumb model or PIH. We know from equation (15), nesting the three baseline models, that raising the excess sensitivity parameter λ or the insurance parameter φ also increases consumption mobility, regardless of the size of the composite error term.

We therefore implement the simulated minimum χ2 estimation method freeing the parameter space. The parameter estimates of φ, λ and α are reported in

2 column (1) of Table 3. For completeness, we also report the estimates of σζ and

2 σε obtained in the first step. Since the restriction φ = 1 is not rejected statistically, we impose the restriction in column (2). The results indicate that the composite variability of consumption due to measurement error and taste shocks is 38% and that the excess sensitivity coefficient is 16%. Both estimates are precisely estimated and statistically different from zero at the 1% level. For values of φ =1,

λ =0.16 and α =0.38, the simulated mobility index is almost identical to the

23In the rule-of-thumb case (α =0,λ = φ = 1), the χ2 value is 58, in the PIH (α = λ =0,

φ = 1) 250, and in the consumption insurance case (α = λ = φ = 0) 2856. Each of these values

2 exceeds the critical value of χ12;0.05 =21.

38 empirical one (60.13 against 59.37%). The χ2 goodness of fit statistic (or test of overidentifying restrictions) is 15 with a p-value of 9%, indicating that the model

fits well the transition probabilities: we cannot reject the hypothesis that the empirical transition probabilities are jointly equal to the simulated ones.24

The simulation predicts almost perfectly the empirical transition matrix cell- by-cell, not just the aggregate mobility index. In Table 4 we report the simulated transition probabilities and (in parenthesis) the empirical transition probabilities, the same reported for 1993-95 in Table 2. The comparison between the two sets of numbers is striking: regardless of cell, the difference between the empirical and simulated values is at most 2 percentage points.

The estimated value of the excess sensitivity parameter (λ =0.16) is broadly consistent with previous evidence on the effect of transitory income shocks on consumption expenditure. Using CEX quarterly panel data, Souleles (1999) and

Parker (1999) examine, respectively, the response of household consumption to

24Results for other years are similar with the exception of 1991-93. In that period actual mobility increases substantially, a fact that is not captured by our simulations. One possible explanation is that the variance of the permanent shock, which is assumed to be time stationary, changed in 1993 due to the unprecedented strong recession. However, we cannot rule out that in 1993 the amount of measurement error is greater than in the other two years. Another possibility is that the 1993 recession impacted unevenly on households, a particular form of non-stationarity that we neglect in our simulation exercise.

39 income tax refunds and to predictable changes in Social Security with-holdings.

Souleles finds evidence that the marginal propensity to consume is at least 35% of refunds within a quarter, and Parker that consumption reacts significantly to changes in tax rates. In both studies, the impact of transitory income shocks is too high to be consistent with the PIH model, but in the range of estimates pro- duced by our hybrid model. Browning and Crossley (2001) survey several other studies reporting evidence that consumption overreacts to anticipated income innovations.

Our estimates allow us to characterize consumption mobility both in the short and in the long run by using recursively the transition law for consumption and the realizations of the income shocks (equation 15). To illustrate, let us consider an individual who is in the bottom consumption quartile in the initial period. With full consumption insurance, absent measurement error and taste shocks, there is no mobility across quartiles: the individual rank in the consumption distribution is forever unchanged. Figure 1 shows that in a world in which households change consumption one-for-one in response to permanent income shocks, and smooth transitory shocks by saving and dissaving (the PIH model, obtained by setting

φ =1,α =0,andλ = 0 in equation 16), there is instead a 24% probability of leaving the bottom consumption quartile in period t + 2 conditional on being in the bottom quartile in period t. Since each period the household receives new income shocks, we can generate a consumption distribution also for years

40 t +4,t + 6, and so on until t + 20 (recall that our panel and transition law for consumption span two years of data). From each distribution we then create consumption quartiles and compute the probability of moving to higher quartiles in period t +4, t + 6 and so on conditional on being in the first quartile in period t. This set of calculations traces the lowest line in Figure 1. Since the income process is non-stationary, income shocks compound and the chance of leaving the bottom quartile increases over time, up to a long-run value of 43%.

A second source of consumption mobility is the sensitivity of consumption to transitory income shocks. The intermediate line in Figure 1 is obtained using a transition law for consumption with φ =1,λ =0.16 and α = 0. Although the line lies above the one estimated for the PIH, the distance between the two is rather small, reflecting a small estimate for λ. Measurement error and taste shocks represent a third source of consumption mobility. The upper line in Figure

1 plots the estimated probability of moving from the lowest quartile for the full model (φ =1,λ =0.16 and α =0.38,asinTable3).Thedistancebetween this line and the intermediate line (with α = 0) indicates that this composite term adds about 10 percentage points to the probability of leaving the bottom consumption quartile. Notice also that the probability in t + 2 is 33%, matching the actual value (0.33 = 1 p11 in Table 5) and that measurement error impacts − equally short and long-run mobility.

41 5.2 Simultaneous estimation of consumption and income

mobility

So far, we have followed the two-step estimation strategy described in Section

4. The first step uses covariance restrictions of the dynamic income process to

2 2 estimate σζ and σε, the variance of permanent and transitory income shocks. In the second step we estimate the parameters of the consumption rule, conditional

2 2 on the estimates of σζ and σε obtained in the first step. Under our maintained assumptions this procedure is consistent but not fully efficient. For example, we do not use information of the income transition matrix which may be interesting − in its own right or information on the joint behavior of consumption and income − growth, if not through the dynamic consumption rule embedded in (15).

Here we extend our procedure and estimate the parameters of the income and consumption process simultaneously. We consider the restrictions that the theory imposes on the consumption transition matrix, the income transition matrix, and

42 the Spearman joint rank correlation of income and consumption growth.25 The estimation method is an extension of the one described in Appendix 1, and it is detailed in Appendix 3.

Table 5 reports the results of this exercise. The estimated variance of per-

2 manent income shocks, σζ , is similar to the one estimated with the two-step

2 procedure; the variance of transitory shocks, σε, is slightly smaller (0.050 against

0.079) and more precisely estimated. The parameters of the consumption process

φ, λ,andα are remarkably close to those reported in Table 3, and yield therefore a similar interpretation.

The goodness of fit statistics increases (there are now 20 degrees of freedom, instead of 9), which signals that the model could be improved. This can be seen

25The Spearman index of the joint rank correlation between two variables x and y is defined as

n n+1 2 r (xi) r (yi) n R = i=1 − 2 n 2 n+1 2 n 2 n+1 2 r (xiP) n ¡r (yi¢) n i=1 − 2 i=1 − 2 r hP ¡ ¢ ihP ¡ ¢ i where r (.) is the rank and n the sample size.

The Spearman joint rank correlation index is distribution-free and is not affected by influential values. In terms of our competing theories, and in the absence of measurement error and taste shocks, a full consumption insurance model would suggest an index close to zero; a rule of thumb model would suggest an index close to 1. A hybrid model suggests values between these two extremes.

43 from Table 6, where we report consumption and income transition probabilities, as well as the Spearman joint rank correlation index, predicted by our estimation strategy. The figures in parenthesis are the corresponding statistics computed from the actual data. Our model fits quite well the Spearman joint rank corre- lation, which is 0.42. As before, we are also able to fit consumption transition probabilities remarkably well given our estimated parameters. In contrast, the fit of the income transition matrix is not as good. For example, the theoretical model suggests that the probability of remaining in the third quartile of the income dis- tribution should be 41%. In practice, in the data there is more persistence (48%).

We leave investigations of the reasons for these discrepancies to future research, and in the remaining of the paper focus on the two-step procedure.

5.3 Measurement error in income

In this section we consider the robustness of our conclusions in the presence of measurement error in income. This error inflates the variance of the transitory shock but does not affect the variance of the permanent shock. To see this point, assume that true income is measured with a multiplicative error: ln yh,t∗ = ln yh,t + νh,t,whereνh,t is an independently and identically normally distributed

2 measurement error with mean zero and variance σν. Usingtheincomeprocess

(4)-(5): ln yh,t∗ = βXh,t + ph,t + εh,t + νh,t, the two years apart income residual is now: uh,t uh,t 2 = ζh,t + ζh,t 1 + εh,t εh,t 2 + νh,t νh,t 2. The covariance − − − − − − −

44 matrix of the first difference of the income residuals depends now on the variance of the measurement error:

2 2 2 2 E (uh,τ uh,τ 2) =2σζ +2σε +2σν − −

£ ¤ 2 2 E [(uh,τ uh,τ 2)(uh,τ 2 uh,τ 4)] = σε σν − − − − − − −

E [(uh,τ uh,τ 2)(uh,τ j uh,τ j 2)] = 0 for all j 4 − − − − − − ≥

However, it can be checked that measurement error inflates the estimated

2 variance of the transitory shock by σν, but not the variance of the permanent

2 shock σζ, which is still identified by the difference between the variance and

(minus twice) the first-order autocovariance. The conclusion is that even though the estimate of the variance of the permanent shock is unaffected by serially uncorrelated measurement error, the estimate of the variance of the transitory shock is not.

This implies that in the model with full consumption insurance, idiosyncratic income shocks play no role regardless of measurement error in income. In the permanent income model, the impact of measurement error in income is bound to be small, because transitory shocks play a very limited role. In contrast, measurement error may have a large impact in the rule-of-thumb model. Since

2 we cannot identify σν from the data, we repeat our simulation: (a) dropping the self-employed from the sample on which we estimate the income process,26 and

26Brandolini and Cannari (1994) note that in the SHIW income from self-employment is less

45 (b) downsizing the variance of the transitory shock, i.e., assuming that one third or one half of the estimated first-order autocovariance reflects measurement error.

The results of these experiments are very similar to the simulations reported in

Tables 3 and 4 and are not reported for brevity.

5.4 Group estimates

Except for the extreme case of consumption insurance, models with incomplete insurance suggest that if different population groups are systematically exposed to different idiosyncratic shocks (and therefore face different income processes), consumption mobility should differ across groups in a predictable way.27 There- fore, comparison of different population groups with different income generating process is potentially quite interesting. Indeed, even more compelling evidence for the ability of our simulations in explaining consumption transitions comes from comparing consumption mobility in two education groups: compulsory schooling or less and high school or college degree.

Focus on education is warranted for at least three reasons: (1) education is an exogenous characteristic by which one can partition the sample; (2) there is wide evidence that different education groups face different earnings opportu- nities and uncertainties; (3) education is likely to be correlated with variables well estimated than wages or salaries. 27Attanasio and Davies (1996) also exploit predictable differences between education groups to provide insights about the consumption insurance hypothesis.

46 affecting preferences and therefore with different consumption behavior. We run the income regressions separately for households headed by individuals with high and low education. We then estimate the autocovariance matrix as explained

2 2 in Section 4, and find σζ =0.0296 and σε =0.0754 for the less well educated,

2 2 and σζ =0.0198 and σε =0.0895 for those with at least a high school degree.

The estimated variances signal that the less well educated face a higher variance of permanent income shocks, a pattern also uncovered by Carroll and Samwick

(1997) with US data. Since in our sample the income process varies considerably by education groups, we have an ideal setting to test the validity of models of intertemporal choice and of the robustness of our procedure.

The transition probabilities reported in the two lower panels of Table 7 indi- cate that also consumption mobility differs between the two groups in a systematic way. Applying the test on difference of transition probabilities outlined in Section

2, we reject the hypothesis that the two matrices are equal at the 1% significance level.

Quite clearly, the consumption insurance model is unable to explain differences in consumption mobility emerging from income shocks, transitory or permanent.

In that model all shocks are insured, and provided measurement error or taste shocks are the same in both groups, consumption mobility between two groups exposed to different shocks should be identical. Therefore the fact that mobility is higher in the group with lower education provides further evidence against the

47 consumption insurance model.28 For quite different reasons, the rule-of-thumb model with λ = 1 (or any model where excess sensitivity to transitory income shocks plays a prominent role) predicts little or no difference between education groups. In the simulations the lower variance of the transitory shock for the less well educated is offset by a higher variance of the permanent shock, resulting in approximately the same mobility rates in the two groups.

We therefore estimate equation (15) and the associated consumption transi- tions allowing for differential response between the two education groups. The parameter estimates of the simulated minimum χ2 method and the associated χ2 statistic are reported in Table 7. Also in this case we cannot reject the hypothesis that φ = 1 in each of the two groups. We find a value of α =0.38 (s.e. 0.01) and λ =0.4 (0.05) in the group with low education and α =0.28 (0.01) and

λ =0.09 (0.05) in the group with high school or college degree. The model repli- cates quite well also the difference in empirical and simulated mobility between the two groups: the simulated mobility index (0.64 and 0.55 for low and high education, respectively) is quite close to empirical mobility in each group. And in each of the two cases the χ2 statistic does not reject the null hypothesis that the simulated probabilities are equal the empirical ones at the 1% significance

28The different estimates of α in the two groups will generate some differences in estimated mobility. However, unlike the estimates of λ,theα estimates are fairly similar in the two groups, and so this is unlikely to be a relevant explanation.

48 level.

As a final check of the validity of the estimates, we test whether the parameters are the same in the two groups. Call θh and θl the k 1 vectors of simulated × minimum χ2 estimates of θ for high- ande low-educatede individuals. Given the asymptotic normal distribution of the estimator and the fact that the two samples are independent, the null hypothesis of no group difference can be tested using the statistic:

1 0 − θh θl var θh +var θl θh θl − − ³ ´ ³ ³ ´ ³ ´´ ³ ´ e e e e e e 2 which is distributed χk under the null. The test statistic, reported in the last row of the first panel of Table 7, rejects the null hypothesis of parameter equality.

The other two panels of Table 7 report the simulated transition probabilities and (in parenthesis) the empirical transition probabilities for the two education groups. Once more, each of the simulated probabilities is remarkably close to the empirical transitions irrespective of the group considered.

From an economic point of view, the result that the less well educated indi- viduals are more responsive to transitory income shocks than the high income group is of particular interest. To the extent that these households are less likely to have access to credit and insurance markets than households with higher ed- ucation, our findings support the hypothesis that excess sensitivity stems from

49 the effect of borrowing constraints, rather than from other sources.29 We also

find that the estimate of α is higher for the less well educated. Whether this reflects a tendency to report noisier consumption data or a greater incidence of taste shocks in this group is, however, an issue that cannot be settled here.

5.5 Relation with previous tests

It is useful to contrast our approach with previous tests of models of intertemporal choice. First of all, our simulation method produces estimates of the propensity to consume out of transitory and permanent income shocks. These parameters are of great policy interest, for instance to evaluate the effect of a tax cut or other changes in the household budget constraint. Excess sensitivity of consumption has sometimes been inferred from the income growth coefficient in Euler equations estimates. However, there is much disagreement concerning the interpretation of the excess sensitivity parameter due to various identification problems in the estimation of the Euler equation (Attanasio, 2000). While the Euler equation literature is concerned with estimation of preference parameters derived from the first order conditions of the consumers’ optimization problem, we attempt at

29Jappelli, Pischke and Souleles (1998) report evidence from the Survey of Consumer Finances that individuals with less than college degree are more likely to be turned down for loans, to have no credit card, or to have no line of credit. The same households have also less assets relative to income, an indicator that has often been interpreted as bearing on the incidence of borrowing constraints.

50 estimating the parameters of the consumption rule. This does not come without costs, however. We make specific assumptions about preferences and the income generating process, and our estimates are therefore conditional on the validity of the theoretical framework and on the stability of the income process. This paper is therefore part of a growing literature in macroeconomics that attempts to estimate structural (or semi-structural) models by means of simulated estimation methods.

Second, and for quite different reasons, our approach to test for consump- tion insurance differs from previous tests based on regression analysis. Cochrane

(1991), Mace (1991), Townsend (1994), Attanasio and Davies (1996), and Zhang and Ogaki (2001) regress household consumption growth on aggregate variables and idiosyncratic shocks (such as change in disposable income, unemployment hours, and days of illness). The implication of the theory is that none of these shocks should impact household consumption growth, as in equation (9). Fo- cussing instead on the relation between consumption insurance and consumption mobility has two advantages: (1) we don’t need to identify explicitely any of these shocks, and (2) we don’t need to assume that they are uncorrelated with unobserv- able or omitted preference shocks, including household fixed effects. Moreover, measurement error in the shock variables biases tests based on regression analysis towards the null hypothesis of full consumption insurance; our testing strategy is instead robust to such problem. Of course, as we have clarified, our approach re-

51 quires to make specific (and untestable) assumptions on how taste shocks impact the marginal utility of consumption.

Deaton and Paxson (1994) and Attanasio and Jappelli (2001) test another implication of the theory of consumption insurance, i.e., that the cross-sectional variance of consumption is constant over time. However, the distribution of con- sumption at time t might have the same variance of the distribution at time t 1 even if there is mobility in the underlying distributions.30 Tests based on − the dynamics of the cross-sectional variance of consumption are therefore biased towards the null. Our test instead still signals rejection of the consumption in- surance model even in situations in which the cross-sectional variance is constant over time but there is mobility in the underlying distribution.

6Summary

The implications of the theories of intertemporal consumption choice for con- sumption mobility are as yet unexplored. In this paper we study transition prob- abilities for total non-durable consumption using the 1987-95 panel contained in the Bank of Italy Survey of Household Income and Wealth. The panel data allow us to calculate an empirical transition matrix of log per capita consumption. The

30For instance, suppose that a poor and a rich household switch ranks in the consumption distribution. This will not change the cross-sectional variance of consumption but represents a violation of consumption insurance.

52 matrix shows that there is substantial consumption mobility: in any year, about

60% of the households moves up or down in the consumption distribution.

In the remainder of the paper we attempt to understand which model of in- tertemporal consumption choice is capable of explaining the amount of consump- tion mobility we observe in the data. From the theoretical point of view, the consumption insurance model provides the clearest implications for consumption mobility. In a model where all idiosyncratic income shocks are insured, the initial cross-sectional distribution of consumption is a sufficient statistic for all future distributions, and therefore, apart from measurement error in consumption and taste shocks, the model predicts zero consumption mobility. On the other hand, the rule-of-thumb model is one where income shocks have the greatest impact on consumption; it therefore generates substantial consumption mobility. Finally, in models with optimizing agents and incomplete markets (such as the permanent income model or models with precautionary saving) households react mainly to permanent income shocks. Thus, the degree of mobility predicted by the model is intermediate between the two other models.

We carefully parametrize an income process to distinguish between transitory and permanent shocks and use the estimated parameters to simulate theoretically the degree of mobility stemming from each of the consumption models examined.

We then compare them statistically with the actual amount of mobility estimated in the data. The results reject statistically each of the simple representations of

53 the consumption decision rule, and reveal that households smooth income shocks to a lesser extent than implied by the PIH. A noteworthy feature of our method is that the estimates are robust to the presence of measurement error in consumption and taste shocks, although we cannot identify the separate contribution of these two components on consumption dynamics.

Several criteria suggest that our estimates describe the dynamics of the con- sumption distribution remarkably well. First, the estimates are able to match the empirical transition matrix cell by cell. Second, the results are robust with respect to different definitions of consumption (in per capita or per adult equiv- alent terms), to the presence of measurement error in income, taste shocks and to various other sensitivity checks on sample exclusions and definitions. Third, the results do not change when we estimate income and consumption mobility simultaneously, instead of relying on a two-step procedure. Finally, and most im- portantly, the group-specific estimates by education match the different patterns of consumption mobility we find in the data.

There are three important by-products of our analysis. First, we produce estimates of the sensitivity of consumption to permanent and transitory income shocks that are potentially useful to evaluate fiscal policy experiments that affect the timing of income receipts and, more generally, households’ budget constraints.

In this respect, we find considerable asymmetric response to transitory income shocks by education groups: a low response in the group with higher education

54 and a relatively high response for households with lower education.

Second, we provide a powerful test of the consumption insurance model. So far these tests have focused on mean and variance restrictions of the distribution of consumption growth. Mean restrictions require consumption growth to be orthogonal, on average, to idiosyncratic income shocks. If shocks are measured with error, however, these tests are biased towards the null hypothesis of full consumption insurance. Variance restrictions require the cross-sectional variance of consumption growth to be constant over time. But the variance might be stationary even if the underlying consumption distribution is shifting. Thus, variance restriction tests too are biased towards the null. Our test is free from these problems, because we look at the entire consumption distribution, not just its mean or variance. On the other hand, the implementation of this test and, more generally, the evaluation of consumption mobility requires genuine panel data and suitable assumptions about the distribution of measurement error and taste shocks, while mean and variance restriction tests can be performed with repeated cross-sectional data.

Finally, the estimates could be used to single out the separate contributions of incomplete markets, excess sensitivity, measurement error and taste shocks in generating the short and long run consumption mobility we observe in the panel data. One important question is to what extent the failure of complete markets is due to the unwillingness of society to forgo social mobility, an issue that we

55 plan to explore in future research.

56 A The simulated minimum χ2 estimator

Let P (θ)theq q transition matrix with typical element pij (θ), where θ is a × vector of k unknown parameters:

p11 (θ) p12 (θ) ... p1q (θ)    p21 (θ) p22 (θ) ... p2q (θ)  P (θ)=       ......         pq1 (θ) pq2 (θ) ... pqq (θ)      Conformably with P (θ)letP the q q empirical transition matrix with typical × element pij: b

b

p11 p12 ... p1q    pb21 pb22 ... pb2q  P =        b...b ...... b ...  b        pq1 pq2 ... pqq      b b b q The transition matrices P (θ)andP are subject to the restrictions j=1 pij (θ)=

q P 1and j=1 pij =1(i =1...q), respectively.b This creates a singularity problem P similar to theb one in the estimation of a full demand system. To avoid this problem, we drop one column (say, the q-th column) from both P (θ)andP.

Let p (θ)theq (q 1) 1 vector of true transition probabilities andb con- − × formably with p (θ)letp the q (q 1) 1 vector of estimated transition proba- − × b 57 bilities. The distance between the empirical and true transition probabilities is d (θ)=p p (θ), whose covariance matrix Ω (θ) is block diagonal with generic − 31 block: b

pi1(θ)(1 pi1(θ)) pi1(θ)pi2(θ) pi1(θ)piq 1(θ) − ... − ni ni ni  − −  pi2(θ)(1 pi2(θ)) pi2(θ)piq 1(θ) − ... −  ni ni  Ωi (θ)= −       ......     piq 1(θ)(1 piq 1(θ))   − − −   ni      for i =1...q (we assume ni = n/q is an integer for simplicity), so that:

Ω1 (θ) 0 ... 0    Ω2 (θ) ... 0  Ω (θ)=       ......         Ωq (θ)      From Chamberlain (1982), the minimum χ2 method solves the problem:

min d (θ)0 Wd (θ) θ where W is a weighting matrix. Call θ the minimum χ2 estimate of θ.Cham- berlain (1982) and others show that θ bis consistent, asymptotically normal with covariance matrix: b 31We neglect the extra randomness induced by the fact that the class boundaries are pre- estimated.

58 1 1 − − var θ = G θ 0 WG θ G θ 0 WΩ (θ) WG θ G θ 0 WG θ ³ ´ µ ³ ´ ³ ´¶ ³ ´ ³ ´ µ ³ ´ ³ ´¶ b b b b b b b

where G θ = ∂d θ /∂θ0 is the gradient matrix. It is a well known result that

³ ´ ³ ´ 1 the optimalb weightingb matrix (in the efficiency sense) is Ω (θ)− .Inthiscase:

1 0 1 − var θ = G θ Ω (θ)− G θ ³ ´ µ ³ ´ ³ ´¶ b b b In our case p (θ) has no closed form, so we replace it with an approximation based on simulations, as in the simulated method of moments (McFadden, 1989;

Duffie and Singleton, 1991). Recall that the generic pij (θ)is:

pij (θ)=Pr ln ch,t∗ i ln ch,t∗ 2 j, θ ∈ − ∈ ¡ ¯ ¢ ¯ e.g., the probability of making a transition to class i from class j conditioning on being in class j. The transition law for consumption is determined by (15), reproduced here:

λ + r (1 λ) r ln ch,t∗ =lnch,t∗ 2 + φ εh,t + − εh,t 1 λεh,t 2 + ζh,t + ζh,t 1 − 1+r 1+r − − − − µ ¶

+α (vh,t vh,t 2) − −

32 where in our case θ =(φαλ)0. For simulation purposes, we assume εh,τ ∼ 32We neglect the problems associated with the fact that r is given, and that σ2 , σ2 , ln ch,t∗ 2 ln ch,t∗ −

2 2 σε and σζ are pre-estimated.

59 2 2 2 N (0, σε), ζh,τ N 0, σζ , vh,τ N 0, σln c τ. ∼ ∼ h,∗ τ ∀ ¡ ¢ ³ ´ By construction, the normality of the income shocks and of measurement error generates a symmetric transition matrix for consumption. This feature of the simulations is consistent with the symmetry of the empirical matrix documented in Table 2. Our results do not depend on the normality assumption. We choose normality for simplicity, but note that any symmetric distribution would work as well, because it would imply a symmetric transition matrix.

Define uh = εh,t εh,t 1 εh,t 2 ζh,t ζh,t 1 vh,t vh,t 2 0 the vector of disturbances. − − − − ¡ ¢ For each household h,wedrawS independent realizations of uh, and store the

HS realizations (H being the number of households).33 It is necessary to keep

s these basic drawings of uh fixed when θ changes, in order to have good numerical and statistical properties of the estimators based on the simulations.

s Conditioning on the measured (not simulated) ln ch,t∗ 2,thesimulateduh,and − a choice for θ,oneobtains

s λ + r s (1 λ) r s s s s ln ch,t∗ =lnch,t∗ 2 + φ εh,t + − εh,t 1 λεh,t 2 + ζh,t + ζh,t 1 − 1+r 1+r − − − − µ ¶ s s +α vh,t vh,t 2 (A1) − − ¡ ¢ e e s 1 S s This allows computation of pij (θ). One can then define pij (θ)=S− s=1 pij (θ) P as the approximation of pij (θ) obtained by means of simulations.

33In each year we choose a sample size identical to the number of actual sample transitions

(for instance, it is 2,982 in 1991-93 and 3,211 in 1993-95).

60 Call the simulated distance d (θ)=p p (θ)wherep (θ) is the vector of − simulated transition probabilities with genericb element pij (θ). Note that the co-

a.s. 1 1 variance matrix of d (θ), Ω (θ) (1 + S− ) Ω (θ) where (1 + S− )isaninflating → factor of the variance of the true distance vector induced by the additional ran- domness of the simulations. With a large enough number of simulations, however, this factor plays little weight in practice.

Thechoiceofθ minimizes the simulated minimum χ2 criterion

1 min d (θ)0 Ω (θ)− d (θ) θ

Call θ the resulting solution. Then, the results in Lee and Ingram(1991),

McFadden(1989),e and Duffie and Singleton (1993) imply that θ is consistent, asymptotically normal with covariance matrix: e

1 1 1 − var θ = 1+ G θ 0 Ω θ − G θ S ³ ´ µ ¶· ³ ´ ³ ´ ³ ´¸ e e e e Goodness of fit can be assessed using:

1 0 − 2 m = d θ Ω θ d θ χq(q 1) k ∼ − − ³ ´ ³ ´ ³ ´ e e e Note that when q (q 1) >kas in our empirical application, this goodness − of fit statistics can be interpreteted as an overidentifying restriction statistics.

61 This is because we estimate k parameters but minimize the distance between q (q 1) >kactual and theoretical transition probabilities. − The algorithm that we implement is thus the following:

s 1. Draw uh (h =1...H, s =1...S).

2. Choose a starting value for θ,sayθ0.

3. Compute ln c∗ using (A1), p (θ0), and dij (θ0)=pij p (θ0)(i =1...q, h,t ij − ij j =1...q 1). − b 1 4. Compute d (θ0)0 Ω (θ0)− d (θ0).

5. Update the value of θ.

6. Repeat steps 3-5 until a pre-specified convergence criterion is met. Even- tually this provides the required simulated minimum χ2 estimate θ of θ.

We update the value of θ using the simulated annealing methode of Goffe,

Ferrier and Rogers (1994).34 This is a derivative-free minimization method that escapes local minima. Starting from an initial value, the algorithm takes a step and evaluates the function. Downhill steps are always accepted, while uphill steps are accepted probabilistically according to the Metropolis criterion. As the algorithm proceeds, the length of the step declines until the χ2 reaches the global minimum. 34We use the Gauss code on simulated annealing written by E.G. Tsionas and available at http://www.american.edu/academic.depts/cas/econ/gaussres/optimize/optimize.htm.

62 BTestequivalence

Here we prove the statement in Section 2 that the χ2 goodness of fit criterion (2):

q q 2 (pij pij (θ)) ni − (A2) p (θ) i=1 j=1 ij X X b 1 is equivalent to (p p (θ)) Ω (θ)− (p p (θ))0. − − 2 Note firstthat(A2)isthesumofb b q independent χ distributions of the form

q 1 2 2 mi = nip− (pij pij (θ)) .Thesumofq independent χ distributions is j=1 ij − P2 also a χ distributionb with degrees of freedom equal to the sum of the degrees of freedom of the χ2 distributions that are summed.

Notice that the theoretical and empirical transition probabilities are subject

q q to the restrictions j=1 pij (θ)=1and j=1 pij =1(i =1...q), respectively.

q P 1 2P Thus mi = nipij (θ)− (pij pij (θ)) can be rewritten as: j=1 − b P b

q 1 q 1 2 − n n − m = i (p p (θ))2 + i (p p (θ)) i p (θ) ij ij p (θ) ij ij j=1 ij − iq " j=1 − # X X q 1 q 1 q 1 − n b n − b − = i (p p (θ))2 + i (p p (θ)) (p p (θ)) p (θ) ij ij p (θ) ij ij ij ij j=1 ij − iq j=1 − j=1 − X X X b b b or more compactly:

mi = di (θ)0 Ai (θ) di (θ)+di (θ)0 Bi (θ) di (θ)

= di (θ)0 Λi (θ) di (θ)

63 where di (θ)=pi pi (θ) is the distance between empirical and true transition − probabilities inb row i of the transition matrix (excluding the q-th column), and

Λi (θ)=Ai (θ)+Bi (θ), with:

ni 0 ... 0 p11(θ)   0 ni ... 0  p12(θ)  Ai (θ)=       ......       00... ni   piq 1(θ)   −    and Bi (θ)=(ni/piq (θ)) ii0,wherei is a (q 1) 1 vector of ones, so that Bi (θ) − × is a matrix that contains (ni/piq (θ)) everywhere. It’s easy to prove that Λi (θ)=

1 Ωi (θ)− defined in Appendix B. Since asymptotically di (θ) N (0, Ω (θ)), it ∼ i follows that:

q 2 (pij pij (θ)) 1 m = n − = d (θ)0 Ω (θ)− d (θ) i i p (θ) i i i j=1 ij X b is distributed χ2 with (q 1) degrees of freedom. Moreover: −

q q 2 q q (pij pij (θ)) 1 1 ni − = mi = di (θ)0 Ωi (θ)− di (θ)=d (θ)0 Ω (θ)− d (θ) p (θ) i=1 j=1 ij i=1 i=1 X X b X X is distributed χ2 with q (q 1) degrees of freedom. This is exactly the function − that we minimize in the simulated minimum χ2 application. This proves the

q q 1 2 1 equivalence between nipij (θ)− (pij pij (θ)) and d (θ)0 Ω (θ)− d (θ). i=1 j=1 − P P b 64 2 q q 1 2 An alternative to the minimum χ criterion nipij (θ)− (pij pij (θ)) i=1 j=1 − 2 Pq Pq 1 2 is to use the modified minimum χ criterion nip− (pij pij (θ)) .Fol- i=1 j=1 ij − b P P lowingthesamestepsabove,onecanshowthat: b b

q q 2 (pij pij (θ)) 1 ni − = d (θ)0 Ω− d (θ) p i=1 j=1 ij X X b b where Ω is a block-diagonal matrixb with generic block:

b

pi1(1 pi1) pi1pi2 pi1piq 1 − ... − ni − ni − ni  b b b b b b  pi2(1 pi2) pi2piq 1 − ... −  ni ni  Ωi =  −   b b b b     ......  b    piq 1(1 piq 1)   − − −   ni     b b  a.s. Since pij is a consistent estimate of pij (θ), Ω Ω (θ). In the estimation, → 2 we use theb modified simulated minimum χ criterion,b i.e. use Ω (based on the empirical transition probabilities) as an estimate of Ω (θ). b

C The simulated minimum χ2 estimator in the

extended case

Let Px (θ)beaq q transition matrix for variable x with typical element px (θ); × ij x x x P is its empirical analog with typical element pij.Letp (θ)bethestacked qb(q 1) 1 vector of true transition probabilities obtained after dropping the q- − × b

65 th column of the transition matrix Px (θ) to avoid singularity; and px its empirical analog. b

Define

pc (θ) pc     p (θ)= y and p = y  p (θ)   pb               R (θ)  b  bR          where superscripts c and y refer to (log) consumptionb and income, respectively,

R (θ) is the Spearman joint rank correlation of true consumption and income growth, and R its empirical analog.

The distanceb between empirical and true statistics is d (θ)=p p (θ), whose − 35 covariance matrix Ω (θ) we assume to be block-diagonal b

Ωc (θ) 00   Ω (θ)= y  Ω (θ) 0         ω (θ)      with ω (θ)= 1 R (θ)2 / (n 2), − − 35We neglect¡ the extra randomness¢ induced by the fact that the class boundaries are pre- estimated. We also ignore correlation between consumption transition probabilities and income transition probabilities.

66 x Ω1 (θ) 0 ... 0   x  Ω2 (θ) ... 0  Ωx (θ)=       ......       x   Ωq (θ)      and

px (θ) 1 px (θ) x x px (θ)px (θ) i1 ( i1 ) pi1(θ)pi2(θ) i1 iq 1 − ... − ni ni ni  − −  x x x x pi2(θ)(1 pi2(θ)) pi2(θ)piq 1(θ) − ... −  ni ni  Ωx (θ)= −  i      ......    x x  piq 1(θ)(1 piq 1(θ))   − − −   ni      for i =1...q (we assume ni = n/q is an integer for simplicity).

The estimation strategy now proceeds as explained in Appendix 1. The only crucial difference is that we simulate both consumption growth and (residual) income growth using the transition laws

λ + r (1 λ) r ln ch,t∗ =lnch,t∗ 2 + φ εh,t + − εh,t 1 λεh,t 2 + ζh,t + ζh,t 1 − 1+r 1+r − − − − µ ¶

+α (vh,t vh,t 2) − − and

67 ln yh,t =lnyh,t 2 +(dt dt 2)+β (Xh,t Xh,t 2)+ζh,t + ζh,t 1 + εh,t εh,t 2. − − − − − − − −

68 References

[1] Alan, Sule, and Martin Browning (2003). “Estimating Euler Equations Us-

ing Simulated Residual Estimation.” CAM Working Paper 3, University of

Copenhagen.

[2] Altonji, Joseph G., and Lewis M. Segal (1996). “Small-Sample Bias in GMM

Estimation of Covariance Structures.” Journal of Business and Economic

Statistics 14, 353-66.

[3] Anderson, Theodore W., and Leo A. Goodman (1957). “Statistical Inference

about Markov Chains.” Annals of Mathematical Statistics 28, 89-110.

[4] Attanasio, Orazio P. (2000). “Consumption.” In Handbook of Macroeco-

nomics, John B. Taylor and Michael Woodford. Amsterdam: Elsevier Sci-

ence.

[5] Attanasio, Orazio P., and Stephen J. Davis (1996). “Relative Wage Move-

ments and the Distribution of Consumption.” Journal of Political Economy

104, 1227-62.

[6] Attanasio, Orazio P., and Tullio Jappelli (2001). “Intertemporal Choice and

the Cross-Sectional Variance of Marginal Utility.” Review of Economics and

Statistics 83, 13-27.

69 [7] Banerjee, Abhijit V. and Andrew F. Newman (1991). “Risk-Bearing and the

Theory of Income Distribution.” Review of Economic Studies 58, 211-35.

[8] Bishop Yvonne M., Stephen E. Fienberg, and Paul W. Holland (1975), Dis-

crete multivariate analysis: Theory and Practice.Cambridge:MITPress.

[9] Blundell, Richard W., and Ian Preston (1998). “Consumption Inequality and

Income Uncertainty.” Quarterly Journal of Economics 113, 603-640.

[10] Blundell, Richard W., Luigi Pistaferri, and Ian Preston (2003). “Partial In-

surance, Information, and Consumption Dynamics.” CEPR Discussion Pa-

per 3666.

[11] Brandolini, Andrea, and Luigi Cannari (1994). “Methodological Appendix:

The Bank of Italy Survey of Household Income and Wealth.” In Saving and

the Accumulation of Wealth. Essays on Italian Households and Government

Behavior, A. Ando, L. Guiso and I. Visco eds. Cambridge: Cambridge Uni-

versity Press.

[12] Browning, Martin, and Thomas F. Crossley (2001). “The Life-Cycle Model

of Consumption and Saving.” Journal of Economic Perspectives 15, No. 3,

3-22.

[13] Campbell, John Y., and Gregory N. Mankiw (1989). “Consumption, Income,

and Interest Rates: Reinterpeting the Time-Series Evidence.” In NBER

70 Macroeconomics Annual 1989, Olivier J. Blanchard and Stanley Fischer eds.,

185-216. Cambridge: MIT Press.

[14] Carroll, Christopher D. (2001). “Precautionary Saving and the Marginal

Propensity to Consume out of Permanent Income.” NBER Working Paper

8233.

[15] Carroll, Christopher D., and Andrew A. Samwick (1997). “The Nature of

Precautionary Wealth.” Journal of Monetary Economics 40, 41-72.

[16] Cochrane, John (1991), “A Simple Test of Consumption Insurance.” Journal

of Political Economy 99, 957-976.

[17] Cutler, David M., and Lawrence F. Katz (1992). “Rising Inequality?

Changes in the Distribution of Income and Consumption in the 1980s.”

American Economic Review 82, 546-51.

[18] Deaton, Angus (1997). The Analysis of Household Surveys. Baltimore: The

Johns Hopkins University Press.

[19] Deaton, Angus, and Christina Paxson (1994). “Intertemporal Choice and

Inequality.” Journal of Political Economy 102, 384-94.

[20] Duffie, Darrell and Kenneth Singleton (1993). “Simulated Moments Estima-

tion of Markov Models of Asset Prices.” Econometrica 61, 929-52.

71 [21] Formby, John P., W. James Smith, and Buhong Zheng (2001). “Mobility

Measurement, Transition Matrices, and Statistical Inference.” University of

Colorado at Denver, Working Paper n. 2001-04.

[22] Friedman, Milton (1957). A Theory of the Consumption Function.Princeton:

Princeton University Press.

[23] Goffe, William, Gary Ferrier and John Rogers (1994). “Global Optimization

of Statistical Functions with Simulated Annealing.” Journal of Econometrics

60, 65-99.

[24] Gourinchas, Pierre-Olivier and Jonathan Parker (2002). “Consumption Over

theLifeCycle.”Econometrica 70, 47-91.

[25] Hall, Robert E., and Frederic Mishkin (1982). “The Sensitivity of Consump-

tion to Transitory Income: Evidence from PSID Households.” Econometrica

50, 461-81.

[26] Heathcote, Jonathan, , and Gianluca Violante (2003).

“The Welfare Implications of Rising Wage Inequality in the US.” mimeo,

IIE, Stockholm University.

[27] Jappelli, Tullio, Stephen Pischke, and Nicholas Souleles (1998). “Testing

for Liquidity Constraints in Euler Equations with Complementary Data

Sources.” The Review of Economics and Statistics 80, 251-62.

72 [28] Johnson, David, and Stephanie Shipp (1997). “Trends in Inequality Using

Consumption Expenditures: the US from 1960 to 1993.” Review of Income

and Wealth 43, 133-151.

[29] Kimball, Miles S. (1990). “Precautionary Saving in the Small and in the

Large.” Econometrica 58, 53-73.

[30] Krueger, Dirk, and Fabrizio Perri (2003). “On the Welfare Consequences

of the Increase in Inequality in the United States.” NBER Macroeconomics

Annual 2003. Cambridge: MIT Press.

[31] Laibson, David (1997). ”Golden Eggs and Hyperbolic Discounting.” Quar-

terly Journal of Economics 112, 443-78.

[32] Lee, Bong-Soo and Beth Ingram (1991). “Simulation Estimation of Time

Series Model.” Journal of Econometrics 47, 197-205.

[33] Lillard, Lee, and Robert Willis (1978). “Dynamic Aspects of Earning Mobil-

ity.” Econometrica 46, 985-1012.

[34] Mace, Barbara (1991). “Full Insurance in the Presence of Aggregate Uncer-

tainty.” Journal of Political Economy 99, 928-56.

[35] Mas-Colell, Andreu, Michael Whinston, and Jerry Greene (1995). Microeco-

nomic Theory. Oxford: Oxford University Press.

73 [36] McFadden, Daniel (1989). “A Method of Simulated Moments for Estimation

of Discrete Response Models Without Numerical Integration.” Econometrica

57, 995-1026.

[37] Neyman, Jerzy (1949). “Contributions to the Theory of the χ2 Test.” In:

Proceedings of the Berkeley Symposium on Mathematical Statistics and Prob-

ability. Berkeley: University of California Press.

[38] Parker, Jonathan A. (1999). “The Reaction of Household Consumption to

Predictable Changes in Social Security Taxes.”

89, 959-73.

[39] Phelan, Christopher (1994). “Incentives, Insurance, and the Variability of

Consumption and Leisure.” Journal of Economic Dynamics and Control 18,

581-99.

[40] Shorrocks, Anthony F. (1978). “The Measurement of Mobility.” Economet-

rica 46, 1013-24.

[41] Silverman, Bernard W. (1986). Density Estimation for Statistics and Data

Analysis. Chapman and Hall.

[42] Souleles, Nicholas S. (1999). “The Response of Household Consumption to

Income Tax Refunds.” American Economic Review 89, 947-58.

74 [43] Townsend, Robert (1994). “Risk and Insurance in Village India.” Economet-

rica 62, 539-91.

[44] Zhang, Qiang, and Masao Ogaki (2001). “Decreasing Risk Aversion and

Tests of Risk Sharing.” Econometrica 69, 515-26.

75 Table 1

Descriptive Statistics

Cross-sectional means and variances are computed using sample weights. The variables ct and yt denote household non-durable consumption and disposable income, respectively. Demo- graphic characteristics refer to the household head.

76 1987 1989 19911993 1995 All years

ln ct 9.90 10.08 10.02 10.0110.00 10.02 var (ln ct) 0.26 0.26 0.29 0.29 0.27 0.28

Gini coefficient of ct 0.28 0.27 0.29 0.29 0.27 0.28 ln yt 10.25 10.40 10.36 10.27 10.27 10.32 var (ln yt) 0.39 0.37 0.37 0.57 0.47 0.45

Gini coefficient of yt 0.35 0.32 0.32 0.36 0.35 0.34

South 0.41 0.37 0.34 0.36 0.39 0.37

North 0.43 0.46 0.48 0.47 0.43 0.46

Family size 3.153.123.043.073.01 3.07

Self-employed 0.20 0.170.170.160.150.16

Years of schooling 7.38 7.97 8.198.038.108.03

Less well educated 0.78 0.72 0.70 0.72 0.70 0.72

More educated 0.22 0.28 0.30 0.28 0.30 0.28

Age 52.00 52.52 52.78 53.05 55.03 53.22

Born 1940 0.60 0.58 0.54 0.49 0.49 0.53 ≤ Born >1940 0.40 0.42 0.46 0.51 0.51 0.47

Income recipients 1.63 1.72 1.72 1.74 1.78 1.73

Number of obs. 1,097 2,717 4,036 4,006 3,211 15,067

77 Table 2

The Transition Matrix of Consumption

The table reports consumption transitions from period t 2toperiodt.Thegenericelement − of this table is pij , the estimated probability of moving from quartile i in period t 2toquartile − j in period t.Define nij as the number of households that move from quartile i in period t 2 b − to quartile j in period t and ni = i nij as the total number of observations in each row i of the transition matrix. The maximumP likelihood estimator of the first-order Markov transition probabilities is then: pij = nij/ni.

b

78 1987-89

1989 quartile

1987 quartile 1st 2nd 3rd 4th

1st 0.71 0.20 0.07 0.02

2nd 0.23 0.42 0.27 0.08

3rd 0.08 0.29 0.40 0.23

4th 0.03 0.09 0.29 0.60

1989-91

1991 quartile

1989 quartile 1st 2nd 3rd 4th

1st 0.66 0.25 0.07 0.01

2nd 0.25 0.41 0.27 0.06

3rd 0.100.270.41 0.25

4th 0.01 0.07 0.25 0.68

1991-93

1993 quartile

1991 quartile 1st 2nd 3rd 4th

1st 0.63 0.26 0.08 0.02

2nd 0.23 0.38 0.29 0.09

3rd 0.11 0.28 0.37 0.25

4th 0.04 0.100.260.60

79

1993-95

1995 quartile Table 3

Parameter Estimates and χ2 Statistics

The table reports simulated minimum χ2 estimates of the parameters φ, λ and α (asymp- totic standard errors in parenthesis) and the χ2 goodness of fit statistic (p-value of the test in square brackets). In column (2) we imposes the (acceptable) restriction that φ = 1.Theesti-

2 2 mates of σζ and σε are obtained in a first step by imposing restrictions on the autocovariances of income growth (see Section 4).

(1)(2)

φ 0.9875 1.0000 (0.0230)

λ 0.1586 0.1622 (0.0377) (0.0248) α 0.3875 0.3822 (0.0186) (0.0116)

2 Variance of permanent shocks (σζ)0.0267 (0.0135)

2 Variance of transitory shocks (σε)0.0794 (0.0123)

χ2 goodness of fitstatistic 15.21 15.22 [0.0853] [0.1241]

Table 4

Simulated and Empirical Transition Matrix of Consumption

The table reports the simulated consumption transitions between 1993 and 1995 and, in parenthesis, the empirical consumption transitions. The simulated transitions probabilities are obtained from the estimates reported in column (2) of Table 3.

80 1995 quartile

1993 quartile 1st 2nd 3rd 4th

1st 0.6748 0.2515 0.0677 0.0061 (0.6700) (0.2528) (0.0660) (0.0112)

2nd 0.2513 0.4111 0.2748 0.0628 (0.2416) (0.4259) (0.2665) (0.0660)

3rd 0.0675 0.2764 0.4175 0.2386 (0.0660) (0.2653) (0.4346) (0.2341)

4th 0.0061 0.0613 0.2401 0.6926 (0.0237) (0.0549) (0.2332) (0.6883)

81 Table 5

Parameter Estimates and χ2 Statistics

2 2 2 The table reports simulated minimum χ estimates of the parameters φ, λ, α, σζ,andσε

(asymptotic standard errors in parenthesis), and the χ2 goodness of fit statistic (p-value of the test in square brackets).

φ 0.9776 (0.0551) λ 0.1542 (0.0735) α 0.3960 (0.0192)

2 Variance of permanent shock (σζ)0.0246 (0.0039)

2 Variance of transitory shock (σε)0.0501 (0.0055)

χ2 goodness of fit statistic 97.81 [<0.0001]

Table 6

Simulated and Empirical Transition Matrices of Consumption and Income

Panel A of the table reports the simulated consumption transitions between 1993 and 1995 and, in parenthesis, the empirical consumption transitions. Panel B repeats for income. Panel

C contains the Spearman joint rank correaltion of consumption and income growth.

82 Panel A: Consumption

1995 quartile

1993 quartile 1st 2nd 3rd 4th

1st 0.6764 0.2502 0.0677 0.0056 (0.6700) (0.2528) (0.0660) (0.0112)

2nd 0.2506 0.4132 0.2739 0.0623 (0.2416) (0.4259) (0.2665) (0.0660)

3rd 0.0667 0.2752 0.4184 0.2398 (0.0660) (0.2653) (0.4346) (0.2341)

4th 0.0059 0.0617 0.2401 0.6923 (0.0237) (0.0549) (0.2332) (0.6883)

Panel B: Income

1995 quartile

1993 quartile 1st 2nd 3rd 4th

1st 0.6793 0.2460 0.0678 0.0068 (0.6588) (0.2204) (0.0872) (0.0336)

2nd 0.2436 0.4091 0.2766 0.0707 (0.2503) (0.4496) (0.2204) (0.0797)

3rd 0.0696 0.2785 0.4081 0.2438 (0.0560) (0.2565) (0.4832) (0.2042)

4th 0.0070 0.0667 0.2475 0.6787 (0.0349) (0.0736) (0.2095) (0.6820)

Panel C: Joint rank correlation

0.4225 (0.4216)

83 Table 7 Simulated and Empirical Mobility for Different Education Groups

The first panel reports summary statistics for two education groups: the variance of the income shocks, the estimates of α and λ (asymptotic standard errors in parenthesis), the asso- ciated χ2 goodness of fit statistic (p-value of the test in square brackets), and the χ2 statistic of the test that the parameters of the two education groups are the same (p-value in square brackets). The education groups are defined as compulsory schooling or less, and high school or college. The other two panels report the simulated transition probabilities and (in parenthesis) the empirical transition probabilities for the two education groups.

Low education High education

2 Variance of permanent shock (σζ)0.0296 0.0198 (0.0115) (0.0347)

2 Variance of transitory shock (σε)0.0754 0.0895 (0.0097) (0.0329)

λ 0.3991 0.0889 (0.0459) (0.0470)

α 0.3814 0.2846 (0.0120) (0.0125)

χ2 goodness of fit statistic 23.03 20.85 [0.0106] [0.0222] χ2 test of parameter equality 55.85 [7.4e 013] −

84 Low education

1995 quartile

1993 quartile 1st 2nd 3rd 4th

1st 0.6432 0.2610 0.0845 0.0113 (0.6449) (0.2633) (0.0830) (0.0088)

2nd 0.2413 0.3863 0.2848 0.0877 (0.2240) (0.4208) (0.2532) (0.1020)

3rd 0.0733 0.2654 0.3913 0.2700 (0.0780) (0.2524) (0.4421) (0.2277)

4th 0.0099 0.0770 0.2498 0.6632 (0.0327) (0.0750) (0.2500) (0.6423)

High education

1995 quartile

1993 quartile 1st 2nd 3rd 4th

1st 0.7037 0.2485 0.0457 0.0022 (0.7016) (0.2258) (0.0605) (0.0121)

2nd 0.2472 0.4463 0.2686 0.0379 (0.2308) (0.4656) (0.2389) (0.0648)

3rd 0.0464 0.2681 0.4661 0.2194 (0.0242) (0.2460) (0.4839) (0.2460)

4th 0.0015 0.0373 0.2156 0.7456 (0.0163) (0.0569) (0.2398) (0.6870)

85 Estimated model λ=0.16, α=0 α PIH, =0 .6

.55

.5 Probability of leaving .45 st 1 quartile .4

.35

.3

.25

.2 0 2 4 6 8 10 12 14 16 18 20 Period

Figure 1: The probability of leaving the first quartile of the consumption distri-

bution.

86 CFS Working Paper Series:

No. Author(s) Title

2005/19 Yunus Aksoy A Quantitative Exploration of the Opportunistic Athanasios Orphanides Approach to Disinflation David Small Volker Wieland David Wilcox

2005/20 Yannis Bilias Equity Culture and the Distribution of Wealth Dimitris Georgarakos Michael Haliassos

2005/21 Silke Brandts ART Versus Reinsurance: The Disciplining Effect Christian Laux of Information Insensitivity

2005/22 Sean D. Campbell Stock Returns and Expected Business Conditions: Francis X. Diebold Half a Century of Direct Evidence

2005/23 Franklin Allen Credit Market Competition and Capital Regulation Elena Carletti Robert Marquez

2005/24 Neil A. Doherty Insuring the Uninsurable: Brokers and Incomplete Alexander Muermann Insurance Contracts

2005/25 Franklin Allen Credit Risk Transfer and Contagion Elena Carletti

2005/26 Michael Haliassos Credit Card Debt Puzzles Michael Reiter

2005/27 Luigi Guiso Trusting the Stock Market Paola Sapienza Luigi Zingales

2005/28 Tullio Jappelli Intertemporal Choice and Consumption Mobility Luigi Pistaferri

Copies of working papers can be downloaded at http://www.ifk-cfs.de