Thesis
Robust methods for personal income distribution models
VICTORIA-FESER, Maria-Pia
Abstract
In the present thesis, robust statistical techniques are applied and developed for the economic problem of the analysis of personal income distributions and inequality measures. We follow the approach based on influence functions in order to develop robust estimators for the parametric models describing personal income distributions when the data are censored and when they are grouped. We also build a robust procedure for a test of choice between two models and analyse the robustness properties of goodness-of-fit tests. The link between economic and robustness properties is studied through the analysis of inequality measures. We begin our discussion by presenting the economic framework from which the statistical developments are made, namely the study of the personal income distribution and inequality measures. We then discuss the robust concepts that serve as basis for the following steps and compute optimal bounded-influence estimators for different personal income distribution models when the data are continuous and complete. In a third step, we study the case of censored data and propose a generalization of the EM [...]
Reference
VICTORIA-FESER, Maria-Pia. Robust methods for personal income distribution models. Thèse de doctorat : Univ. Genève, 1993, no. SES 384
URN : urn:nbn:ch:unige-64509 DOI : 10.13097/archive-ouverte/unige:6450
Available at: http://archive-ouverte.unige.ch/unige:6450
Disclaimer: layout of this document may differ from the published version.
1 / 1
2 Robust Methods for Personal Income Distribution Models
Maria-Pia Victoria Feser
Submitted for the degree of Ph.D in Econometrics and Statistics
Faculty of Economic and Social Sciences University of Geneva, Switzerland
Accepted on the recommendation of Dr. A.C. Atkinson, professor, London, Dr. P. Balestra, professor, Geneva, Dr. U. Kohli, professor, Geneva, Dr. E. Ronchetti, professor, Geneva, supervisor, Dr. P. Rousseeuw, professor, Brussels.
Thesis No. 384
May 1993 i
To Johannes, with love. ii
Abstract In the present thesis, robust statistical techniques are applied and developed for the economic problem of the analysis of personal income distributions and inequality measures. We follow the approach based on influence functions in order to develop robust estimators for the parametric models describing personal income distributions when the data are censored and when they are grouped. We also build a robust procedure for a test of choice between two models and analyse the robustness properties of goodness-of-fit tests. The link between economic and robustness properties is studied through the analysis of inequality measures. We begin our discussion by presenting the economic framework from which the statistical developments are made, namely the study of the per- sonal income distribution and inequality measures. We then discuss the robust concepts that serve as basis for the following steps and compute opti- mal bounded-influence estimators for different personal income distribution models when the data are continuous and complete. In a third step, we study the case of censored data and propose a generalization of the EM algorithm with robust estimators. For grouped data, Hampel’s theorem is extended in order to build optimally bounded-influence estimators for grouped data. We then focus on tests for model choice and develop a robust generalized Cox-type statistic. We also analyse the robustness properties of a wide class of goodness-of-fit statistics by computing their level influence functions. Fi- nally, we study the robustness properties of inequality measures and relate our findings with some economic properties these measures should fulfil. Our motivation for the development of these new robust procedures comes from our interest in the field of income distribution and inequality measurement. However, it should be stressed that the new estimators and tests procedures we propose do not only apply in this particular field, but they can be used in or extended to any parametric problem in which density estimation, incomplete information, grouped or discrete data, model choice, goodness-of-fit, concentration index, is one of the key words. iii
R´esum´e Dans cette th`ese, nous developpons et appliquons certaines techniques de la statistique robuste au probl`eme ´economique de l’analyse de la distribution du revenu personnel et des mesures d’in´egalit´e. Nous utilisons l’approche bas´ee sur les fonctions d’influence afin de developper des estimateurs ro- bustes pour les mod`eles param´etriques d´ecrivant la distribution du revenu personnel lorsque les donn´ees sont censur´ees et lorsqu’elles sont group´ees. Nous construisons aussi des proc´edures robustes pour tester le choix entre deux mod`eles et analysons les propri´etes de robustesse de tests d’ad´equation. Le lien entre certaines propri´etes ´economiques et de robustesse est ´etudi´eau moyen des mesures d’in´egalit´e. Nous commen¸cons notre discussion par une pr´esentation du cadre ´eco- nomique dans lequel nous nous situons,asavoirl’´ ` etude de la distribution du revenu personnel et des mesures d’in´egalit´e associ´ees. Nous exposons ensuite les concepts de la statistique robuste qui nous sont utiles par la suite et cal- culons des estimateurs optimaux `a influence born´ee pour diff´erents mod`eles de distribution de revenu personnel lorsque les donn´ees sont continues et compl`etes, simul´ees ou r´eelles. Dans un troisi`eme temps, nous ´etudions le cas des donn´ees censur´ees et proposons une g´en´eralisation de l’algorithme EM avec des estimateurs robustes. Le th´eor`eme de Hampel est ensuite ´etendu au cas des donn´ees group´ees et des estimateurs robustesa ` influence born´ee de fa¸con optimale sont propos´es. Plus tard, nous nous concentrons sur les proc´edures de choix de mod`ele et d´eveloppons une statistique de test robuste de type Cox. Nous analysons aussi les propri´et´es de robustesse d’une large classe de statistiques de test d’ad´equation en calculant les cor- respondantes fonctions d’influence sur le niveau. Finalement, nous ´etudions les propri´etes de robustesse de mesures d’in´egalite en fonction des propri´et´es ´economiques que ces derni`eres doivent satisfaire. Le d´eveloppement de nouveaux estimateurs et de nouvelles proc´edures de test a ´et´emotiv´eparnotreint´erˆet au probl`eme de l’´etude des distributions de revenu personnel et des mesures d’in´egalit´e. Cependant, il est utile de mettre en ´evidence le fait que les nouveaux estimateurs et les nouvelles proc´edures de test que nous proposons ne sont pas seulement applicables dans ce domaine particulier. En effet, ils peuvent ˆetre appliqu´es ou ´etendus `adesprobl`emes param´etriques dans lesquels des termes comme estimation de densit´e, information incompl`ete, donn´ees group´ees ou discr`etes, choix de mod`ele, tests d’ad´equation, indices de concentration sont des mot-cl´es. iv
Acknowledgement I would like to express my gratitude to Prof. E. Ronchetti for his valuable suggestions and his generous guidance throughout the course of this research. His encouragement as expert and as friend have made this work possible. I am also grateful to Prof. A. C. Atkinson and Dr. F. Cowell for their support during my research at the London School of Economics and to Prof. P. Balestra, Prof. U. Kohli and Prof. P. Rousseeuw for their comments during the defense. My thanks also go to my friends and colleagues of the faculty of eco- nomic and social sciences of the University of Geneva for their stimulating discussions and their moral support, especially to S. H´eritier for his helpful comments during the preparation of the defense. Finally, I would like to express my grateful thanks to my parents, for their love, encouragement and support during most of my student life. Contents
1 Introduction 1
2 Income Distribution and Inequality 7 2.1Introduction...... 7 2.2Thegenerationanddistributionofincome...... 8 2.3TheLorenzcurveandanalysisofinequality...... 10 2.3.1 Definitionandconstruction...... 10 2.3.2 Anorderingtool...... 13 2.4Incomeinequalitymeasures...... 14 2.5Parametricmodelsforincomedistributions...... 18 2.5.1 Generatingsystems...... 19 2.5.2 Propertiesofincomedistributionmodels...... 20 2.5.3 Mostcommonlyusedmodels...... 22 2.6Statisticalaspectsoftheanalysisofincome...... 24
3 OBRE with Complete Information 27 3.1Introduction...... 27 3.2Robustnessconcepts...... 30 3.2.1 Definitions...... 30 3.2.2 Continuityandqualitativerobustness...... 31 3.2.3 The influence function ...... 31 3.2.4 The influence function and robustness measures . . . . 32 3.2.5 Thebreakdownpoint...... 33 3.3Optimalrobustestimators...... 34 3.3.1 Optimalityresults...... 34 3.3.2 Computationalaspects...... 39 3.3.3 Howtochoosetheboundc ...... 42 3.4Applicationtotwoincomedistributionmodels...... 44 3.4.1 Simulationresults...... 44
v vi CONTENTS
3.4.2 Applicationtorealdata...... 50 3.5Propertiesofrobustestimators...... 57 3.5.1 Efficiency...... 57 3.5.2 Sensitivity...... 57 3.5.3 Breakdownpoint...... 61
4 OBRE with Incomplete Information 65 4.1TheEMalgorithm...... 65 4.1.1 Introduction...... 65 4.1.2 DefinitionoftheEMalgorithm...... 66 4.1.3 Discussionandexample...... 67 4.1.4 Generalization...... 68 4.2OBREwithincompleteinformation...... 70 4.2.1 TheEMMalgorithm...... 70 4.2.2 The EMM algorithm and robust estimators ...... 71 4.2.3 Comparisonwiththeclassicalapproach...... 74 4.3Empiricalresults...... 76 4.3.1 Robustestimates...... 76 4.3.2 ComparisonwiththeMLE...... 77 4.3.3 Conclusion...... 80
5 Robust Estimators for Grouped Data 83 5.1Theproblem...... 83 5.2 Classical estimators and their IF ...... 86 5.2.1 Minimumpowerdivergenceestimators...... 86 5.2.2 Influence function ...... 87 5.3Ageneralclassofestimators...... 90 5.4 Influence function of MGP-estimators ...... 92 5.5Robustestimatorswithgroupeddata...... 94 5.5.1 Optimalityproblem...... 94 5.5.2 Computation...... 97 5.5.3 Efficiency...... 98 5.6Simulationresultsandconclusion...... 99 5.6.1 Simulationresults...... 99 5.6.2 Localshiftsensitivity...... 100 5.6.3 Conclusion...... 102
6 Robust Tests for Model Choice 103 6.1Introduction...... 103 6.2Classicaltests...... 107 CONTENTS vii
6.2.1 TheCoxstatistic...... 107 6.2.2 TheAtkinsonstatistic...... 109 6.2.3 Otherapproaches...... 110 6.3Smallsampleproperties...... 111 6.3.1 Introduction...... 111 6.3.2 Simulationstudy...... 112 6.3.3 A Cox-type statistic with a parametric bootstrap . . . 114 6.4Robustnessproperties...... 117 6.4.1 Robustnessandtests...... 118 6.4.2 Level influence function ...... 120 6.4.3 Simulationresults...... 124 6.5Robustmodelchoicetests...... 125 6.5.1 Someadhocrobusttests...... 125 6.5.2 Robust bounded-influence LM test ...... 128 6.5.3 RobustCox-typestatistic...... 130 6.5.4 Simulationstudy...... 132 6.6Conclusion...... 133
7 Robustness and Goodness-of-Fit Tests 135 7.1Introduction...... 135 7.2Robustnessandgoodness-of-fittechniques...... 139 7.3LIFoftheCressieandReadstatistic...... 141 7.4Simulationstudy...... 147
8 Robustness and Inequality Measures 151 8.1Introduction...... 151 8.2 Decomposability and mean-preserving cont...... 154 8.2.1 Generalproperties...... 155 8.2.2 Robustnessproperties...... 156 8.2.3 Kolm’sindex...... 157 8.2.4 Generalizedentropyindex...... 158 8.2.5 TheGiniindex...... 159 8.3Arbitrarycontaminations...... 161 8.3.1 Simulationstudy...... 162 8.4Parametricestimationapproach...... 163 8.4.1 Influence function of generalized entropy indexes . . . 163 8.4.2 Simulationresults...... 165 8.5Conclusion...... 166
9 Conclusion 169 viii CONTENTS
A Functional Forms for PID 173 A.1Terminologyandnotations...... 173 A.2ParetotypeI...... 174 A.3ParetotypeII...... 174 A.4ParetotypeIII...... 175 A.5Gammadistribution...... 175 A.6Beninidistribution...... 176 A.7Vincidistribution...... 176 A.8GeneralizedGammadistribution...... 177 A.9LognormaltypeI...... 177 A.10Davisdistribution...... 178 A.11Weibulldistribution...... 178 A.12Fisklogisticdistribution...... 179 A.13GeneralizedBetadistributionI...... 180 A.14GeneralizedBetadistributionII...... 180 A.15Singh-Maddaladistribution...... 181 A.16LognormaltypeII...... 182 A.17DagummodeltypeI...... 182 A.18DagummodeltypeII...... 183 A.19DagummodeltypeIII...... 183 A.20Log-Gompertzdistribution...... 184 A.21Majumder-Chakravartydistribution...... 184
B Functional Forms for the Lorenz Curve 187 B.1ModelofKakwaniandPodder...... 187 B.2ModelofRascheetal...... 187 B.3ModelofGupta...... 187 B.4 Model of Villasenor and Arnold ...... 188 B.5ModelofBasmannetal...... 188
C Income Inequality Measures 189 C.1Coefficientofvariation...... 189 C.2Relativemeandeviation...... 189 C.3Relativemediandeviation...... 189 C.4 Variance of the logarithm of income ...... 190 C.5Bonferroniinequalitymeasure...... 190 C.6Hirschman’sindex...... 190 C.7Theilindexes...... 190 C.8 Eltet¨oandFrigyes’sinequalitymeasures...... 190 C.9Kakwaniinequalitymeasure...... 191 CONTENTS ix
C.10Basmann-Slottjeinequalitymeasure(WGM)...... 191 C.11Dalton’sinequalitymeasure...... 191 C.12Atkinson’sinequalitymeasure...... 192 C.13Kolm’sinequalitymeasure...... 192 C.14Generalizedentropyfamily...... 192
D Equations System for Robust Tests 193 x CONTENTS List of Figures
2.1 A typical representation of the Lorenz Curve ..... 11 2.2 The Gamma density as a model for PID ...... 23
3.1 Value of the mean IF around the true parameter θ .. 42 3.2 MLE and OBRE of the Gamma distribution on PSID data ...... 53 3.3 MLE and OBRE of the Dagum I model on PSID data 54 3.4 Gamma (OBRE) and Dagum (MLE) fit on PSID data 55 3.5 OBRE of the Gamma and Dagum I model on FES data 56 3.6 Efficiency of the OBRE for the Gamma model ..... 58 3.7 Efficiency of the OBRE for the Pareto model ..... 59 3.8 Sensitivity of the MLE and the OBRE to outliers for the Pareto model ...... 60 3.9 Sensitivity of MLE and OBRE to different propor- tions of contamination ...... 62 3.10 Bias of Theil index estimates when the data are con- taminated ...... 64
4.1 Weights given by the OBRE with 10% of information loss ...... 75 4.2 Weights given by the OBRE with 30% of information loss ...... 76
7.1 Behaviour of goodness-of-fit statistics with model con- tamination ...... 149
xi xii LIST OF FIGURES List of Tables
3.1 Some examples of occurrence and frequency of gross errors ...... 29 3.2 MLE and OBRE for the Gamma model 1 (non con- taminated) ...... 46 3.3 MLE and OBRE for the Gamma model 2 (1% of ‘bad’ contamination) ...... 47 3.4 MLE and OBRE for the Gamma model 3 (3% of con- tamination) ...... 47 3.5 MLE and OBRE for the Gamma model 4 (5% of con- tamination) ...... 47 3.6 MLE and OBRE for the Pareto model 1 (non con- taminated) ...... 49 3.7 MLE and OBRE for the Pareto model 2 (2% of con- tamination) ...... 49 3.8 MLE and OBRE for the Pareto model 3 (5% of con- tamination) ...... 49 3.9 MLE and OBRE for the Gamma and Dagum models on PSID data...... 52 3.10 MLE and OBRE for the Gamma and Dagum models on FES data...... 53
4.1 OBRE on non contaminated data, with the EMM algorithm and the CD estimation ...... 77 4.2 OBRE on contaminated data at 1%, with the EMM algorithm and the CD estimation ...... 78 4.3 OBRE on contaminated data at 3%, with the EMM algorithm and the CD estimation ...... 78 4.4 OBRE and MLE on non contaminated data, with the EMM algorithm ...... 79
xiii xiv LIST OF TABLES
4.5 OBRE and MLE on contaminated data at 1%, with the EMM algorithm ...... 79 4.6 OBRE and MLE on non contaminated data, with the EMM algorithm, when we ignore truncation ...... 80
5.1 MLE and OBRE (c = 5.0) for the Pareto model with grouped data ...... 100
6.1 Finite sample level of Cox and Atkinson statistics (Gamma against Lognormal) ...... 113 6.2 Finite sample level of Cox and Atkinson statistics (Exponential against Pareto) ...... 114 6.3 Finite sample level of Cox and Atkinson statistics (Pareto against Exponential) ...... 115 6.4 Finite sample level of LKR statistic (Gamma against Lognormal) ...... 116 6.5 Finite sample level of LKR statistic (Exponential against Pareto) ...... 116 6.6 Finite sample level of LKR statistic (Pareto against Exponential) ...... 116 6.7 Power (in %) of the Cox statistic (Exponential against Pareto) ...... 117 6.8 Power (in %) of the LKR statistic (Exponential against Pareto) ...... 118 6.9 Actual levels (in %) of Cox and Atkinson statistics under model contamination (ε =1%) (Gamma against Lognormal) ...... 124 6.10 Actual levels (in %) of Cox and Atkinson statistics under model contamination (ε =2%) (Gamma against Lognormal) ...... 125 6.11 Actual levels (in %) of the robust Atkinson statistic (c =2.0) with contamination (Pareto against Expo- nential) ...... 132 6.12 Actual levels (in %) of the Atkinson statistic with contamination (Pareto against Exponential) ...... 133
8.1 Empirical Theil index when a random proportion of data are multiplied by 10 ...... 163 8.2 Empirical Theil index when a random proportion of data are multiplied by 4 ...... 164 LIST OF TABLES xv
8.3 MLE and Theil index with and without data contam- ination ...... 165 8.4 OBRE and Theil index with and without data con- tamination ...... 166 xvi LIST OF TABLES Chapter 1
Introduction
A great number of philosophers, scientists, politicians, economists, writers, humanists, religious people through the ages have spent a lot of energy trying to understand the reasons of human inequalities. It is hard to believe there will be one day an answer. Our work however was motivated by this kind of question: why are there so great differences in people’s wealth? The present dissertation is not a philosophical essay, but a modest scientific contribution to the study of one of the several aspects of human wealth, the distribution of the income among the people. Moreover, its aim is even not to try to give some elements of an answer to the question, but to provide the economist with new statistical tools, developed especially for the matter. The distribution of income among people is also called the personal in- come distribution (PID). In economics, its study has several scopes. One of them is to understand how the total income in a given society is distributed among the people, or the households, or economic units, that is to deter- mine which economic and social factors influence the distribution of income. Another aim is to provide a measure which represents a judgement of the degree of inequality in the distribution of personal income, not only by itself but also when compared with the same measure computed on the basis of data from different populations. The space for the statistician is then wide. There are (a) stochastic models to build (for explaining how the PID is generated), (b) econometric models to define and estimate (for determining the factors influencing the generation and distribution of income), (c) statistical distributions to de- fine and estimate (for describing PID) and (d) inequality indexes to build and estimate (for measuring income inequality). In the present work, we
1 2 CHAPTER 1. INTRODUCTION concentrate on the two last aspects. The regularities displayed by observed PID over time and space pro- vide sufficient justification to describe them with the help of some statis- tical distribution functions. This provides not only a useful summary of the phenomenon, but also a technique to study the effects of alternative redistributive policies. In particular, the estimated distribution can serve as a basis for the computation of inequality measures. The phenomenon of income inequality has been a source of world-wide social upheaval. It has become a weapon in the hands of social reformers and a point of intellectual debate among academics. It is therefore necessary to invest energy in the development of appropriate statistical tools. The two main aspects of this debate, ethical evaluation and statistical measurement, are not always clearly distinguishable. In our work, however, the statistical tools we propose do not only apply to the study of PID or inequality measures but to a wider range of similar problems. On the other hand, our work was directed by the specificity of the economic problem, that is the developments we made were motivated by their usefulness in the study of PID and income inequality measures. Moreover, in the case of inequality measure we have a closer look at the relations between the economic and statistical properties. This is important because drawing inferences about economic inequality plays an important part in political debates about economic and social trends, and in a variety of applied studies in the field of welfare economics. However, the statistical basis on which the inferences are drawn is not always spelt out, and so the relationship between the numbers observed in a particular sample and the supposed underlying concept of inequality within the target population may be different from that suggested by superficial appearances. The statistical innovation we propose in this field is the use of robust methods. Robustness is a statistical concept which in a sense measures a “qualitative” aspect of any estimator, more precisely its stability under non standard conditions. It also conveys the idea that the theoretical models, may they be simple or very complex, are only able to reflect the behaviour of the majority of the data. That is, the robust statistical tool is built such that the influence of data that may not belong to the stated theoretical model is limited. It is well known that economic data in particular are far from being clean; this usually means that some observations may be present which in a sense have nothing to do with the majority of the data. These rogue data can be a result of the collection procedures. A simple example is the “decimal point error”: the coder inadvertently puts the decimal point in 3 the wrong place and thus multiplies an observation by a factor of 10. More subtle is the week-month confusion where data are supposedly collected on weekly income, but some respondents actually report income per month. If those observations that we also call contaminations or outliers have negligible impact upon the analysis, then obviously there is nothing to worry about. Unfortunately, in most cases they are extreme and therefore they can drive the value of the estimators by themselves. It is arguable that an outlier of this sort should be treated as exceptional and dropped from the sample. Such extreme values may of course be picking up true information; but very often in empirical work a case can be made for dropping “obviously” an in- appropriate or suspect observation that may be the result of recording error or other contamination. This type of ad hoc procedure is unsatisfactory, but if it is not done then the result of the analysis may be seriously biased. The robust methods we propose automatically take into account the presence of extreme observations during the estimation procedure. Indeed, these robust estimation procedures are built in such a way that they provide at the same time and in an optimal way, robust estimators and weights corresponding to each observation according to its ‘distance’ form the bulk of the majority of the data. There is therefore no need for a preliminary subjective data screening. The developments we make are organized in the following way. In chap- ter 2 we present the framework of PID and inequality measures. We first discuss very briefly the different theories explaining the generation and dis- tribution of income. We also present the Lorenz curve, a statistical tool for representing and comparing inequality in the PID, and review the most well known income inequality measures. The different parametric models pro- posed in the literature to describe the PID are then analyzed. A discussion about the statistical aspects involved in the analysis of PID concludes this chapter. In chapter 3, we compute robust estimators for PID models when the data are continuous and complete. We begin by presenting the robustness concepts we need to later develop the theory, in particular the influence function (IF). The IF is the main robustness tool we use in our develop- ments. It gives the influence of an infinitesimal amount of contamination introduced in the data on the value of any statistic (e.g. estimator, test statistic, inequality measure...) Since the case of continuous and complete data is simple, we use optimal robust bounded-influence estimators already developed in the literature. However, our contribution is the application of the general theory to the case of PID models, in particular with real data. In chapter 4 we widen the framework to censored data. To compute ro- 4 CHAPTER 1. INTRODUCTION bust estimators in this case we propose a generalization of the EM algorithm, namely the EMM algorithm. The former allows one to compute maximum likelihood estimators (MLE) when the data are censored. The EMM algo- rithm allows one to compute robust estimators in the same situation. After a presentation of the EM algorithm, we discuss its generalization. We then compare the EMM algorithm when the data are truncated with the classical approach which considers the conditional distribution. Since the data on PID are numerous, they are often presented in a grouped form. In chapter 5 we build robust estimators for this case. We first present a large class of classical estimators and compute their IF. Although we find that the IF is bounded, that is the influence of infinitesimal amounts of contamination on the value of the estimators is limited, we show that it can nevertheless be large. Therefore, after defining a more general class of estimators, we find estimators which are less influenced by contamination. We conclude chapter 5 with a simulation study in which we compare the classical and robust estimators for grouped data. At this stage, we will have provided the necessary tools to compute ro- bust estimators for a given parametric PID model. However, this work would be incomplete without the development of a robust procedure for choosing one PID model. This is actually the subject of chapter 6. We concentrate on tests between non-nested hypotheses. We begin by presenting the most well known test statistics, particularly the Cox-type statistics. We then highlight one of their disadvantages, namely the non accuracy of the approximation of their distribution by means of their asymptotic distribution, even when the sample sizes are relatively large. We also study their robustness properties, that is the influence on the asymptotic level of the test due to infinitesimal amounts of contamination. Finally, we propose a robust test statistic based on a robust parametric test, which avoids the problems raised previously, and show its performance when computed from contaminated samples. In order to be as complete as possible, we also study goodness-of-fit tests in chapter 7. Actually, we only show that the classical goodness-of-fit tests can be badly influenced by infinitesimal amounts of contamination among the data. In particular, we compute the influence of such contaminations on the asymptotic level of the test and present a numerical example. Finally, in chapter 8, we study more in details the case of inequality measures. Indeed, these measures can be thought of as estimators of the true underlying inequality which depends on the distribution of income. Therefore, we can compute their IF. However, the aim is not the same as with PID models, in that this time we want to relate the behaviour of the IF to the economic properties the different inequality measures fulfil. 5
We find that in some special cases some inequality measures have a bounded IF, but unfortunately we must conclude that in the more realistic cases, the IF of a very large class of inequality measures is unbounded. We conclude this chapter with the proposition of robust inequality measures via robust estimates of parametric PID models and present a simulation study. Finally, chapter 9 concludes the present work. 6 CHAPTER 1. INTRODUCTION Chapter 2
Income Distribution and Inequality Measures
2.1 Introduction
The topic of the income distribution can be stated in the general economic framework as did Harold Lydall: ‘The essential problem of economics is how to increase economic welfare. In a broad sense, this problem can be di- vided into two parts: how to increase total output from given resources; and how to distribute the resulting goods and services in such way as to give the community the most benefit from them. These two aspects are sometimes described as the problem of ‘production’ and the problem of ‘distribution’, respectively. The two parts are not, of course, independent; and many of the most difficult questions arise out of the interdependence of production and distribution. Nevertheless, it is possible to identify some influences which bear primarily on the side of production and others which primarily affect distribution. No progress could be made in the discussion unless we ab- stracted, at least temporarily, from some of the considerations which might eventually be shown to be relevant to one or other side’ (Lydall 1968). Research on income distribution has followed two main directions. The first deals with the factor price formation and the corresponding factor shares, i.e. the distribution of income among the factors of production. This approach was initiated by Ricardo (1819), and further developed by several schools of economic thought. The second approach deals with the distribution of a mass of income among the members of a set of economic units (family, household, individual), considering either the total income of each economic unit or its desegregation by source of income, such as wages
7 8 CHAPTER 2. INCOME DISTRIBUTION AND INEQUALITY and salaries, property income, self-employment income, transfers, etc. The related topic of the latter approach is commonly called the size distribu- tion of income or personal income distribution (PID). This chapter is only concerned with this topic. According to Slottje (1989), the theory of PID can be divided into three major categories. Models explaining the generation of income distributions are one of the important aspect of the theory. Why are incomes in a given society at a given time different? What are the determinants influencing the particular aspects of the income distribution? These are the main questions that researchers interested in the generation of income have tried to answer. In section 2.2 we briefly present the different theories. Another important aspect regards the measures of inequality given the income distribution. Since it can be argued that the principal indicator of social welfare is given by the income level and the PID, it is important to develop tools to compare different societies on a social welfare basis. This is the role of the Lorenz curve and inequality measures, developed respectively in sections 2.3 and 2.4. Finally, an alternative way of studying PID is to describe it by means of statistical tools. As will be argued below, this approach has a lot of advantages. Moreover, it can serve as a basis for the study of the effect of policies on the PID. This is why a number of authors have concentrated their research on modelling the PID by means of parametric models. This approach is developed in section 2.5 and serves as basis of the research presented in the following chapters.
2.2 The generation and distribution of income
The various theories that have been proposed to explain the distribution of income among individuals have emerged from two main schools of thought. The first may be called the stochastic theory of distribution and is rep- resented by such authors as Gibrat (1931), Champernowne (1937, 1953), Aitchison and Brown (1954), Rutherford (1955), Mandelbrot (1960, 1961) and Steindl (1965). These authors explain the generation of income with the help of stochastic processes, that is the actual form of the distribution is the stationary state of a stochastic process. For example, Gibrat (1931) formulated his theory based on the law of proportionate effect, and pro- posed a model which generates a positively skewed distribution. Gibrat’s model is a first-order Markov chain model. The variables are expressed in their logarithms with the log of income dependent on the log of income 2.2. THE GENERATION AND DISTRIBUTION OF INCOME 9 lagged a period and random events. The theory shows that, as time goes by, the distribution of income approaches the distribution of the random disturbance, which tends toward normality. Hence, he then proposed the lognormal distribution as a suitable income distribution. In 1937, Champernowne based his stochastic theory on Markov chains which generated a Pareto distribution. Later, in 1953, he showed that under suitable conditions, his stochastic process tends toward a unique equilibrium distribution dependent upon the transition matrix but not on the initial dis- tribution. Typically, his models specify transitional probabilities, that is the probabilities that units belonging to a certain class will move up or down to another class in the following period of time. Then, the income distribution at a certain time is linked to the income distribution one period before by a transition equation through transition probabilities. Champernowne made some quite strong assumptions which considerabilly simplified his models. He supposed firstly that no income unit moves up by more than one income class in a period of time, and secondly that the transitional probabilities are constant with respect to time and independent of the income level. He then proposed different models able to generate the Pareto distribution, a two-tailed distribution obeying the Pareto law (see section 2.5) and other distributions, by relaxing his initial hypotheses. Finally, similar variations of these stochastic models were compiled by the authors mentioned above. We should also say that one main criticism about the simple stochastic models is that the process requires an incredibly long period of time to attain an equilibrium or a stationary state distribution (Shorrocks 1973). The second school of thought, which may be called the socioeconomic school, seeks the explanation of PID by means of economic and institutional factors, such as sex, age, occupation, education, geographical differences, and the distribution of wealth. Three groups of authors belong to this school. The first follows the human capital approach, based on the hypothesis of lifetime income maximization. The authors concentrate their attention on the supply side of labour, which is the result of the maximization of the personal utility function. Typically, they specify a utility function in which variables such as the general price level, the salary, education, family size, etc, are entered. This approach was initiated by Mincer (1958) and Becker (1962, 1967) and subsequently developed by Chiswick (1968, 1971, 1974). The second group of authors, who mainly link the PID to education levels, is referred to as the education planning school by Tinbergen (1975). It is represented by such authors as Bowles (1969), Dougherty (1971, 1972) and Psacharopulos and Hinchliffe (1972). They concentrate their attention on 10 CHAPTER 2. INCOME DISTRIBUTION AND INEQUALITY the demand side, deriving various types of labour from production functions. Hence, in this case a production function is specified, and it includes not only the classical production factors such as land, labour and capital, but also technical development, different types of labour often being measured bythedegreeofschooling. Finally, the third group of authors is called the supply and demand school. The major contribution to this approach is represented by Tinbergen (1975), who considers PID a result of the supply and demand of different kinds of labour.
2.3 The Lorenz curve and analysis of income in- equality
The Lorenz curve, widely used to represent and analyze the size distribution of income and wealth, is defined as the relationship between the cumulative proportion of income units and the cumulative proportion of income received when units are arranged in ascending order of their income. Lorenz (1905) proposed this curve in order to compare and analyse in a non-parametric way, inequalities of wealth in a country during different epochs, or in different countries during the same epoch. The curve has been used principally as a convenient graphical device to represent size distributions of income and wealth. It is a very useful tool not only for the nonparametric description of the observed PID but also for the comparison in terms of social welfare between income distribution states. In the first subsection we present the technical aspect of the Lorenz curve and in the second subsection we explain the link between the Lorenz curve and the concept of social welfare.
2.3.1 Definition and construction The graph of the curve is represented in a unit square (see figure 2.1). The straight line joining the points (0,0) and (1,1) is called the egalitarian line, because along this line 10% of income receivers get 10% of income, 20%, 20% of income, and so on. Typically, actual distributions lie below the egalitarian line: the greater the convexity, the greater the degree of inequality. More formally, the Lorenz curve can be defined by means of the prob- ability distribution function associated to the income variable. Let F be x this function, and define p = F (x)= 0 dF (t), where p can be interpreted as the proportion of units having an income less than or equal to x.Pietra (1915) (see also Gastwirth (1971)) presented a definition of the Lorenz curve 2.3. THE LORENZ CURVE AND ANALYSIS OF INEQUALITY 11 Proportion of income received 0.0 0.2 0.4 0.6 0.8 1.0
0.0 0.2 0.4 0.6 0.8 1.0 Proportion of income receivers
Figure 2.1: A typical representation of the Lorenz Curve in terms of the inverse of the cumulative distribution function given by F −1(t)=inf{x : F (x) ≥ t}. The Lorenz curve is then written as 1 p L(p)= F −1(t)dt (0 ≤ p ≤ 1) (2.1) µ 0 where the mean µ is given by µ = xdF (x). (see also Kakwani 1980 for another definition). The Lorenz curve can also be interpreted as a tool to measure the con- centration of the income variable. It represents a point comparison measure as regard to a synthetic comparison measure given by inequality measures (see Zenga 1989). The Lorenz curve satisfies the following conditions (Kakwani 1980): 1. if p =0thenL(p)=0
2. if p =1thenL(p)=1 x ≥ 1 3. L (p)= µ 0andL(p) = µf(x) > 0 12 CHAPTER 2. INCOME DISTRIBUTION AND INEQUALITY
4. L(p) ≤ p where f(x) is the density function associated to F (x). It is possible to use the Lorenz curve as a parametric tool. Two ap- proaches have been considered. The more obvious is simply to choose a parametric distribution for the income variable, and derive analytically the corresponding Lorenz curve. (see Dagum 1980a for the derivation of the Lorenz curve corresponding to some well known parametric models for PID). The second approach was formulated by Kakwani and Podder (1973, 1976). They specified the functional form of the Lorenz curve directly (for the gen- eral case) as L(p)=pαe−β(1−p) (2.2) where 0 ≤ p ≤ 1, a curve which satisfies the properties described above. Other authors proposed new parametric families as models for the Lorenz curve. The list includes models due to Rasche et al. (1980), Gupta (1984), Villase˜nor and Arnold (1989), and more recently Basmann et al. (1990). We present these functions in appendix B. As we see in the next section, the Lorenz curve is a very useful tool for the analysis of inequality. It can be used (a) to measure the income inequality (see Gini 1910), (b) to perform a partial ordering of social welfare states (see Atkinson AB 1970), (c) to study the effect of income taxes (see Latham 1988), (d) to derive goodness-of-fit tests for exponential distribution functions, as well as upper and lower bounds for the Gini ratio (see Gastwirth 1972 and Gail and Gastwirth 1978b). Finally, it should be stressed that recently another measure of point con- centration has been proposed in the literature: the Z(p) concentration curve (see Zenga 1984). It is based on the comparison of the inverse cumulative −1 distribution function F (p) with the inverse first incomplete moment func- −1 1 x tion F1 (p), where F1(x)= µ 0 tdF (t). The Z(p) concentration curve is given by −1 − F (p) Z(p)=1 −1 (2.3) F1 (p) According to Zenga (1989), its advantage is that it has not a ‘forced be- haviour’ that is it does not fulfil property 3 for the Lorenz curve. Indeed, this property implies that the difference function p − L(p) assumes its max- imum value for p = F (µ) and there is no reason why the relative inequality must be greater for the middle classes than for the richest or the poorest classes. (For examples of application see Dancelli 1989 and Salvaterra 1989). 2.3. THE LORENZ CURVE AND ANALYSIS OF INEQUALITY 13
2.3.2 An ordering tool
As we have seen, the Lorenz curve displays the deviation of each individual income from perfect equality, and hence it captures the essence of inequality. The nearer the Lorenz curve is to the egalitarian line, the more equal the distribution of income will be. Consequently, the Lorenz curve can be used as a criterion for ranking PID. The income ranking uses the concept of “Lorenz-domination”. An in- come profile is said to Lorenz dominate another income profile in the weak sense if the Lorenz curve of the former lies nowhere below the Lorenz curve of the latter. We have strict Lorenz domination if we add the restriction that at some places the Lorenz curve is above. However, this ordering is a quasi-ordering, since if the Lorenz curves intersect, neither of the income profiles is said to be preferred. There is also a link between the Lorenz curve ranking and social welfare. Atkinson AB (1970) proved a theorem which implies that if the Lorenz curve corresponding to one PID is above the Lorenz curve of another PID (and both have the same mean), then the social welfare function (or social evaluation function, see Chakravarty 1990, for a definition) is greater for the first population, regardless of the form of the utility function except that it is increasing and concave. Later, Dasgupta, Sen and Starret (1973) and Rothschild and Stiglitz (1973) generalized Atkinson’s theorem by weakening the hypotheses. (For a relation between social welfare functions and PID, see also Dagum 1990). In practice, international comparisons involve usually different popula- tion sizes and different means, as do intertemporal comparisons for the same country. Therefore, Shorrocks (1983) generalized the Lorenz curve by scal- ing it up by the mean income. He also proved that an unambiguous ranking of income profiles (providing some suitable conditions on the social welfare function) can be obtained if and only if the generalized Lorenz curves do not intersect. This is likely to be true in many important practical situ- ations, since differences between Lorenz curves tend to be relatively small compared with variations in mean incomes. However, the welfare judgement captured by the generalized Lorenz domination may come into conflict with the social desire for more equally distributed income. For example, if we in- crease the income of the richest individual, keeping all other incomes fixed, the total income increases, as does inequality. It is nevertheless possible to have a PID ranking which avoids this kind of conflict. As Shorrocks (1983) showed, if we assume that an improvement of each income by a constant improves the social welfare function, then the comparison of the generalized 14 CHAPTER 2. INCOME DISTRIBUTION AND INEQUALITY
Lorenz curves discounted for each individual by the mean income, gives a quasi-ordering of the income profiles.
2.4 Income inequality measures
While the Lorenz criterion provides only a quasi-ordering of income profiles, an alternative statistic that completely orders the set of all income profiles is an inequality index. An inequality index is a scalar representation of interpersonal income differences within a given population, and hence can be considered as a measure of inequality (if inequality is defined in terms of income). One of the first inequality indexes was developed by Corrado Gini. It is still one of the most widely used to analyse the size distribution of income and wealth. Gini (1912) specified the Gini mean difference which is by definition n n ∆= |xj − xi|/n(n − 1) (2.4) j=1 i=1 where 0 ≤ ∆ ≤ 2µ. The last expression can also be written ∆= |x − y|dF (x)dF (y) (2.5) where X and Y are identically and independently distributed variables. When all the incomes are equal, then ∆ = 0 and when the last income is equal to the total income, then ∆ = 2µ. Since ∆ is a monotonic increasing function of the degree of income inequality, Gini defined
IG =∆/2µ (2.6)
0 ≤ IG ≤ 1, as an income inequality measure, known as the Gini ratio or Gini index. Gini also proved that his index is equal to twice the area between the egalitarian line and the Lorenz curve L(p) and hence can be given by