S
Statistics and Economics I.; Fisher, R. A.; Frequentist approach to infer- ence; Galton, F.; Gauss, C. F.; Gauss–Markov Aris Spanos theorem; Generalized method of moments; Graphical techniques; Graunt, J.; Haavelmo, T.; Heckman, J. J.; Hume, D.; Identification; Index numbers; Induction; Inverse probability; Abstract Jevons, W. S.; King, G.; Koopmans, T. C.; Some statisticians and economists might find it Laplace, P.-S.; Law of large numbers; Least surprising to learn that statistics and economics squares; Legendre, A.-M.; Life tables; Mar- share common roots going back to ‘Political ginal revolution; Mathematics and economics; Arithmetic’ in the mid-17th century. The pri- Mills, F. C.; Mortality; Neyman, J.; Nonpara- mary objective of this article is to revisit the metric methods; Pearson, K.; Petty, W.; common roots and trace the parallel develop- Playfair, W.; Political arithmetic; Political ment of both disciplines up to and including economy; Probability; Quetelet, A.; Reliability the 20th century, and to attempt to signpost of inference; Royal Statistical Society; certain methodological lessons that were Semiparametric methods; Simultaneous equa- missed along the way to the detriment of both tions models; Specification; Spurious regres- disciplines. The emphasis is primarily on sions; Statistical adequacy; Statistical methodological developments, with less atten- description; Statistical inference; Statistical tion paid to institutional developments. models; Statistical Society of London; Statis- tics and economics; Stochastic processes; Structural models; Unit roots; Walras, L.; Keywords Yule, G. U. ARIMA models; Bayes, T.; Bernoulli, J.; Bowley, A. L.; Central limit theorems; Cointegration; Convergence in distribution; JEL Classifications Cournot, A. A.; Cowles Commission; Dave- B0 nant, C.; Econometric Society; Edgeworth, F. Y.; Error-correction models; Farr, W.; Fisher, The close interrelationship between economics and statistics, going back to their common roots This chapter was originally published in The New Palgrave in ‘Political Arithmetic’, played a crucial role in Dictionary of Economics, 2nd edition, 2008. Edited by availing the development of both disciplines Steven N. Durlauf and Lawrence E. Blume
# The Author(s) 2008 Palgrave Macmillan (ed.), The New Palgrave Dictionary of Economics, DOI 10.1057/978-1-349-95121-5_1935-1 2 Statistics and Economics during their practical knowledge (pre-academic) Mortality, published in 1662 (see Hald 1990; period. Political economy was first separated from Stigler 1986), the first systematic study of demo- political arithmetic and became an academic graphic data on birth and death records in English discipline – the first social science – at the end of cities. Graunt detected surprising regularities the 18th century, partly as a result of political stretching back over several decades in a number arithmetic losing credibility. Statistics emerged of numerical aggregates, such as the male/female as a ‘cleansed’ version of political arithmetic, ratio, fertility rates, death rates by age and loca- focusing on the collection and tabulation of data, tion, infant mortality rates, incidence of new dis- and continued to develop within different disci- eases and epidemics, and so on. On the basis of plines including political economy, astronomy, these apparent regularities, Graunt proceeded to geodesy, demography, medicine and biology; draw certain tentative inferences and discuss their however, it did not become a separate academic implications for important public policy issues. discipline until the early 1900s. Hald summarized the impact of this path-breaking During the 19th century the development of book as follows: statistics was institutionally nurtured and actively Graunt’s book had immense influence. Bills of mor- supported by the more empirically oriented polit- tality similar to the London bills were introduced in ical economists such as Thomas Malthus who other cities, for example, Paris in 1667. Graunt’s helped to create section F of the Royal Society, methods of statistical analysis were adopted by ‘ ’ Petty, King and Davenant in England; Vauban in called Economic Science and Statistics , and sub- France; by Struyck in the Netherlands; and some- sequently to found the Statistical Society of Lon- what later by Sussmilch in Germany. Ultimately, don. The teaching of statistics was introduced into these endeavours led to the establishment of gov- fi ’ the university curriculum in the 1890s, primarily ernmental statistical of ces. Graunt s investigation on the stability of the sex ratio was continued by in economics departments (see Walker 1929). Arthuthnott and Nicolas Bernoulli. (Hald 1990, The close relationship between economics and p. 103) statistics was strained in the first half of the 20th Graunt’s book had close affinities in both con- century, as the descriptive statistics tradition, tent and objectives to several works by his close associated with Karl Pearson, was being friend William Petty (1623–87) on ‘Political transformed into modern (frequentist) statistical Arithmetick’ published during the 1670s and inference in the hands of Fisher (1922, 1925, 1680s; Graunt and Petty are considered joint foun- 1935a, 1956) and Neyman and Pearson (1933), ders of the ‘political arithmetic’ tradition (Redman and Neyman (1935, 1950, 1952). During the sec- 1997). The fact that Graunt had no academic ond half of the 20th century this relationship credentials and published only the single book eventually settled into a form of uneasy coexis- led to some speculation in the 1690s, which has tence. At the dawn of the 21st century there is a persisted to this day, that Petty was the real author need to bring the two disciplines closer together of The Bills of Mortality. The current prevailing by implementing certain methodological lessons view (see Greenwood 1948; Kreager 1988) is that overlooked during the development of modern Petty’s potential influence on Graunt’s book is statistics. marginal at best. Stone aptly summarizes this view as follows: The 17th Century: Political Arithmetic, Graunt was the author of the book associated with the Promising Beginnings his name. More than likely, he discussed it with his friend; Petty may have encouraged him to write it, contributed certain passages, helped obtaining the If one defines statistics broadly as ‘the subject Bills for the county parish...at Romsey, the church matter of collecting, displaying and analysing in which Petty’s baptism is recorded and in which data’, the roots of the subject are traditionally he is buried; he may even have suggested the means ’ – of interpolating the numbers of survivors between traced back to John Graunt s (1620 74) Natural childhood and old age. But all this does not amount and Political Observations upon the Bills of Statistics and Economics 3
to joint let alone sole authorship. (Stone 1997, the government’–and was instrumental in the p. 224) development of both statistics and economics Hull (1899), one of Petty’s earliest biographers (see Redman 1997, p. 143). The timing of this and publisher of his works, made a strong case emphasis on quantitative measurement and the against Petty being the author of the ‘Bills of collecting of data was not coincidental. The Mortality’ by comparing his methodological empiricist turn pioneered by Francis Bacon approach to that of Graunt: (1561–1626) had a crucial impact on intellectual circles such as the London Philosophical Society Graunt exhibits a patience in investigation, a care in checking his results in every possible way, a reserve and the British Association, with which Graunt in making inferences, and a caution about mistaking and Petty were associated – these circles included calculation for enumeration, which do not charac- Robert Boyle, John Wallis, John Wilkins, Samuel ’ terize Petty s work to a like degree. Hartlib, Christopher Wren and Isaac Newton. As The spirit of their work is often different when no question of calculation enters. Petty sometimes summarized by Letwin: appears to be seeking figures that will support a The scientific method erected by Bacon rested on conclusion which he has already reached; Graunt two main pillars: natural history, that is, the collec- uses his numerical data as a basis for conclusions, tion of all possible facts about nature, and induction, declining to go beyond them. He is thus a more a careful logical movement from those facts of careful statistician than Petty, but he is not an econ- nature to the laws of nature. (Letwin 1965, p. 131) omist at all. (Hull 1899, pp. xlix and lxxv) Graunt and Petty were also influenced by phi- Both Graunt and Petty used limited data to losopher John Locke (1632–1704), through per- draw conclusions and make predictions about sonal contact. Locke was the founder the British the broader populations, exposing themselves to empiricist tradition, which continued with George severe criticisms as to the appropriateness and Berkeley (1685–1753) and David Hume reliability of such inferences. For instance, using (1711–76). Indeed, all three philosophers wrote data on christenings and burials in a single county extensively on political economy as it relates to parish in London, they would conjure up esti- empirical economic phenomena, and Locke is mates of the population of London (which credited with the first use of the most important included more than 130 parishes), and then on example of analytical thinking in economics, the the basis of those estimates, and certain contest- demand-supply reasoning in determining price able assumptions concerning mortality and fertil- (see Routh 1975). ity rates, proceed to project estimates of the Graunt’s and Petty’s successors in the political population of the whole of England. The essential arithmetic tradition, Gregory King (1648–1712) difference between their approaches is that Graunt and Charles Davenant (1656–1714) continued to put enough emphasis on discussing the possible emphasize the importance of collecting data as the sources of error in the collection and compilation only objective way to frame and assess sound of his data, as well as in his assumptions, enabling economic policies. Their efforts extended the the reader to assess the reliability (at least qualita- pioneering results of Grant and Petty and provided tively) of his inferences. Petty, in contrast, was an improved basis for some of the original pre- more prone to err on the side of political expedi- dictions (such as the population of England), but ency by drawing inferences that would appeal to they did not provide any new methodological the political powers of his time (see Stone 1997). insights into the analysis of the statistical regular- Graunt and Petty considered statistical analysis ities originally enunciated by Graunt. The a way to draw inductive inferences from observa- enhanced data collection led to discussions of tional data, analogous to performing experiments how certain economic variables should be mea- in the physical sciences (see Hull 1899, p. lxv). sured over time, and a new literature on index Political arithmetic stressed the importance of a numbers was pioneered by William Fleetwood –‘ new method of quantitative measurement the (1656–1723). The roots of national income fi art of reasoning by gures upon things relating to accounting, which eventually led to the current 4 Statistics and Economics standardized macro-data time series, can be traced of particular men, to the consideration of others’ back to the efforts of these early pioneers in polit- (Hull 1899, p. 244). ical arithmetic (see Stone 1997). English political institutions, including the According to Hald: House of Commons, the House of Lords and the His [Graunt’s] life table was given a probabilistic monarchy, took full advantage of the newly interpretation by the brothers Huygens; improved established methods of political arithmetic and life tables were constructed by de Witt in the Neth- encouraged, as well as financed, the collection of erlands and by Halley in England and used for the new data as needed to consider specific questions computation of life annuities. The life table became a basic tool in medical statistics, demography, and of policy (see Hoppit 1996). Putting these actuarial science. (Hald 1990, p. 1034) methods to the (almost exclusive) service of pol- icy framing by politicians carried with it a crucial The improved life tables, with proper probabi- danger for major abuse. An inherent problem for listic underpinnings, were to break away from the social scientists in general has always been to main political arithmetic and become part of a distinguish between inferences relying on sound statistical/probabilistic tradition that would scientific considerations and those motivated by develop independently in Europe in the next two political or social preferences and leanings. centuries, giving rise to a new literature on life The combination of (a) the absence of sound tables and insurance mathematics (see Hald probabilistic foundations that would enable one to 1990). distinguish between real regularities and artefacts, A methodological digression. This was a cru- and (b) the inbuilt motivation to abuse data in an cial methodological development for data analysis attempt to make a case for one’s favourite policies, fi because it was the rst attempt to provide proba- led inevitably to extravagant and unwarranted bilistic underpinnings to Graunt’s statistical regu- speculations, predictions and claims. These indul- larities. Unfortunately, the introduction of gences eventually resulted in the methods of polit- probability in the life tables was of limited scope ical arithmetic losing credibility. The extent of the and had no impact on the broader development of damage was such that Greenwood, in reviewing political arithmetic, which was growing during ‘Medical Statistics from Graunt to Farr’, argued: the 18th century without any concerns for any probabilistic underpinnings. Without such under- One may fairly say on the evidence here summa- rized that the eighteenth-century political arithme- pinnings, however, one cannot distinguish ticians of England made no advance whatever upon between real regularities and artifacts. the position reached by Graunt, Petty and King. They were second-rate imitators of men of genius. (Greenwood 1948, p. 49)
The 18th Century: The Demise An important component of the evidence pro- of Political Arithmetic vided by Greenwood was the ‘population contro- versy’, which often involved idle speculation in At the dawn of the 18th century political arith- predicting the population of England. This spec- metic promised a way to provide an objective ulation began with Graunt with a lot of cautionary basis for more reliable framing and assessment notes attached, but it continued into the 18th cen- of economic and social policies. As described by tury with much less concern about the possible Petty, the method of political arithmetic replaces errors that could vitiate such inferences. The dis- the use of ‘comparative superlative words, and cussions were from two opposing schools of intellectual arguments’ with ‘number, weight, or thought: the pessimists, who claimed that the pop- measure; to use only arguments of sense; and to ulation was decreasing, and the optimists, who fl consider only such causes as have visible founda- argued the opposite; their con icting arguments tions in nature, leaving those that depend on the were based on the same bills of mortality popu- mutable minds, opinions, appetites, and passions larized by Graunt. Neither side had reliable evi- dence for its predictions because the data provided Statistics and Economics 5 no sound basis for reliable inference. All predic- At this point it should be emphasized that the tions involved highly conjectural assumptions of terms induction and deduction had different con- fertility and mortality rates, the average number of notations during the 18th century, and care should people living in each house, and so on. The acri- be taken when interpreting some of the claims of monious arguments between the two sides that period (see Redman 1997). Despite the criti- revealed the purely speculative foundations of all cisms by leading political economists of the such claims and contributed significantly to the inductive method, broadly understood as using eventual demise of political economy (see Glass the data as a basis of inference, the tradition of 1973, for a detailed review). collecting, compiling and charting data as well as The above quotation from Greenwood might drawing inferences concerning broad tendencies be considered today as an exaggeration, but it on such a basis, continued to grow throughout the describes accurately the prevailing perception at 18th and 19th centuries, and was influential in the the end of the 18th century. An unfortunate con- development of political economy. Some political sequence of disparaging the methods of political economists such as Thomas Malthus arithmetic was the widely held interpretation that (1766–1834) and John McCulloch (1789–1864) it provided decisive evidence for the ineffective- continued to rely on the British empiricist tradi- ness of Bacon’s inductive method. Indeed, one can tion of using data as a basis of inference, but were argue that this cause was instrumental in the at great pains to separate themselves from the 18th timing of the emergence of political economy at century’s discredited political arithmetic tradition. the end of the 18th century, as the first social Indeed, the leading political economists of that science to break away from political period, including Adam Smith and David Ricardo arithmetic. Adam Smith (1723–90) declared: ‘I (1772–1823), used historical data extensively in have no great faith in political arithmetick’ support of their theories, conclusions and policy (1776, p. 534). James Steuart (1712–80) was recommendations developed by deductive argu- even more critical: ments (see Backhouse 2002a). Instead of appealing to political arithmetic as a At the close of the 18th century, the only bright check on the conclusions of political economy, it methodological advance in the withering tradition would often be more reasonable to have recourse to of political arithmetic was provided by William political economy as a check on the extravagances Playfair’s (1759–1823) The Commercial and of political arithmetic. (quoted by Redman 1997) Political Atlas, published in 1786. This book ele- During the late 18th century, political economy vated the analysis of tabulated data to a more defined itself by contrasting its methods with sophisticated level by introducing the power of those of political arithmetic, arguing that it did graphical techniques in displaying and analysing not rely only on tables and figures in conjunction data. Playfair introduced several innovating tech- with idle speculation, but was concerned with the niques such as hachure, shading, colour coding, theoretical issues, causes and explanations under- and grids with major and minor divisions of both lying the process that generated such data. Politi- axes to render the statistical regularities in the data cal economists contrasted their primarily even more transparent. In a certain sense, the deductive methods to the discredited inductive graphical techniques introduced by Playfair methods utilized by political arithmeticians. As made certain empirical regularities more transpar- argued by Hilts: ent and rendered certain conclusions easier to Of importance to the history of statistics in England draw. The graphs in this book represent economic was the fact that the political economists were fully time series, measuring primarily English trade conscious of their deductive proclivities and saw (imports/exports) with other countries during the political economy as methodologically distinct 18th century. Indeed, Playfair’s writings were from the inductive science of statistics. (Hilts fi 1978, p. 23) mainly on political economy; his rst book, Reg- ulation of the Interest of Money, was published in 1785 (see Harrison 2004). 6 Statistics and Economics
In what follows the developments in probabil- these probabilistic underpinnings was not made ity theory will be discussed only when they per- explicit, however, until the early 1920s (see sec- tain to the probabilistic underpinnings of tion “The Fisher–Neyman–Pearson Approach”). statistical analysis; for a more detailed and bal- Indeed, the role of the IID assumptions is often anced discussion see Hald (1990, 1998, 2007). misunderstood to this day. For instance, Hilts The probabilistic underpinnings literature on argues: probability developed independently from politi- Mathematically the theorem stated [LLN], in very cal arithmetic in England, and there was no inter- simplified language, that an event which occurs action between the two until the mid-19th century. with a certain probability, appears with a frequency Viewed from today’s vantage point, the pri- approaching that probability as the number of observations is increased. (Hilts 1973, p. 209) mary problem with Grant’s inferences based on data pertaining to a single parish in London, was Strictly speaking, the LLN says nothing of a how ‘representative’ the data were for the popu- sort, because, unless the trials are IID, the result lation of London as a whole, which included more does not follow. This insight was clearly articu- than 130 other parishes. This problem was for- lated by Uspensky: malized much later in terms of whether the data It should, however, be borne in mind that little, if can be realistically viewed as a ‘random sample’ any, value can be attached to the practical applica- from the population of London. Defining what a tions of Bernoulli’s theorem, unless the conditions random sample is, however, requires probability presupposed in this theorem are at least approxi- mately fulfilled: independence of trials and constant theory, which was not adequately understood until probability of an event for every trial. (Uspensky the late 19th century (see Peirce 1878). 1937, p. 104) Jacob Bernoulli. The first important result Laplace. The first successful attempt to inte- relating to the probabilistic underpinnings of sta- grate data analysis with the probabilistic under- tistical regularities was Jacob Bernoulli’s (1654–1705) Law of Large Numbers (LLN), pinnings should be credited to Pierre-Simon Laplace (1749–1827), a famous French mathema- published posthumously in 1713 by his nephew tician and astronomer, and Thomas Bayes Nicolas Bernoulli (1687–1759). Bernoulli’s theo- (1702–61), a British mathematician and Presbyte- rem showed that under certain circumstances, the rian minister. In papers published in 1764 and relative frequency ofX the occurrence of a certain 1 n m 1765 (see Hald 2007) respectively, they proposed event A, say X ¼ Xi ¼ (m occurrences n i¼1 n the first inverse probability (posterior-based) of {X = 1} and n m occurrences of {X = 0} in i i interval for p for the form p of the form ‘p is in n trials) provides an estimate of the probability ðÞx ½ ejx ’ for some e > 0, by assuming a prior ℙ(A) p whose accuracy increases as n goes to distribution p U(0,1) that is p is a uniformly infinity. In modern terminology X constitutes a distributed random variable (see Hacking 1975). consistent estimator of p. Bernoulli went on to use This gave rise to the inverse probability approach this result in an attempt to provide an interval (known today as the Bayesian approach) to statis- estimator of the form: p is in X e for some tical inference, which was to dominate statistical e > 0, but his estimator was rather crude (see induction until the 1920s, before the Fisherian Hald 1990). revolution. In 1812 Laplace (see Hald 2007) also A methodological digression. The circum- provided the first frequentist interval estimator of stances assumed by Bernoulli were specified in p of the form p is in ðÞX e for some e > 0. The terms of the trials being independent and identi- difference between this result and a similar result cally distributed (IID). It turned out that the same by Bernoulli is that Laplace used a more accurate probabilistic assumption defines the notion of a approximation based on convergence in distribu- random sample mentioned in relation to the prob- tion as the basis of his result; the first central limit abilistic underpinnings concerning Graunt’s sta- theorem supplying an asymptotic approximation tistical regularities, though the two literatures were developing independently. The role of Statistics and Economics 7 of the binomial by the Normal distribution (see The Statistical Society of London has been Hald 1990). established for the purposes of procuring, arrang- ing, and publishing Facts calculated to illustrate the condition and prospects of the Society. (Journal of the Statistical Society of London 1834,p.1) The 19th Century: Political Economy The seal on the cover of the Journal of the and Statistics Statistical Society of London (JSSL) was a wheatsheaf around which was written ‘aliis The demise of political arithmetic by the early exterendum’ (‘to be threshed by others’). That is, 19th century was instrumental in contributing to the aim of the society is to painstakingly gather the the creation of two separate fields: political econ- facts and let others draw whatever conclusions omy and statistics. Political economy was created might be warranted: to provide more reasoned explanations for the The Statistical Society will consider it to be the first causes and contributing factors giving rise to eco- and most essential rule of its conduct to exclude nomic phenomena. Statistics was demarcated by carefully all Opinions from its transactions and the narrowing down of the scope of political arith- publications – to confine its attention rigorously to – metic in an attempt to cleanse it from the facts and, as far as it may be found possible, to facts which can be stated numerically and arranged unwarranted speculation that undermined its cred- in tables. (JSSL 1834, pp. 1–2) ibility during the 18th century. Of particular interest is the way the statement of the aims of the society separated statistics from The Statistical Society of London political economy: Given their common roots, the first institution fi The Science of Statistics differs from Political created to foster the development of the eld of Economy because although it has the same end in statistics, the Statistical Society of London, was view, it does not discuss causes, nor reason upon created in 1834 with the active participation of probable effects; it seeks only to collect, arrange, several political economists, including Thomas and compare, that class of facts which alone can – form the basis of correct conclusions with respect to Malthus and Richard Jones (1790 1855), who, social and political government. (JSSL 1834,p.2) together with John Drinkwater (1801–51), Henry Hallam (1777–1859) and Charles Babbage The overwhelming majority of the published (1791–1871), were to found the Society after papers in the JSSL were in the political arithmetic some prompting from Quetelet, who visited tradition of Graunt, relating primarily to eco- England in 1833. Other political economists who nomic, medical and demographic data, with two played very active roles in the early stages of the major improvements: ameliorated methods for the Society included Thomas Tooke (1774–1858), collection and tabulation of data giving rise to John R. McCulloch (1789–1864) and Nassau more accurate and reliable data, and more careful Senior (1790–1864). The first council included reasoning being used to yield less questionable notable personalities such as Earl FitzWilliam inferences. This is particularly true for data relat- (1748–1833), William Whewell (1794–1866), ing to life tables and mortality rates associated G.R. Porter (1792–1852) and Samuel Jones- with epidemics. The best examples of such an Loyd (1796–1883). output are given by William Farr (1807–83), In an attempt to protect themselves from the who is considered to be the founder of medical disrepute on speculation based on data brought statistics because his analysis of such data con- about by political arithmeticians, the new society tributed to medical advances and crucial changes was founded upon the explicit promise to put the in policies concerning public health (see Green- emphasis, not on inference, but upon the collec- wood 1948). For a more extensive discussion of tion and tabulation of data of relevance to the the methodological and institutional develop- state. The founding document stated: ments associated with data collection and 8 Statistics and Economics