Comment on “The Environment and Disease: Association or Causation?”

Christopher J. Phillips, Joel Greenhouse
Observational Studies, Volume 6, Issue 2, 2020, pp. 24-29 (Article)
Published by University of Pennsylvania Press
DOI: https://doi.org/10.1353/obs.2020.0005
For additional information about this article: https://muse.jhu.edu/article/793346/summary
Submitted 8/19; Published 1/20

Christopher J. Phillips, Department of History
Joel Greenhouse, Department of Statistics
Carnegie Mellon University, Pittsburgh, PA 15289

A. B. Hill's 1965 discussion of the relationship between statistical association and causality has become so well known among epidemiologists that it is easy to treat his list of relevant “aspects” as timeless and context-less, as a set of logical postulates. In this comment, we instead want to place Hill's article in its historical context, both that of the author and, even more so, that of epidemiology's understanding of statistical association and causation.

The most obvious, but often neglected, context is that the text was first presented as the Presidential Address to the Royal Society of Medicine's Section of Occupational Medicine. Presidential addresses, particularly when given by senior colleagues, are opportunities for reflection beyond a typical research article. There's nothing necessary about this reflectivity, however: the following year's presidential address by R.S.F. Schilling was instead an impassioned account of the dangers of trawler fishing (Schilling, 1966).
This was also not the first such address Hill himself had given: his May 1954 address to the Epidemiology and Preventive Medicine section eschewed broad generalization in favor of a more traditional account of the expected versus observed cases of interwar polio in England and Wales, broken down by county and locale (Hill, 1954). The important contextual difference in October 1964 was that the Occupational Medicine section had just been formed, and Terence Cawthorne's opening address noted that the section was intended not just for physicians and surgeons but also for scientists from a range of disciplines (Cawthorne, 1965). As subsequent speakers at the first meeting made clear, the traditional focus on industrial medicine was still present, but the renaming as “occupational medicine” was a move indicating its broader audience, from sports physicians to education workers.

In this context, Hill set out to address a question that by its very nature was interdisciplinary and of great interest to this larger group: what is the relationship between an environmental agent and disease, and when is it appropriate to identify such a relationship as causal? This was a pressing issue for the new section, Hill wrote, because the characterization of the relationship between occupational conditions and sickness is “fundamental” and yet problematic. When is a respiratory illness among workers, he asks, simply associated with dust in the environment, and when is it caused by it? As Hill well knew, the question of causation cannot be answered solely by any one field. Instead, by drawing on physiology, statistics, labor relations, public health, and pulmonology, Hill sought to portray the question as essentially interdisciplinary and therefore of great interest to the new section.

© 2020 Christopher Phillips and Joel Greenhouse.

Hill himself had long been concerned with causal questions.
Though he modestly doesn't cite his own work, he had written a long report on the prevalence and origins of respiratory illnesses among Lancashire's cotton cardroom operators in the 1920s for the Medical Research Council. The cleaning of raw cotton prior to spinning was known to cast off dust and fibers, and in 1927 the UK's Home Secretary established an investigation into “whether, and if so to what extent, dust in cardrooms in the cotton industry is a cause of ill-health or disease among cardroom operatives” (Hill, 1930). It was known already that there were health effects from the cleaning of the carding machines themselves, and the owners of the mills contended that the mechanical methods installed to remove dust had remedied the problem, while operatives contended that cleaning cotton carried health risks distinct from the cleaning of the machines. Hill was tasked with finding out what kind of ill effects, if any, there were from the cleaning of cotton, whether these were distinct from other respiratory diseases known to occur at different stages of the process, and what, if anything, could be done about it. Hill's choice of this same example in 1964 is a clear indication that the question of “environment and disease” was one that he had been contemplating for a long time.

Hill's role within industrial health efforts of the 1920s and 1930s put him in contact with colleagues who themselves saw an essential role for statistics in making causal claims. Unlike then-contemporary biomedicine's focus on bacteria and other microscopic agents of infection, industrial and environmental health were areas in which “causal factors” and “causal relationships” were known to be multifactored, complicated, and often hidden. Work in these areas using statistical rates to make claims about causality, from the relationship of housing and health to that of infant mortality, goes back at least to William Farr and Florence Nightingale.
Later, Udny Yule had used statistical methods, specifically a regression equation, to try to pinpoint causes of pauperism in England at the turn of the century (Yule, 1899, discussed thoughtfully alongside other examples in Freedman, 1999). Hill's work at the Medical Research Council along these lines was initially overseen by Major Greenwood, an influential statistician whose own training, combining statistics (he studied under Karl Pearson) with physiology and preventive medicine, exemplified the growing importance of data for measuring associations between health and environmental agents (Higgs, 2000). Indeed, there's a good argument that Hill inherited the mantle of statistics in medicine from this earlier generation. Their approaches, which emphasized careful study design and data collection techniques, relied on relatively conservative uses of statistics, and avoided formal inference tests and elaborate models, would later characterize Hill's own approach throughout his career.

With remarkable clarity in his 1964 address, Hill lays out the question of interest:

“Our observations reveal an association between two variables, perfectly clear-cut and beyond what we would care to attribute to the play of chance. What aspects of that association should we especially consider before deciding that the most likely interpretation of it is causation?” (p. 295)

Contrary to Rothman and Greenland's claim (1998) that Hill's criteria were essentially “an expansion of criteria offered previously in the landmark Surgeon General's report on Smoking and Health,” Hill's approach for distinguishing causal from non-causal associations was developed based on his longstanding experience in epidemiologic field studies. As an illustration we consider his influential paper with Richard Doll, “Smoking and Carcinoma of the Lung” (1950).
This was an early case-control study, conducted at 20 hospitals in the London region, of patients presenting with cancer of the lung, stomach, and large bowel. The patients with carcinoma of the stomach and large bowel served as one comparison group; another comparison group consisted of non-cancer general hospital patients, “chosen so as to be of the same sex and age as the lung-carcinoma patients.” In the Discussion section, Doll and Hill synthesized the results in language that would both implicitly and explicitly feature the relevant aspects of association, specificity, biological gradient, consistency, coherence, and plausibility:

• “...the comparison of the smoking habits of patients in different groups...revealed no association between smoking and cancer of the other sites (mainly stomach and large bowel). The association therefore seems to be specific to carcinoma of the lung.”
• “The effect of smoking varies, as would be expected, with the amount smoked.”
• “How do these results fit in with other known facts about smoking and carcinoma of the lung? Both the consumption of tobacco and the number of deaths attributed to cancer of the lung are known to have increased, and to have increased largely, in many countries this century.”
• “As to the nature of the carcinogen we have no evidence. The only carcinogenic substance which has been found in tobacco smoke is arsenic, but the evidence that arsenic can produce carcinoma of the lung is suggestive rather than conclusive. Should arsenic prove to be the carcinogen, the possibility arises that it is not the tobacco itself which is dangerous. Insecticides containing arsenic have been used for the protection of the growing crop since the end of the last century and might conceivably be the source of the responsible factor.”

Clearly, Doll and Hill are systematically and logically assessing the body of evidence from their study and the existing epidemiologic literature to make a case for (or against) a causal association.
In the last quotation, Doll and Hill consider an alternative explanation for the observed association
