Inferring Causality Through Counterfactuals in Observational Studies Some Epistemological Issues

November 15. 2010 Inferring Causality through Counterfactuals in Observational Studies Some epistemological issues Federica RUSSOa, Guillaume WUNSCHb and Michel MOUCHARTc a Philosophy, University of Kent, UK b Demography, University of Louvain (UCLouvain), Belgium c Statistics, University of Louvain (UCLouvain), Belgium Abstract This paper contributes to the debate on the virtues and vices of counterfactuals as a basis for causal inference. The goal is to put the counterfactual approach in an epistemological perspective. We discuss a number of issues, ranging from its non-observable basis to the parallelisms drawn between the counterfactual approach in statistics and in philosophy. We argue that the question is not to oppose or to endorse the counterfactual approach as a matter of principle, but to decide what modelling framework is best to adopt depending on the research context. 1. Introduction and Background Arguably, there are two reasons why causal analysis is important in science as well as in everyday life. One is that if we know the causes we are more likely to provide a better explanation and understanding of a given phenomenon. The other is that if we know the causes, we are more likely to take better action or intervention, that is to design, for example, more efficient social or public health policies or to advise on individual treatments. Controversial as it may sound at first glance, there is a sense in which causal inference is an easy task: it suffices to consider idealised situations. Identify the putative causes and effects, manipulate the causes holding fixed anything else, and see what happens. This is, in essence, the pillar of Baconian science. Without going into the historical details of the revolution Francis Bacon made in scientific method, the reader may like to be reminded that, with Bacon, science becomes a scientia operativa (Klein 2008 and 2009): to get to know about the world the scientist does not just passively observe it, but s/he interacts with it. The modern scientist is a ―maker‖ (Ducheyne 2005): s/he performs experiments, that is s/he actively manipulates factors to find out what causes what. But as science has evolved, methods have become more sophisticated too. A powerful tool introduced by Fisher in the early 1920s is randomisation. For the sake of history, the first historically recognised randomised experiment was run by Peirce and Jastrow (1885) in psychometrics, but randomisation had to wait nearly 50 years to receive an adequate conceptualisation and discussion (on this point see for instance Rescher (1978) and Hall (2007)). Randomisation, in the original thought of Fisher, is a means for eliminating bias in the results due to uncontrolled differences in experimental conditions. Whilst we know that in laboratory experiments ideal conditions are more often met because uncontrolled variations in the 1 environment are much better known, this is certainly not the case in agricultural studies where Fisherian randomisation originated, nor in social and biomedical contexts where phenomena and environmental conditions are highly complex. Randomisation is somehow a heir of Baconian science because it ultimately aims to make causal inference reliable implementing the same ideas holding up the Baconian method: manipulation and control. Randomness, in fact, increases the efficiency of the experiments in the sense that, because unwanted sources of variation are controlled for, the sought level of significance is achieved in fewer trials. Also, by ensuring that unwanted sources of variation are minimised or even eliminated, randomisation ensures that only the cause is manipulated. (See Fisher 1925 and 1935 for the original formulation of randomisation in experimental design, and Rescher 1978, Hacking 1988 and Hall 2007 for historical reconstructions and critical appraisals of the meaning and development of Fisherian randomisation.) However, as it happens, most studies in the social sciences are constructed on the basis of observational data and not experimental ones. The reason is that randomisation is often unethical or simply not feasible. This makes the reliability of observational studies a real challenge because not only human populations are highly heterogeneous both with respect to know/unknown and non-observable/non-observed factors, but also because, if randomisation cannot be performed, there is less grip on the sources of ‗unwanted variations‘—as Fisher called them—and on the mechanisms of assignment. Here is an example that illustrates some difficulties related to heterogeneity; it stems from the work of Tietze, Potter, Henry, and others in the 1970s. In developed societies, women using contraceptives may have a higher fertility than non-contracepting women of fertile ages. This paradoxical result has been explained by the fact that many non-contracepting women are probably sterile or sub-fecund and therefore do not have recourse to contraception, in order to conceive. The measure of the use-effectiveness of methods of fertility control must therefore take the heterogeneity of the fecundity of the population into account, e.g. by comparing the fertility of current contraceptors to that of contraceptors who stop using birth control in order to conceive. The groups one compares should therefore be as similar as possible, except for the fact that one group experiences the putative cause and the other does not. The best situation would then be the following: to compare, at the same time, the outcome in the group experiencing the treatment to the outcome in the same group not taking the treatment. In this case, the two groups would indeed be perfectly identical, except for the fact that one experiences the cause and the other not. Needless to say, it is not possible that the same individuals take and do not take treatment at the same time. But this practical difficulty does not prevent us from imagining what would happen if the same individuals did take and did not take the treatment. It is this way of reasoning that led Donald Rubin (1974) to develop a counterfactual framework of causality that we will briefly present in section 2. Rubin’s Causal Model, as it is now called (Holland, 1986), has become a standard reference in the literature on causality. The strength of the counterfactual approach seems to lie in the attempt to implement the pillars of Baconian science—that is those principles that most ensure the reliability of causal inference: manipulation and control. On the one hand, if the two groups (actually, the same group) only differ as to whether individuals receive the treatment or not, then the action of possible confounders is minimised if not nullified. On the other hand, once we hold fix everything else, the only factor subject to manipulation—albeit ideal manipulation—is the putative cause. Because in social science it is not always possible to manipulate or randomise, 2 the counterfactual framework apparently comes to rescue because it somewhat implements the same ideas of Baconian science—namely manipulation and control—without requiring actual manipulation. However, the counterfactual approach has its share of problems too, highlighted both by scientists and by philosophers (see later section 3). This paper adds to the debates on the virtues and vices of counterfactuals, but does not aim to take definite side with the camp of the counterfactualists or the camp of the anti- counterfactualists. The general goal of the paper is to put the counterfactual approach in perspective, from an epistemological point of view. The starting point of the paper is based on two main ideas. Firstly, there is no unanimous consensus on a unique concept of causality: it is the task of the scientist to construct an epistemologically sound concept. When a suggested concept of causality cannot cope with an important and relevant scientific issue, the concept should be amended. Secondly, we believe that the root of the difficulty, when building a concept of causality in social sciences, lies in the intertwined issue that individuals are heterogeneous and that the latter may be due to differences in potentially causal variables. This is so because social processes are typically complex, involving multiple causes- multiple effects mechanisms. Our position, that we shall develop and articulate in sections 3 and 4, can be summarised as follows. The question is not to oppose the counterfactual approach, randomisation or manipulation as a matter of principle. The possibility and, consequently, the decision to use manipulation or counterfactuals or to randomise in a given study depends on practical aspects such as the kind of data (for instance experimental or observational) the scientist has access to. On a more epistemological tone, our view is that the concept of causality should not necessarily rely on the concepts of counterfactuality or of manipulability. This, as we shall explain in more detail later, is for several reasons. One reason is that that there may be concepts other than (or in addition to) counterfactuality and manipulation to be used in the explication of the concept of causality. Another reason is that the counterfactual approach should be viewed as one among various possible methods to search for effects of causes and that there is no principled reason why it should necessarily be involved. Actually, causal analysis encompasses many more methods and approaches than just counterfactual models or randomised trials. This is not just a contingency due to the richness of scientific methodology, but it is also due to the fact that one may need different causal methods depending on whether the goal is to explain a phenomenon, to measure effects of known causes, to take action in response to the causal knowledge gathered, to assign causes to observed outcomes, etc. The paper is organised as follows. In section 2 we recall the main features of the counterfactual approach. In section 3 we discuss six issues concerning the counterfactual approach. The first two issues have already been widely discussed in the literature.

Load more