Propensity Score Methods for Causal Inference

John Pura BIOS790

October 2, 2015

John PuraBIOS790 Propensity Score Methods for Causal Inference Causal inference

Philosophical problem, statistical solution Important in various disciplines (e.g. Koch’s postulates, , Granger ) Good reference on history of causal inference: Paul Holland “Statistics and Causal Inference” JASA, 1986

John PuraBIOS790 Propensity Score Methods for Causal Inference What can we estimate?

Potential Outcomes Framework (Rubin’s )

Notation:

Z (1=treated, 0=control), baseline covariates X =(X1,...,Xp), outcome Y potential outcomes Y0, Y1 We observe (Z, Y , X)foranindividual Y = ZY (1)+(1 Z)Y (0) ≠ Causal eect of treatment: Y (1) Y (0) ≠ Average causal eect: = E[Y (1) Y (0)] ≠

John PuraBIOS790 Propensity Score Methods for Causal Inference What can we estimate?

Average causal eect ACE or ATE = E[Y (1) Y (0)]) All ≠ ACE or ATT = E[Y (1) Y (0) Z = 1]) Exp ≠ | ACE or ATU = E[Y (1) Y (0) Z = 0]) Un ≠ | Estimand and statistical methods depends on the study goal/question

John PuraBIOS790 Propensity Score Methods for Causal Inference Assumptions

1. ZprecedesY 2. Stable Unit Treatment Value Assumption (SUTVA) non-interference no variation in treatment 3. Strongly Ignorable Treatment Assigment (SITA) 0 < P(Z = 1 X) < 1 (this is the propensity score) | (Y (0), Y (1)) Z X (very strong assumption) ‹ | no unobserved confounders

John PuraBIOS790 Propensity Score Methods for Causal Inference Randomized Controlled Trials vs. Observational Studies

RCTs

Treatment eects on outcome considered as causal Z is determined for each participant at random, (Y (0), Y (1)) Z ‹ E[(Y Z = 1) (Y Z = 0)] is unbiased estimate of | ≠ | ATT = ATE

Observational Study

Z is not controlled, (Y (0), Y (1)) Z ”‹ E(Y Z = 1)=E(Y (1) Z = 1) = E(Y (1)). Cannot obtain | | ” unbiased estimate by direct comparison. But...

John PuraBIOS790 Propensity Score Methods for Causal Inference Potential Solution

In observational studies, assuming SITA assumption is met then treatment assignment, Z, among individuals with particular X is essentially random and independent of potential outcomes Rosenbaum and Rubin (1983) - conditioning on the propensity score (PS) we can identify E(Y (0)) and E(Y (1)) from the observed data (Z, Y , X)andultimatelyestimate.

John PuraBIOS790 Propensity Score Methods for Causal Inference Propensity Score

Austin, 2011: “The propensity score is a balancing score: conditional on the propensity score, the distribution of observed baseline covariates will be similar between treated and untreated subjects” This is a large sample property Unknown in practice, but can be estimated from the data, given some assumptions on e(X) (e.g. parametric regression model, CRTs) . Mathematically: e(X)=P(Z = 1 X).R&Rshowedthat | X Z e(X) and in addition to the SITA assumption, ‹ | (Y (0), Y (1)) Z e(X). ‹ | For theoretical properties see R&R (1983) and Lunceford and Davidian (2004)

John PuraBIOS790 Propensity Score Methods for Causal Inference The Propensity Score Model

Goal: Covariate balance

Popular method for estimating PS is logistic regression, though others exist (e.g. tree-based methods, random forests, neural networks, etc.) Regress logit[P(Z = 1 X)] on X and obtain predicted | probabilities (ˆe(X)) R&R (1984) and Austin 2011 describe an iterative approach: 1. Specify an initial model to estimate ˆe(X) 2. Perform diagnostics to assess covariate balance for each treatment 3. Modify PS by adding covariates, interactions, or using non-linear terms 4. Important: Each step should not be motivated by statistical significance but by objective

John PuraBIOS790 Propensity Score Methods for Causal Inference The Propensity Score Model

Goal: Covariate balance What covariates do we include?

Selection driven by subject-matter knowledge Only baseline variables Include all confounders and possible non-linear transformations (e.g. interactions). Overfitting generally not an issue (unless treatment is uncommon) Always include variables that aect the outcome even if they don’t aect treatment assignment (Brookhart et al. (2006))

John PuraBIOS790 Propensity Score Methods for Causal Inference Diagnostics

How do we know the PS model has been adequately specified?

Assess standardized dierences of each covariate between treatment groups (very useful) Assess PS distributions by treatment (need common support condition) Compare distributions of the covariates between treatments Varies with PS method Dicult in practice with high dimensional data Assess the sensitivity of study conclusions to the SITA assumption.

John PuraBIOS790 Propensity Score Methods for Causal Inference Methods utilizing PS

Matching Stratification Inverse PS weighting Covariate adjustment by PS

PS methods allow for estimation of the marginal treatment eect. The first three separate the design of the study from the analysis of the study. Can do subsequent regression adjustment to eliminate residual imbalance in prognostically important covariates after first three PS methods

John PuraBIOS790 Propensity Score Methods for Causal Inference Matching

Simple formulation for ATT For each treated subject, select single untreated subject (without replacement) with same value of ˆe(X) or its logit (R&R, 1985) Take dierence of outcomes for the matched pair and average over all matched pairs Calculating ATE and ATU require slightly dierent sampling, possibly with replacement Advantage: Eliminates large proportion of systematic dierences in baseline characteristics between treated and untreated subjects Disadvantage: Inexact matching may lead to bias. Unmatched individuals are discarded, leading to loss in statistical power. Discarding individuals may also alter our estimand (Hill, 2008)

John PuraBIOS790 Propensity Score Methods for Causal Inference Stratification

Easily estimate ATT: Create quantiles (e.g. quintiles) of the PS values, thereby dividing the subjects into equal-sized strata Within each stratum estimate treatment eect Calculate weighted average of within-strata estimates of treatment eect. Weight of each stratum is simply the percent of the quantile Estimating ATE and ATU require weighting by fraction of treated or untreated individuals, respectively, per stratum Advantage: Easy to construct and estimate causal eects. Disadvantage: Small number of strata may result in residual within the strata, resulting in bias. ATT estimates largely biased (compared to weighting)

John PuraBIOS790 Propensity Score Methods for Causal Inference Inverse weighting

Weighted linear regression of outcome on treatment where Z 1 Z w = + ≠ w1 w2

For ATE, w1 = e(X), w2 = 1 e(X);forATT,w1 = 1, e(X) ≠1 e(X) ≠ w2 = 1 e(X) ;ForATU,w1 = e(X) , w2 = 1. (Morgan & Todd, 2008)≠ Advantage: Uses all available data; Can deal with more complex non-linear link functions (e.g. odds ratio); generally less biased than stratification (Lunceford & Davidian, 2004) Disadvantage: An individual with PS close to 0 or 1 will have unstable weights, leading to potentially spurious treatment eects with high variance and wide CIs.

John PuraBIOS790 Propensity Score Methods for Causal Inference Covariate adjustment using PS

Fit model: E(Y Z, X)=– + —Z + “f (e(X)) (may include | interaction of Z and e(X)) Can obtain ATE, ATT, and ATU by evaluating — at dierent values of ˆe(X) Advantage: Allows for flexible relationship between PS and outcome (e.g. use of splines for PS) Disadvantage: Sensitive to whether PS has been accurately estimated. Analyst may be tempted to work toward desired or anticipated result, given that outcome is in sight.

John PuraBIOS790 Propensity Score Methods for Causal Inference Final Thoughts

PS methods can be done without reference to outcome - i.e. separate study design from analysis Balance of covariates can be easily checked PS methods more robust to model misspecification compared to traditional outcome regression (all we care about is balance) Measures a dierent quantity, namely, the marginal/population treatment eect (vs. conditional/individual treatment eect in traditional regression) Important to distinguish the two in relation to study goals Omitted variable bias aects internal validity of both approaches similarly Strategy so far is to balance covariates. Another idea is to find an “instrument”" S that is randomly assigned and aects Y only through Z

John PuraBIOS790 Propensity Score Methods for Causal Inference