Defining the Study Population for an Observational Study to Ensure

Total Page:16

File Type:pdf, Size:1020Kb

Defining the Study Population for an Observational Study to Ensure Statistics in the Biosciences manuscript No. (will be inserted by the editor) Defining the Study Population for an Observational Study to Ensure Sufficient Overlap: A Tree Approach Mikhail Traskin · Dylan S. Small Received: date / Accepted: date Abstract The internal validity of an observational study is enhanced by only comparing sets of treated and control subjects which have sufficient overlap in their covariate distributions. Methods have been developed for defining the study population using propensity scores to ensure sufficient overlap. However, a study population defined by propensity scores is difficult for other investi- gators to understand. We develop a method of defining a study population in terms of a tree which is easy to understand and display, and that has similar internal validity as that of the study population defined by propensity scores. Keywords Observational study · Propensity score · Overlap 1 Introduction An observational study attempts to draw inferences about the effects caused by a treatment when subjects are not randomly assigned to treatment or control as they would be in a randomized trial. A typical approach to an observational study is to attempt to measure the covariates that affect the outcome, and then adjust for differences between the treatment and control groups in these covariates via propensity score methods [25,16], matching methods [21,13] or regression; see [18], [22] and [27] for surveys. Mikhail Traskin Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104, USA Tel.: +215-898-8231 Fax: +215-898-1280 E-mail: [email protected] Dylan S. Small Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104, USA E-mail: [email protected] 2 A quantity that is often of central interest in an observational study is the average treatment effect, the average effect of treatment over the whole population. However, in many observational studies, there is a lack of overlap, meaning that parts of the treatment and control group’s covariate distribu- tions do not overlap. For example, in studies of the comparative effectiveness of medical treatments, virtually all patients of certain types may receive a par- ticular treatment and virtually no patients of certain other types may receive this treatment, but there may be a marginal group of patients who may or may not receive the treatment depending upon circumstances such as where the patient lives, patient preference or the physician’s opinion [24]. When there is a lack of overlap, inferences for the average treatment effect rely on extrap- olation. This is because extrapolation is needed to estimate the treatment effect for those subjects in the treatment group whose covariates differ sub- stantially from any subjects in the control group (or vice versa). Rather than rely on extrapolation, it is common practice to limit the study population to those subjects with covariates that lie in the overlap between the treatment and control groups. In comparative effectiveness studies, these subjects in the overlap are the marginal patients for whom there is some chance that they would receive either treatment or control based on their covariates. Focusing on the average treatment effect for the marginal patients rather than all pa- tients enhances the internal validity of the study and is more informative for deciding how to treat patients for whom there is currently no definitive stan- dard of care. Knowing the average treatment effect on the currently marginal patients may also shift the margin and over a period of time, a sequence of studies may gradually shift the consensus of which patients should be treated [24]. A number of approaches have been developed for defining a study popula- tion that has overlap between the treated and control groups. An often used approach is to discard subjects whose propensity score values fall outside the range of propensity scores in the subsample with the opposite treatment [9, 27]; the propensity score is the probability of receiving treatment given the measured covariates [25]. This approach seeks to make the study population as large as possible while maintaining overlap. However, if there are areas of limited overlap, that is parts of the covariate space where there are lim- ited numbers of observations for the treatment group compared to the control group or vice versa, the average treatment effect estimate on this study popu- lation may have large variance. It may be better to consider a more restrictive study population for which there is sufficient overlap. The goal is to define a study population which is as inclusive as possible but for which there is enough overlap that, not only is extrapolation not needed, but also the aver- age treatment effect can be well estimated. Crump, Hotz, Imbens and Mitnik [8] and Rosenbaum [24] develop criterion by which to compare different choices of study populations according to this goal and then choose the study pop- ulation to optimize these criteria. Crump et al.’s criterion is the variance of the estimated average treatment effect on the study population. They show that, under some conditions, the optimal study population by this criterion is 3 those subjects whose propensity scores lie in an interval [α, 1 − α] with the op- timal interval determined by the marginal distribution of the propensity score and usually well approximated by [0.1, 0.9]. Rosenbaum’s criterion involves balancing (i) the sum of distances between the propensity scores (or related quantities) of matched treated and control subjects in the study population and (ii) the size of the study population; the criterion will be explained in detail in Section 2.3. Rosenbaum develops an algorithm that uses the optimal assignment algorithm for choosing the optimal study population according to his criterion. All of the above approaches define the study population in terms of the propensity score and related quantities. A difficulty with these approaches is that it is hard to have a clear understanding of a study population defined in terms of propensity scores. Rosenbaum, in his book Design of Observational Studies [23], states, “Rather than delete individuals one at a time based on extreme propensity scores, it is usuallybettertogobacktothecovariates themselves, perhaps redefining the population under study to be a subpopu- lation of the original population of subjects. A population defined in terms of [the propensity score] is likely to have little meaning to other investigators, whereas a population defined in terms of one or two familiar covariates will have a clear meaning.” Our goal in this paper is to develop an approach to defining a study pop- ulation that has sufficient overlap and is good by Crump et al.’s [8] criterion or Rosenbaum’s [24] criterion for judging study populations, but that is also easily described. Our approach is to use a classification tree [4] to define the study population in a way that approximates a propensity score based rule for defining the study population. The resulting study population is easily de- scribed by a tree diagram. Figure 2 provides an example of a study population that is defined in terms of a classification tree. Our paper is organized as follows. Section 2 provides the framework and assumptions that we will use as well as reviewing Crump et al.’s (Section 2.2) and Rosenbaum’s (Section 2.3) approaches to defining the study population. Section 3 describes our method. Sections 4 and 5 presents examples. Section 6 provides discussion. 2 Framework 2.1 Assumptions and Notation The framework we use is that of [25]. We have a random sample of size N from a large population. For each subject i in the sample, let Di denote whether or not the treatment of interest was received, with Di =1ifsubjecti receives 1 the treatment of interest and Di = 0 if subject i receives the control. Let Yi 0 be the outcome that subject i would have if she received the treatment and Yi be the outcome that subject i would have if she received the control; these are 1 0 called potential outcomes. The treatment effect for subject i is τi = Yi − Yi . 4 Let τ(X) denote the average treatment effect for subjects with covariates X, 1 0 τ(X)=E[Yi − Yi |Xi = X]. Di We observe Di and Yi,whereYi = Yi . In addition, we observe a vector of pre-treatment covariates denoted by Xi, where the support of the covariates is X. The propensity score e(X) for a subject with covariates X is the probability of selection into the treatment given X, e(X)=P (Di =1|Xi = X). We make the assumption that all of the confounders have been measured [25]. This means that, conditional on the covariates X, the treatment indicator is independent of the potential outcomes: 0 1 Assumption 1:Di⊥⊥(Yi ,Yi )|Xi. Assumption 1 is called the strongly ignorable treatment assignment assump- tion [25] or the unconfoundedness assumption [18]. 2.2 Minimum Variance Approach to Defining the Study Population Crump et al. [8] seek to choose the study population that allows for the most precise estimation of the average treatment effect within the study population. They show that, under some conditions, this leads to discarding observations with propensity scores outside an interval [α, 1 − α] with the optimal cut-off value determined by the marginal distribution of the propensity score. Their approach is consistent with the common practice of dropping subjects with extreme values of the propensity score with two differences. First, the role of the propensity score in the selection rule is not imposed a priori, but emerges as a consequence of the criterion, and second, there is a principled way of choosing the cutoff value α. They show that the precision gain from their approach can be substantial with most of the gain captured by using a rule of thumb to discard observations with estimated propensity score outside the range [0.1, 0.9].
Recommended publications
  • Observational Clinical Research
    E REVIEW ARTICLE Clinical Research Methodology 2: Observational Clinical Research Daniel I. Sessler, MD, and Peter B. Imrey, PhD * † Case-control and cohort studies are invaluable research tools and provide the strongest fea- sible research designs for addressing some questions. Case-control studies usually involve retrospective data collection. Cohort studies can involve retrospective, ambidirectional, or prospective data collection. Observational studies are subject to errors attributable to selec- tion bias, confounding, measurement bias, and reverse causation—in addition to errors of chance. Confounding can be statistically controlled to the extent that potential factors are known and accurately measured, but, in practice, bias and unknown confounders usually remain additional potential sources of error, often of unknown magnitude and clinical impact. Causality—the most clinically useful relation between exposure and outcome—can rarely be defnitively determined from observational studies because intentional, controlled manipu- lations of exposures are not involved. In this article, we review several types of observa- tional clinical research: case series, comparative case-control and cohort studies, and hybrid designs in which case-control analyses are performed on selected members of cohorts. We also discuss the analytic issues that arise when groups to be compared in an observational study, such as patients receiving different therapies, are not comparable in other respects. (Anesth Analg 2015;121:1043–51) bservational clinical studies are attractive because Group, and the American Society of Anesthesiologists they are relatively inexpensive and, perhaps more Anesthesia Quality Institute. importantly, can be performed quickly if the required Recent retrospective perioperative studies include data O 1,2 data are already available.
    [Show full text]
  • Observational Studies and Bias in Epidemiology
    The Young Epidemiology Scholars Program (YES) is supported by The Robert Wood Johnson Foundation and administered by the College Board. Observational Studies and Bias in Epidemiology Manuel Bayona Department of Epidemiology School of Public Health University of North Texas Fort Worth, Texas and Chris Olsen Mathematics Department George Washington High School Cedar Rapids, Iowa Observational Studies and Bias in Epidemiology Contents Lesson Plan . 3 The Logic of Inference in Science . 8 The Logic of Observational Studies and the Problem of Bias . 15 Characteristics of the Relative Risk When Random Sampling . and Not . 19 Types of Bias . 20 Selection Bias . 21 Information Bias . 23 Conclusion . 24 Take-Home, Open-Book Quiz (Student Version) . 25 Take-Home, Open-Book Quiz (Teacher’s Answer Key) . 27 In-Class Exercise (Student Version) . 30 In-Class Exercise (Teacher’s Answer Key) . 32 Bias in Epidemiologic Research (Examination) (Student Version) . 33 Bias in Epidemiologic Research (Examination with Answers) (Teacher’s Answer Key) . 35 Copyright © 2004 by College Entrance Examination Board. All rights reserved. College Board, SAT and the acorn logo are registered trademarks of the College Entrance Examination Board. Other products and services may be trademarks of their respective owners. Visit College Board on the Web: www.collegeboard.com. Copyright © 2004. All rights reserved. 2 Observational Studies and Bias in Epidemiology Lesson Plan TITLE: Observational Studies and Bias in Epidemiology SUBJECT AREA: Biology, mathematics, statistics, environmental and health sciences GOAL: To identify and appreciate the effects of bias in epidemiologic research OBJECTIVES: 1. Introduce students to the principles and methods for interpreting the results of epidemio- logic research and bias 2.
    [Show full text]
  • Study Types Transcript
    Study Types in Epidemiology Transcript Study Types in Epidemiology Welcome to “Study Types in Epidemiology.” My name is John Kobayashi. I’m on the Clinical Faculty at the Northwest Center for Public Health Practice, at the School of Public Health and Community Medicine, University of Washington in Seattle. From 1982 to 2001, I was the state epidemiologist for Communicable Diseases at the Washington State Department of Health. Since 2001, I’ve also been the foreign adviser for the Field Epidemiology Training Program of Japan. About this Module The modules in the epidemiology series from the Northwest Center for Public Health Practice are intended for people working in the field of public health who are not epidemiologists, but who would like to increase their understanding of the epidemiologic approach to health and disease. This module focuses on descriptive and analytic epide- miology and their respective study designs. Before you go on with this module, we recommend that you become familiar with the following modules, which you can find on the Center’s Web site: What is Epidemiology in Public Health? and Data Interpretation for Public Health Professionals. We introduce a number of new terms in this module. If you want to review their definitions, the glossary in the attachments link at the top of the screen may be useful. Objectives By now, you should be familiar with the overall approach of epidemiology, the use of various kinds of rates to measure disease frequency, and the various ways in which epidemiologic data can be presented. This module offers an overview of descriptive and analytic epidemiology and the types of studies used to review and investigate disease occurrence and causes.
    [Show full text]
  • Chapter 5 Experiments, Good And
    Chapter 5 Experiments, Good and Bad Point of both observational studies and designed experiments is to identify variable or set of variables, called explanatory variables, which are thought to predict outcome or response variable. Confounding between explanatory variables occurs when two or more explanatory variables are not separated and so it is not clear how much each explanatory variable contributes in prediction of response variable. Lurking variable is explanatory variable not considered in study but confounded with one or more explanatory variables in study. Confounding with lurking variables effectively reduced in randomized comparative experiments where subjects are assigned to treatments at random. Confounding with a (only one at a time) lurking variable reduced in observational studies by controlling for it by comparing matched groups. Consequently, experiments much more effec- tive than observed studies at detecting which explanatory variables cause differences in response. In both cases, statistically significant observed differences in average responses implies differences are \real", did not occur by chance alone. Exercise 5.1 (Experiments, Good and Bad) 1. Randomized comparative experiment: effect of temperature on mice rate of oxy- gen consumption. For example, mice rate of oxygen consumption 10.3 mL/sec when subjected to 10o F. temperature (Fo) 0 10 20 30 ROC (mL/sec) 9.7 10.3 11.2 14.0 (a) Explanatory variable considered in study is (choose one) i. temperature ii. rate of oxygen consumption iii. mice iv. mouse weight 25 26 Chapter 5. Experiments, Good and Bad (ATTENDANCE 3) (b) Response is (choose one) i. temperature ii. rate of oxygen consumption iii.
    [Show full text]
  • Analytic Epidemiology Analytic Studies: Observational Study
    Analytic Studies: Observational Study Designs Cohort Studies Analytic Epidemiology Prospective Cohort Studies Retrospective (historical) Cohort Studies Part 2 Case-Control Studies Nested case-control Case-cohort studies Dr. H. Stockwell Case-crossover studies Determinants of disease: Analytic Epidemiology Epidemiology: Risk factors Identifying the causes of disease A behavior, environmental exposure, or inherent human characteristic that is associated with an important health Testing hypotheses using epidemiologic related condition* studies Risk factors are associated with an increased probability of disease but may Goal is to prevent disease (deterrents) not always cause the diseases *Last, J. Dictionary of Epidemiology Analytic Studies: Cohort Studies Panel Studies Healthy subjects are defined by their Combination of cross-sectional and exposure status and followed over time to cohort determine the incidence of disease, symptoms or death Same individuals surveyed at several poiiiints in time Subjects grouped by exposure level – exposed and unexposed (f(reference group, Can measure changes in individuals comparison group) Analytic Studies: Case-Control Studies Nested case-control studies A group of individuals with a disease Case-control study conducted within a (cases) are compared with a group of cohort study individuals without the disease (controls) Advantages of both study designs Also called a retrospective study because it starts with people with disease and looks backward for ppprevious exposures which might be
    [Show full text]
  • 1 Sensitivity Analysis in Observational Research: Introducing the E-‐Value
    Sensitivity Analysis in Observational Research: Introducing the E-value Tyler J. VanderWeele, Ph.D., Departments of Epidemiology and Biostatistics, Harvard T.H. Chan School of Public Health Peng Ding, Ph.D., Department of Statistics, University of California, Berkeley This is the prepublication, author-produced version of a manuscript accepted for publication in Annals of Internal Medicine. This version does not include post-acceptance editing and formatting. The American College of Physicians, the publisher of Annals of Internal Medicine, is not responsible for the content or presentation of the author-produced accepted version of the manuscript or any version that a third party derives from it. Readers who wish to access the definitive published version of this manuscript and any ancillary material related to this manuscript (e.g., correspondence, corrections, editorials, linked articles) should go to Annals.org or to the print issue in which the article appears. Those who cite this manuscript should cite the published version, as it is the official version of record: VanderWeele, T.J. and Ding, P. (2017). Sensitivity analysis in observational research: introducing the E-value. Annals of Internal Medicine, 167(4):268-274. 1 Abstract. Sensitivity analysis can be useful in assessing how robust associations are to potential unmeasured or uncontrolled confounding. In this paper we introduce a new measure that we call the “E-value,” a measure related to the evidence for causality in observational studies, when they are potentially subject to confounding. The E-value is defined as the minimum strength of association on the risk ratio scale that an unmeasured confounder would need to have with both the treatment and the outcome to fully eXplain away a specific treatment-outcome association, conditional on the measured covariates.
    [Show full text]
  • Chapter 20 Design and Analysis of Experiments and Observational
    Chapter 20 Design and Analysis of Experiments and Observational Studies Point of both observational studies and designed experiments is to identify variable or set of variables, called explanatory variables, which are thought to predict outcome or response variable. Observational studies are an investigation of the association between treatment and response, where subject who decide whether or not they take a treatment. In experimental designs, experimenter decides who is given a treat- ment and who is to be a control, the experimenter alters levels of treatments when investigating how these alterations cause change in the response. 20.1 Observational Studies Exercise 20.1 (Observational Studies) 1. Observational Study versus Designed Experiment. (a) Effect of air temperature on rate of oxygen consumption (ROC) of four mice is investigated. ROC of one mouse at 0o F is 9.7 mL/sec for example. temperature (Fo) 0 10 20 30 ROC (mL/sec) 9.7 10.3 11.2 14.0 Since experimenter (not a mouse!) decides which mice are subjected to which temperature, this is (choose one) observational study / designed experiment. (b) Indiana police records from 1999–2001 on six drivers are analyzed to de- termine if there is an association between drinking and traffic accidents. One heavy drinker had 6 accidents for example. 169 170Chapter 20. Design and Analysis of Experiments and Observational Studies (lecture notes 10) drinking → heavy light 3 1 6 2 2 1 This is an observational study because (choose one) i. police decided who was going to drink and drive and who was not. ii. drivers decided who was going to drink and drive and who was not.
    [Show full text]
  • Meta-Analysis of Observational Studies in Epidemiology (MOOSE)
    CONSENSUS STATEMENT Meta-analysis of Observational Studies in Epidemiology A Proposal for Reporting Donna F. Stroup, PhD, MSc Objective Because of the pressure for timely, informed decisions in public health and Jesse A. Berlin, ScD clinical practice and the explosion of information in the scientific literature, research Sally C. Morton, PhD results must be synthesized. Meta-analyses are increasingly used to address this prob- lem, and they often evaluate observational studies. A workshop was held in Atlanta, Ingram Olkin, PhD Ga, in April 1997, to examine the reporting of meta-analyses of observational studies G. David Williamson, PhD and to make recommendations to aid authors, reviewers, editors, and readers. Drummond Rennie, MD Participants Twenty-seven participants were selected by a steering committee, based on expertise in clinical practice, trials, statistics, epidemiology, social sciences, and biomedi- David Moher, MSc cal editing. Deliberations of the workshop were open to other interested scientists. Fund- Betsy J. Becker, PhD ing for this activity was provided by the Centers for Disease Control and Prevention. Theresa Ann Sipe, PhD Evidence We conducted a systematic review of the published literature on the con- duct and reporting of meta-analyses in observational studies using MEDLINE, Educa- Stephen B. Thacker, MD, MSc tional Research Information Center (ERIC), PsycLIT, and the Current Index to Statistics. for the Meta-analysis Of We also examined reference lists of the 32 studies retrieved and contacted experts in Observational Studies in the field. Participants were assigned to small-group discussions on the subjects of bias, Epidemiology (MOOSE) Group searching and abstracting, heterogeneity, study categorization, and statistical methods.
    [Show full text]
  • Statistical Overview for Clinical Trials Basics of Design and Analysis of Controlled Clinical Trials
    Statistical Overview for Clinical Trials Basics of Design and Analysis of Controlled Clinical Trials Presented by: Behrang Vali M.S., CDER/OTS/OB/DB3 Special Thanks to: LaRee Tracy, Mike Welch, Ruthanna Davi, and Janice Derr 1 Disclaimer The views expressed in this presentation do not necessarily represent those of the U.S. Food and Drug Administration. 2 Common Study Designs • Case-Control Study – Retrospective Observational Study • Cohort Study – Prospective Observational Study • e.g. Natural History Studies • Controlled Clinical Trial (focus of this portion of the presentation) – Prospective Experimental Study – Best design to study causality – Utilized for design of adequate and well- controlled trials 3 Study Objective/Research Question Study Population of Interest and Design Endpoint and Measurement of Outcome Test Drug Control Drug Statistical Comparisons Interpretation Extrapolation 4 Study Objective/Research Question 5 STUDY OBJECTIVE • What do you want to show with this trial? • What is the population of interest? • What trial design will be used? • What are the specific study hypotheses? 6 What do you want to show with this trial? Efficacy • New drug is more efficacious than current standard therapy - active controlled superiority trial • New drug is more efficacious than placebo treatment - placebo controlled superiority trial • New drug is not worse (in terms of efficacy), by a pre-specified and clinically relevant amount, than the current therapy - active controlled non- inferiority trial – Note that there are no placebo
    [Show full text]
  • CONCEPTUALISING NATURAL and QUASI EXPERIMENTS in PUBLIC HEALTH Frank De Vocht1,2,3, Srinivasa Vittal Katikireddi4, Cheryl Mcquir
    CONCEPTUALISING NATURAL AND QUASI EXPERIMENTS IN PUBLIC HEALTH Frank de Vocht1,2,3, Srinivasa Vittal Katikireddi4, Cheryl McQuire1,2, Kate Tilling1,5, Matthew Hickman1, Peter Craig4 1 Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK 2 NIHR School for Public Health Research, Newcastle, UK 3 NIHR Applied Research Collaboration West, Bristol, UK 5 MRC IEU, University of Bristol, Bristol, UK 4 MRC/CSO Social and Public Health Sciences Unit, University of Glasgow, Bristol, UK Corresponding author: Dr Frank de Vocht. Population Health Sciences, Bristol Medical School, University of Bristol. Canynge Hall, 39 Whatley Road, Bristol, United Kingdom. BS8 2PS. Email: [email protected]. Tel: +44.(0)117.928.7239. Fax: N/A 1 ABSTRACT Background Natural or quasi experiments are appealing for public health research because they enable the evaluation of events or interventions that are difficult or impossible to manipulate experimentally, such as many policy and health system reforms. However, there remains ambiguity in the literature about their definition and how they differ from randomised controlled experiments and from other observational designs. Methods We conceptualise natural experiments in in the context of public health evaluations, align the study design to the Target Trial Framework, and provide recommendation for improvement of their design and reporting. Results Natural experiment studies combine features of experiments and non-experiments. They differ from RCTs in that exposure allocation is not controlled by researchers while they differ from other observational designs in that they evaluate the impact of event or exposure changes. As a result they are, in theory, less susceptible to bias than other observational study designs.
    [Show full text]
  • 1.3 Types of Studies: Observational Studies and Experiments
    1.3 Types of studies: Observational Studies and Experiments 1 Observational Studies ! Simply observing what happens " An opinion sample survey is an observational study. " Researcher does not ‘impose’ a treatment or randomly assign subjects to a group. 2 Study: Tanning and Skin Cancer ! The observational study involved 1,500 people. ! Selected a group of people who had skin cancer and another group of people who did not have skin cancer " Group membership based on innate characteristics in the subjects (they were not assigned to a group). ! Asked all participants whether they used tanning beds. " Wanted to see if there was an association between tanning beds and skin cancer prevalence. 3 Study: Sodium and Blood Pressure ! Enroll 100 individuals in the observational study. ! Give them diet diaries where they record everything they eat each day. From this the amount of sodium in the diet is found. ! Not assigning subjects to any certain amount of sodium (they choose it on their own). ! Measure their blood pressure. 4 Types of observational studies ! Retrospective – look at past records and historical data. " Tanning and Skin Cancer " Can be a Case-Control study ! Prospective – identify subjects and collect data as events unfold. " Sodium and Blood Pressure " If you collect data at multiple time points, it is a longitudinal study. ! Observational studies are often used in marketing or health studies. 5 Case-Control study ! The conclusions for observational studies are not as strong as experiments (which use randomization), but sometimes we can not ethically perform an experiment. " Can you randomly assign individuals to be in a cigarette smoking group? No.
    [Show full text]
  • Chapter 5 Overview of Study Designs
    Text book eng. Chap.5 final 27/05/02 9:14 Page 83 (PANTONE(Black/Process 313 CV Blackfilm) film) Chapter 5 Overview of study designs In epidemiology, measuring the occurrence of disease or other health- related events in a population is only a beginning. Epidemiologists are also interested in assessing whether an exposure is associated with a particular disease (or other outcome of interest). For instance, researchers may be interested in obtaining answers to the following questions: * Does a high-fat diet increase the risk of breast cancer? High-fat diet Breast cancer (exposure) (outcome) * Does hepatitis B virus infection increase the risk of liver cancer? Hepatitis B infection Liver cancer (exposure) (outcome) The first step in an epidemiological study is to define the hypothesis to be tested. This should include a precise definition of the exposure(s) and outcome(s) under study. The next step is to decide which study design will be the most appropriate to test that specific study hypothesis. 5.1 Types of study design There are two basic approaches to assessing whether an exposure is asso- ciated with a particular outcome: experimental and observational. The experimental approach is perhaps more familiar to clinicians, since it cor- responds to the approach used for investigations in laboratory-based research. In an experiment, investigators study the impact of varying some factor which they can control. For example, the investigators may take a litter of rats, and randomly select half of them to be exposed to a suppos- edly carcinogenic agent, then record the frequency with which cancer develops in each group.
    [Show full text]