Case-Control Studies Learning Objectives

Total Page:16

File Type:pdf, Size:1020Kb

Case-Control Studies Learning Objectives Case-control studies • Overview of different types of studies • Review of general procedures • Sampling of controls – implications for measures of association – implications for bias • Logistic regression modeling Learning Objectives • To understand how the type of control sampling relates to the measures of association that can be estimated • To understand the differences between the nested case-control study and the case-cohort design and the advantages and disadvantages of these designs • To understand the basic procedures for logistic regression modeling Overview of types of case-control studies No designated cohort, Within a designated cohort but should treat source population as cohort Nested case control Case-cohort Cases only Cases and controls 1) Type of sampling - incidence density Case-crossover - cumulative “epidemic” 2) Source of controls - population-based - hospital - neighborhood - friends -family 1 Review of General Procedures Obtaining cases Obtaining controls 1) Define target cases 1) Define target controls 2) Identify potential cases 2) Define mechanism for 3) Confirm diagnosis identifying controls 4) *Obtain physician’s consent 3) Contact control 5) Contact case 4) Confirm control’s eligibility 6) Confirm case’s eligibility 5) *Obtain control’s consent 7) *Obtain case’s consent 6) Obtain exposure data 8) Obtain exposure data *need to account for nonresponse Pros/Cons of Different Methods Collection of information In-person Telephone Mail - hospital - clinic - home Case cohort study T0 Additional cases, Assemble compare all cases to the cohort, sample First case subcohort selected subcohort to be at T0 used in all future analyses 2 Nested case-control Study T0 Assemble Likewise, select controls cohort again at each separate time point; First case; note: each of these randomly cases was eligible to be a control for the select control first case from remaining cohort Crossover designs Period the subject is exposed Period the subject is unexposed • Case-crossover: variation of crossover; case has a pre-disease period which is used as the control period • Good method for control of confounding • May have limited applications • Assumes that neither exposure nor confounders are changing over time in a systematic way Source Population • Think of all case-control studies as nested within a cohort, even when the cohort is not designated • Source population is this underlying cohort • Source population describes the cohort giving rise to the cases; controls are also from this source population • Source population reflects the disease under study, difficulties in diagnosing disease, routine procedures for recording the disease occurrence, and the frequency of disease 3 Classic case-control Source population Cases Controls Sampling fraction (f1) Sampling fraction f2 Exposed Non-exposed Exposed Nonexposed (A) (C) (B) (D) Cases Controls Odds ratio = AD/BC Exposed A B If sampling: OR = f1*A*f2*D = AD Unexposed C D f1*C*f2*B BC Incidence Density Sampling 1) Collect information on each case 2) Collect control at the same time each case is observed; collect control from the underlying source population giving rise to the cases 3) Controls can be cases Cumulative-based sampling 1) Start after the event has happened 2) Ascertain cases 3) Collect controls from the noncases after event (e.g., epidemic) 4 Measures of Association • Key concept, sampling fraction, independent of exposure • Incidence density sampling/nested case-control studies – if exposure odds in controls(B/D) approximates person-time ratio for source population, odds ratio will approximate rate ratio – without rare disease assumption Measures of Association • Case cohort – if ratio B/D approximates overall prevalence of exposure in source population, odds ratio will approximate risk ratio – without rare disease assumption • Cumulative “Epidemic” case-control studies – odds ratio will approximate rate ratio if proportion diseased in each exposure group is low (< 20%) and remains steady during study period When is the rare disease assumption needed? • Cumulative-based sampling if want to approximate the relative risk • Otherwise – nested case-control and incidence density sampling will give an estimate of rate ratio without rare disease assumption – case-cohort will give an estimate of risk ratio without rare disease assumption • If disease is rare, all three measures will be very close 5 A closer look Recall: Note: in R&G notation A1 I T A T A1= A 1 1 1 o IR = = = Ao= C I A0 A T 0 0 1 B1= B T0 B0= D A1 A1 (1− A ) B A B OR = 1 = 1 = 1 o A A 0 0 A0 B1 (1− A0 ) B0 If sampling does not lead to bias and the sample can approximate the person-years of distribution, a case- control study is a more efficient design (i.e. for same number of people, more precision). However not as precise as if you use the full cohort. Pseudo Rates B B This assumes that the sampling f = 1 = 0 = r 2 T T rates are the same for the 1 0 exposed and unexposed A1 Pseudo-rate1 = B1 A0 if r is unknown : Pseudo-rate0 = B 0 A1 T1 A1 1 A1 A1 * If r is known then B B T T r T 1 = 1 1 = 1 = 1 multiplyby r to get rate A0 A T A0 1 A0 0 0 * B0 T r T A1 A1 B1 B0 T0 o o *r = = I1 B1 B1 T1 Pseudo Risks This assumes that the sampling B1 B0 rates are the same for the exposed f2 = = = f N1 N0 and unexposed A Pseudo- risk = 1 1 If f is unknown : B1 A0 A1 N1 A1 1 A1 A1 Pseudo- risk0 = * B B B N N f N 0 1 = 1 1 = 1 = 1 A0 A N A0 1 A0 If f is known then multiply 0 0 * B0 N f N by f to get incidence proportion B0 N0 0 0 A1 A1 B1 * f = = R1 B1 B1 N1 6 Summary • Odds Ratio • Odds Ratio approximates Rate approximates Risk Ratio Ratio – incidence density sampling – case-cohort design – nested case- control studies – cumulative – cumulative sampling if sampling is disease is rare proportion exposed is steady and relatively low Source of Controls population-based: not necessarily equal probability, control selection probability is proportional to the individual’s person time at risk neighborhood: need to think about referral patterns (e.g., not good for veteran’s hosp); also need to worry about overmatching hospital: need to be especially concerned that sampling was independent of exposure; try to use a variety of diagnoses for controls. friends: main problem is with overmatching; cedes the control selection to the case rather than to the investigator family:may be worthwhile design to control for certain variable (e.g., spouse control, environment, twin control genetics and environment) Review of control selection • Select from source population • Select independent of exposure status • Probability of selection should be proportion to amount s/he would have contributed to person-time in the denominator • Not eligible to be a control, if during same time would not have been eligible to be a case 7 Ille-et-Vilaine study (epidat1.txt) • conducted between January 1972 and April 1974 • French department of Ille-et-Vilaine (Brittany) • Men diagnosed in regional hospitals • Controls sampled from electoral lists in each commune of department Other methodological points • exposure opportunity – interest is in the fact of exposure – think about cohort design – make same exclusions to cases as to controls • comparability of information – may not want comparability if errors are not independent • number of different types of control groups – may be appropriate in some situations (e.g., spouse, siblings), but generally not recommended • prior disease history Regression Modeling Regression model- simpler function used to estimate the true regression function Benefit of Regression Modeling: overcome the numerical limitations of categorical analysis Cost of Regression Modeling: assumptions of model; invalid inferences if model is misspecified E(Y|X = x) Y = dependent variable, outcome variable, regressand X = set of independent variables, predictors, covariates, regressors 8 Logistic Regression Y When x = 1, log = α + β x 1−Y eα +β x α +β x α +β x e Y 1+ e α +β x Y = = R(x1) = = e α +β x 1−Y 1+ eα +β x eα +β x 1+ e − When x = 0, 1+ eα +β x 1+ eα +β x eα Y log | x =1 α +β x Y α 1−Y e = 1+ e = eα = log = β 1−Y 1+ eα eα Y eα − log | x = 0 1+ eα 1+ eα 1−Y R(x1) Logistic risk model, bounded by 0 and 1 Likelihood in logistic regression α+∑ β x α+∑ β x e i i e i i (1− ) ∏ α+∑ βixi ∏ α+∑ βixi 1+ e 1+ e Cases Controls Interpreting Coefficients Y log( ) = α + β x 1−Y if X = 1 if exposed, 0 if unexposed Y log( ) | x =1 α + β 1−Y = = β Y log( ) | x = 0 α 1−Y OR = eβ If X is continuous, B represents the impact of a one-unit change in X to Y, the exponential of B will give you the OR for a one unit change Note: both modeling variables as either ordered categorical variables or continuous variables assumes the one unit change in X is the same irrespective of the level of X 9 Logistic Modeling Linear Model Logistic Model Log ( Y) Y $ (1-Y) $ " " Not bounded by 0 and 1 Y = Probability of Disease Y = Odds of Disease 1- Y Y log( ) = Log odds of Disease 1- Y Assessing Linearity • Create categories of continuous variable; categories should represent the same amount of units (e.g., 0-39 grams, 40-79 grams, 80-119, etc) • Plot Beta coefficients; if pattern is approximately linear, keep variable as a continuous variable in model Logistic Model Log ( Y) * * (1-Y) * * * * * * $1 $2 $3 $4 10 Creating Indicator Variables Also referred to as categorical regressors, dummy variables Need to pick a reference level Number of indicator variables created = # categories - 1 The intercept
Recommended publications
  • Dictionary of Epidemiology, 5Th Edition
    A Dictionary of Epidemiology This page intentionally left blank A Dictionary ofof Epidemiology Fifth Edition Edited for the International Epidemiological Association by Miquel Porta Professor of Preventive Medicine & Public Health School of Medicine, Universitat Autònoma de Barcelona Senior Scientist, Institut Municipal d’Investigació Mèdica Barcelona, Spain Adjunct Professor of Epidemiology, School of Public Health University of North Carolina at Chapel Hill Associate Editors Sander Greenland John M. Last 1 2008 1 Oxford University Press, Inc., publishes works that further Oxford University’s objective excellence in research, scholarship, and education Oxford New York Auckland Cape Town Dar es Salaam Hong Kong Karachi Kuala Lumpur Madrid Melbourne Mexico City Nairobi Shanghai Taipei Toronto With offi ces in Argentina Austria Brazil Chile Czech Republic France Greece Guatemala Hungary Italy Japan Poland Portugal Singapore South Korea Switzerland Thailand Turkey Ukraine Vietnam Copyright © 1983, 1988, 1995, 2001, 2008 International Epidemiological Association, Inc. Published by Oxford University Press, Inc. 198 Madison Avenue, New York, New York 10016 www.oup-usa.org All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior permission of Oxford University Press. This edition was prepared with support from Esteve Foundation (Barcelona, Catalonia, Spain) (http://www.esteve.org) Library of Congress Cataloging-in-Publication Data A dictionary of epidemiology / edited for the International Epidemiological Association by Miquel Porta; associate editors, John M. Last . [et al.].—5th ed. p. cm. Includes bibliographical references and index. ISBN 978–0-19–531449–6 ISBN 978–0-19–531450–2 (pbk.) 1.
    [Show full text]
  • The Diagnostic Interval of Colorectal Cancer Patients in Ontario by Degree of Rurality
    THE DIAGNOSTIC INTERVAL OF COLORECTAL CANCER PATIENTS IN ONTARIO BY DEGREE OF RURALITY by Leah Hamilton A thesis submitted to the Graduate Program in the Department of Public Health Sciences in conformity with the requirements for the Degree of Master of Science Queen’s University Kingston, Ontario, Canada (September, 2015) Copyright ©Leah Hamilton, 2015 Abstract Background: Wait times while moving through the cancer diagnostic process are a public health concern. Rural populations may experience more challenges in accessing cancer care, which could translate into a longer diagnostic interval and represent a healthcare inequity. This project analyzed the association between rurality of residence and the diagnostic interval of colorectal cancer (CRC) patients in Ontario, Canada. Methods: This was a retrospective population-based cohort study. We used administrative databases available through the Institute for Clinical Evaluative Sciences (ICES) to identify incident CRC cases diagnosed from Jan 1, 2007- May 31, 2012. We assigned each patient a rurality score, based on their census subdivision, and calculated the length of their diagnostic interval. We defined the diagnostic interval as the time (in days) between a patient’s first diagnostic-related encounter with the health care system to the CRC diagnosis date. Data linkage through ICES allowed us to describe variations in cancer stage and the diagnostic interval by degree of rurality of patient residence and to analyze associations through multivariable models taking into account potential confounders. Results: Overall, the median diagnostic interval of the CRC cohort was 64 (IQR: 22-159) days and the 90th percentile was 288 days. Patients with stage I CRC had a longer median diagnostic interval than patients with stage IV CRC.
    [Show full text]
  • Estimating Risk and Rate Levels, Ratios and Differences in Case-Control
    STATISTICS IN MEDICINE Statist. Med. 2002; 21:1409–1427 (DOI: 10.1002/sim.1032) Estimating risk and rate levels, ratios and dierences in case-control studies Gary King1;∗;†;‡ and Langche Zeng2 1Department of Government; Harvard University and Global Programme on Evidence for Health Policy; World Health Organization (Center for Basic Research in the Social Sciences; 34 Kirkland Street; Harvard University; Cambridge; MA 02138; U.S.A.) 2Department of Political Science; George Washington University; 2201 G Street NW; Washington; DC 20052; U.S.A. SUMMARY Classic (or ‘cumulative’) case-control sampling designs do not admit inferences about quantities of interest other than risk ratios, and then only by making the rare events assumption. Probabilities, risk dierences and other quantities cannot be computed without knowledge of the population incidence fraction. Similarly, density (or ‘risk set’) case-control sampling designs do not allow inferences about quantities other than the rate ratio. Rates, rate dierences, cumulative rates, risks, and other quantities cannot be estimated unless auxiliary information about the underlying cohort such as the number of controls in each full risk set is available. Most scholars who have considered the issue recommend reporting more than just risk and rate ratios, but auxiliary population information needed to do this is not usually available. We address this problem by developing methods that allow valid inferences about all relevant quantities of interest from either type of case-control study when completely ignorant of or only partially knowledgeable about relevant auxiliary population information. Copyright ? 2002 John Wiley & Sons, Ltd. KEY WORDS: statistics; data interpretation; logistic models; case-control studies; relative risk; odds ratio; risk ratio; risk dierence; hazard rate; rate ratio; rate dierence 1.
    [Show full text]
  • Case–Control Studies: Basic Concepts
    This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/ by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. Published by Oxford University Press on behalf of the International Epidemiological Association International Journal of Epidemiology 2012;41:1480–1489 ß The Author 2012; all rights reserved. doi:10.1093/ije/dys147 Case–control studies: basic concepts Jan P Vandenbroucke1* and Neil Pearce2,3 1Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands, 2Faculty of Epidemiology and Population Health, Departments of Medical Statistics and Non-communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, UK and 3Centre for Public Health Research, Massey University, Wellington, New Zealand *Corresponding author. Department of Clinical Epidemiology, Leiden University Medical Center, PO Box 9600, 2300 RC Leiden, The Netherlands. E-mail: [email protected] Accepted 7 August 2012 Downloaded from The purpose of this article is to present in elementary mathematical and statistical terms a simple way to quickly and effectively teach and understand case–control studies, as they are commonly done in dy- namic populations—without using the rare disease assumption. Our http://ije.oxfordjournals.org/ focus is on case–control studies of disease incidence (‘incident case– control studies’);
    [Show full text]
  • Costs and Benefits of Environmental Data in Investigations of Gene
    Costs and Benefits of Environmental Data in Investigations of Gene-Disease Associations by Hao Luo B.Sc., Nanjing University, 2010 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF SCIENCE in The Faculty of Graduate Studies (Statistics) THE UNIVERSITY OF BRITISH COLUMBIA (Vancouver) August 2012 c Hao Luo, 2012 Abstract The inclusion of environmental exposure data may be beneficial, in terms of statistical power, to investigation of gene-disease association when it exists. However, resources invested in obtaining exposure data could instead be applied to measure disease status and genotype on more subjects. In a cohort study setting, we consider the tradeoff between measuring only disease status and genotype for a larger study sample and measuring disease status, genotype, and environmen- tal exposure for a smaller study sample, under the ‘Mendelian randomization’ as- sumption that the environmental exposure is independent of genotype in the study population. We focus on the power of tests for gene-disease association, applied in situations where a gene modifies risk of disease due to particular exposure without a main effect of gene on disease. Our results are equally applicable to exploratory genome-wide association studies and more hypothesis-driven candidate gene in- vestigations. We further consider the impact of misclassification for environmental exposures. We find that under a wide range of circumstances research resources should be allocated to genotyping larger groups of individuals, to achieve a higher power for detecting presence of gene-environment interactions by studying gene- disease association. ii Table of Contents Abstract . ii Table of Contents .
    [Show full text]
  • Case-Control Studies
    2012 ISPE Mid-Year Meeting Introduction to Pharmacoepidemiology April 22, 2012 Case-Control Studies Tobias Gerhard, PhD Assistant Professor, Ernest Mario School of Pharmacy Institute for Health, Health Care Policy, and Aging Research Rutgers University Conflict of Interest • The views and opinions expressed in this presentation are solely mine and do not represent the position or opinion of ISPE or any other institution. • I have no conflicts of interest to declare. Outline • Overview and general principles • Measures of association • Variants of the case-control design – Incidence density case-control studies – Case-cohort studies – Cumulative case-control studies • Final thoughts and take-home points Epidemiologic Study Designs Preface • Case-control studies are difficult to understand • Many misconceptions prevail • If this is new to you, don’t be dissappointed if you don’t follow-everything the first time Case-Control Studies Definition The observational epidemiologic study of persons with the disease of interest and a suitable control group of persons without the disease. The relationship of an attribute to the disease is examined by comparing the diseased and non-diseased with regard to how frequently the atribute is present. John M. Last, Dictionary of Epidemiology Cohort Studies Design 2012 2011 2010 2009 2008 Group A Group B Exposed to drug Not exposed to drug DISEASE FREQUENCY IN EXPOSED VS. UNEXPOSED Case-Control Studies Design Cases Controls 2012 2011 2010 2009 2008 FREQUENCY OF DRUG EXPOSURE IN CASES VS. CONTROLS Case-Control Studies Conduct 1. Define the source population for the study (hypothetical study population in which a cohort might have been conducted).
    [Show full text]
  • Case–Control Studies: Basic Concepts
    This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/ by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. Published by Oxford University Press on behalf of the International Epidemiological Association International Journal of Epidemiology 2012;41:1480–1489 ß The Author 2012; all rights reserved. doi:10.1093/ije/dys147 Case–control studies: basic concepts Jan P Vandenbroucke1* and Neil Pearce2,3 1Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands, 2Faculty of Epidemiology and Population Health, Departments of Medical Statistics and Non-communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, UK and 3Centre for Public Health Research, Massey University, Wellington, New Zealand *Corresponding author. Department of Clinical Epidemiology, Leiden University Medical Center, PO Box 9600, 2300 RC Leiden, The Netherlands. E-mail: [email protected] Accepted 7 August 2012 Downloaded from The purpose of this article is to present in elementary mathematical and statistical terms a simple way to quickly and effectively teach and understand case–control studies, as they are commonly done in dy- namic populations—without using the rare disease assumption. Our focus is on case–control studies of disease incidence (‘incident case– http://ije.oxfordjournals.org/ control studies’);
    [Show full text]
  • 8. Analytic Study Designs
    8. Analytic study designs The architecture of the various strategies for testing hypotheses through epidemiologic studies, a comparison of their relative strengths and weaknesses, and an in-depth investigation of major designs. Epidemiologic study designs In previous topics we investigated issues in defining disease and other health-related outcomes, in quantitating disease occurrence in populations, in relating disease rates to factors of interest, and in exploring and monitoring disease rates and relationships in populations. We have referred to cohort studies, cross-sectional, and case-control studies as the sources of the measures we examined, but the study designs themselves were secondary to our interest. In the present chapter we will define and compare various study designs and their usefulness for investigating relationships between an outcome and an exposure or study factor. We will then examine two designs – intervention trials and case-control studies – in greater depth. The study designs discussed in this chapter are called analytic because they are generally (not always) employed to test one or more specific hypotheses, typically whether an exposure is a risk factor for a disease or an intervention is effective in preventing or curing disease (or any other occurrence or condition of interest). Of course, data obtained in an analytic study can also be explored in a descriptive mode, and data obtained in a descriptive study can be analyzed to test hypotheses. Thus, the distinction between "descriptive" and "analytic" studies is one of intent, objective, and approach, rather than one of design. Moreover, the usefulness of the distinction is being eroded by a broad consensus (dogma?) in favor of testing hypotheses.
    [Show full text]
  • Workingsci1.Qk
    Research Commentary Effect Measures in Prevalence Studies Neil Pearce Centre for Public Health Research, Massey University Wellington Campus, Wellington, New Zealand commonly used effect measure is the risk There is still considerable confusion and debate about the appropriate methods for analyzing preva- ratio, which is the ratio of the incidence pro- lence studies, and a number of recent papers have argued that prevalence ratios are the preferred portion in the exposed group (a/N1) to that in method and that prevalence odds ratios should not be used. These arguments assert that the preva- the nonexposed group (b/N0). In this example, lence ratio is obviously the better measure and the odds ratio is “unintelligible.” They have often the risk ratio is 0.1813/0.0952 = 1.90. A third been accompanied by demonstrations that when a disease is common the prevalence ratio and the possible effect measure is the incidence odds prevalence odds ratio may differ substantially. However, this does not tell us which measure is the ratio, which is the ratio of the incidence odds more valid to use. In fact, the prevalence odds ratio a) estimates the incidence rate ratio with fewer in the exposed group (a/c) to that in the non- assumptions than are required for the prevalence ratio; b) can be estimated using the same methods exposed group (b/d). In this example, the odds as for the odds ratio in case–control studies, namely, the Mantel–Haenszel method and logistic ratio is 0.2214/0.1052 = 2.11. regression; and c) provides practical, analytical, and theoretical consistency between analyses of a These three multiplicative effect measures prevalence study and prevalence case–control analyses based on the same study population.
    [Show full text]
  • Case-Control Studies
    Case-control studies Madhukar Pai, MD, PhD McGill University Montreal [email protected] 1 Introduction to case control designs Introduction The case control (case-referent) design is really an efficient sampling technique for measuring exposure- disease associations in a cohort that is being followed up or “study base” All case-control studies are done within some cohort (defined or not) In reality, the distinction between cohort and case-control designs is artificial Ideally, cases and controls must represent a fair sample of the underlying cohort or study base Toughest part of case-control design: defining the study base and selecting controls who represent the base 2 New cases occur in a study base Study base is the aggregate of population-time in a defined study Population’s movement over a defined span of time [OS Miettinen, 2007] Cohort and case- control designs differ in the way cases and the study base are sampled to estimate the incidence density ratio 3 Morgenstern IJE 1980 Cohort design Total cohort (total person-time) New cases / exposed person-time IDR = Iexp / Iunexp The IDR is directly estimated by dividing the incidence New cases / unexposed person-time density in the exposed person-time by the incidence 4 density in the unexposed person-time Case-control design: a more efficient way of estimating the IDR Obtain a fair sample of * ** ** the cases (“case series”) that occur in the study ** *** base; estimate odds of *** * * exposure in case series Obtain a fair sample of the //////// study base itself (“base series”); estimate the odds //////// of exposure in the study //////// base [which is a reflection of baseline exposure in the population] The exposure odds in the case series is divided by the OR = IDR exposure odds in the base series to estimate the OR, which will approximate the IDR 5 Three principles for a case-control study (Wacholder et al.
    [Show full text]
  • Kristen Ingrid Mcmartin
    PREGNANCY OUTCOME FOLLOWING MATERNAL ORGANIC SOLVENT EXPOSURE: A META-ANALYSE OF EPIDEMIOLOGICAL STUDIES Kristen Ingrid McMartin A thesis submitted in conformity with the requirements for the degree of M.Sc. Graduate Department of Pharmacology University of Toronto O Copyright by Kristen Ingrid McMartin 1997 National Library Bibliothèque nationale 1+1 of Canada du Canada Acquisitions and Acquisitions et Bibliographic Services services bibliographiques 395 Wellington Street 395. nie Wellington Ottawa ON K1A ON4 Ottawa ON KIA ON4 Canada Canada The author has granted a non- L'auteur a accordé une licence non exclusive licence allowing the exclusive permettant à la National Library of Canada to Bibliothèque nationale du Canada de reproduce, loan, distribute or seU reproduire, prêter, distribuer ou copies of this thesis in microform, vendre des copies de cette thèse sous paper or electronic formats. la forme de microfiche/nlm, de reproduction sur papier ou sur format électronique. The author retains ownership of the L'auteur conserve la propriété du copyright in this thesis. Neither the droit d'auteur qui protège cette thèse. thesis nor substantid extracts fkom it Ni la thèse ni des extraits substantiels may be printed or otberwise de celle-ci ne doivent être imprimés reproduced without the author's ou autrement reproduits sans son permission. autorisation. Preenancv Outcome Followine Materna1 Oreanic Solvent Ex~osure: A Meta-Analvsis of E~idemiologicalStudies Kristen Ingrid McMartin (M-SC.- 1997) Department of Pharmacology University of Toronto Evidence of fetal damage or demise from organic solvent levels that are not toxic to the pregnant woman is inconsistent in the medical literature.
    [Show full text]
  • The Rare Disease Assumption: the Good, the Bad, and the Ugly
    The Rare Disease Assumption: The Good, The Bad, and The Ugly A Thesis Presented in Partial Fulfillment of the Requirements for the Degree Master of Science in the Graduate School of The Ohio State University By Matthew Brems, B.A. Graduate Program in Statistics The Ohio State University 2015 Master's Examination Committee: Dr. Shili Lin, Advisor Dr. Eloise Kaizar c Copyright by Matthew Brems 2015 Abstract In epidemiologic studies, researchers often calculate the relative risk or the odds ratio in order to quantify associations between a disease and a covariate. For decades, epidemiologists have utilized the \rare disease assumption" in order to simplify calculations by treating the odds ratio and the relative risk as approximately equal. In genetic epidemiology, the rare disease assumption is frequently made under various genetic condi- tions. However, in the context of genetic epidemiology, this assumption may merely be a necessary, but not a sufficient, condition. Thus rely- ing on the rare disease assumption may lead to unintended consequences including, but not limited to, misrepresented power levels, inflated Type I error rates, and increased bias of estimators. I explore various genetic scenarios through case studies toward the goal of defining circumstances under which the rare disease assumption is sufficient and circumstances under which the rare disease assumption is merely necessary. These case studies show that the rare disease assumption can sometimes leads to good results, sometimes to bad results, and sometimes to ugly results. In particular, I explore three problems that are split into three sections based on the articles which inspired the discussion of these problems.
    [Show full text]