FORMULAS from EPIDEMIOLOGY KEPT SIMPLE (3E) Chapter 3: Epidemiologic Measures

FORMULAS FROM EPIDEMIOLOGY KEPT SIMPLE (3e) Chapter 3: Epidemiologic Measures Basic epidemiologic measures used to quantify: • frequency of occurrence • the effect of an exposure • the potential impact of an intervention. Epidemiologic Measures Measures of disease Measures of Measures of potential frequency association impact (“Measures of Effect”) Incidence Prevalence Absolute measures of Relative measures of Attributable Fraction Attributable Fraction effect effect in exposed cases in the Population Incidence proportion Incidence rate Risk difference Risk Ratio (Cumulative (incidence density, (Incidence proportion (Incidence Incidence, Incidence hazard rate, person- difference) Proportion Ratio) Risk) time rate) Incidence odds Rate Difference Rate Ratio (Incidence density (Incidence density difference) ratio) Prevalence Odds Ratio Difference Macintosh HD:Users:buddygerstman:Dropbox:eks:formula_sheet.doc Page 1 of 7 3.1 Measures of Disease Frequency No. of onsets Incidence Proportion = No. at risk at beginning of follow-up • Also called risk, average risk, and cumulative incidence. • Can be measured in cohorts (closed populations) only. • Requires follow-up of individuals. No. of onsets Incidence Rate = ∑person-time • Also called incidence density and average hazard. • When disease is rare (incidence proportion < 5%), incidence rate ≈ incidence proportion. • In cohorts (closed populations), it is best to sum individual person-time longitudinally. It can also be estimated as Σperson-time ≈ (average population size) × (duration of follow-up). Actuarial adjustments may be needed when the disease outcome is not rare. • In an open populations, Σperson-time ≈ (average population size) × (duration of follow-up). Examples of incidence rates in open populations include: births Crude birth rate (per m) = × m mid-year population size deaths Crude mortality rate (per m) = × m mid-year population size deaths < 1 year of age Infant mortality rate (per m) = × m live births No. of cases Prevalence Proportion = No. of individuals in the study • Also called point prevalence or just prevalence. • The concept of period prevalence should be avoided when possible because it confuses the concepts of incidence and prevalence (Elandt-Johnson & Johnson, 1980). • Prevalence dependence on the “inflow” and “outflow” of disease according to this formula Prevalence ≈ (incidence rate) × (average duration of illness). Additional Notes • Terminology: The term “rate” is often used loosely, to refer to any of the above measures of disease frequency (even though the only true rate is the incidence density rate • Odds: Both prevalence and incidence proportions may be addressed in terms of odds. Let p represent the incidence proportion or prevalence proportion of disease and o represent the odds of disease. Thus, odds o = p / (1 – p). • Reporting: To report a risk or rate “per m,” simply multiply it by m. For example, an incidence proportion of 0.0010 = 0.0010 × 10,000 = 10 per 10,000. • Uni-cohort: To report a risk or rate as a unicohort, take its reciprocal and report it as 1 in 1 “unicohort.” For example, an incidence proportion of 0.0025 = 1 in or “1 in 400.” 0.0025 Macintosh HD:Users:buddygerstman:Dropbox:eks:formula_sheet.doc Page 2 of 7 3.2 Measures of Association (Measures of Effect) Notation and terminology: Concepts apply to incidence proportions, incidence rates, and prevalence proportions, all of which will be loosely called “rates.” Let R1 represent the rate or risk of disease in the exposed group and let R0 represent the rate or risk of disease in the non-exposed group. Absolute Measure of Effect (Rate Difference) RD = R1 − R0 Relative Measure of Effect (Rate Ratio) R RR = 1 R0 The relative effect of an exposure can also captured by the SMR (see section on Rate Adjustment) 2-by-2 Cross-Tabulation D+ D− Total E+ (Group 1) A1 B1 N1 E− (Group 0) A0 B0 N0 M1 M0 N • For person-time data (incidence rates/densities) ignore cells B1 and B0 and let N1 and N0 represent the person-time in group 1 and group 0, respectively. A1 A0 AN11/ • Rates, Rate Ratio, and Rate Difference: R1 = , R0 = , RR = , and N1 N0 AN00/ RD = (AN11/)(/)− AN 0 0 (cohort and cross-sectional data) AB • Odds ratio: OR = 10(independent samples only; for matched-pairs and tuples data, see text) AB01 • Rounding: Basic measures should be reported with 2 or 3 significant digit accuracy. Carry 4 or 5 significant digits to derive a final answer that is accurate to 2 or 3 significant digits, respectively. 3.3 Measures of Potential Impact RR− RR −1 • The attributable fraction in exposed cases AF = 10, or equivalently, AF = . e e RR R1 RR− • The attributable fraction in the population AF = 0 p R Equivalent, AFpec= AF× p where pc represents proportion of population cases that are exposed Macintosh HD:Users:buddygerstman:Dropbox:eks:formula_sheet.doc Page 3 of 7 3.4 Rate Adjustment (“Standardization”) For uniformity of language, the term rate will be used to refer to any incidence or prevalence measure. Direct Standardization The directly adjusted rate (aRdirect) is a weighted average of strata-specific rates with weights derived from a reference population: aRdirect = ∑wiri N where w = i i N ∑ i Ni represents the size of strata i of the reference population ri represents rate in strata i of the study population. Note that capital letters denote values that come from the reference population and lower case letters denote values the come from the study population. Indirect Standardization Indirect standardization is based on the Standardized Mortality Ratio (SMR) Observed SMR = Expected where “Observed” is the observed number of cases and “Expected” is the expected number of cases in the population based on this formula: Expected R n where Ri represents the rate in strata i of the reference population = ∑ i i and ni represents the number of people strata i of the study population. The Expected in the population can be understood in terms of the expected number of cases within strata i, which is: Expected R n . Thus: Expected Expected . i = i i = ∑ i The SMR is a population-based relative risk estimate in which “1” represents a population in which the observed rate equals the expected rate. Optional: Use the SMR to derive the indirectly adjusted rate via this formula: aRindirect = (crude rate) × SMR Macintosh HD:Users:buddygerstman:Dropbox:eks:formula_sheet.doc Page 4 of 7 Chapter 10: Screening for Disease Reproducibility (Agreement) Rater B Rater A + − + a b g1 − c d g2 f1 f2 N pp ad+ fg11+ fg 2 2 obs− exp pobs = pexp = 2 κ = N N 1− pexp Validity (Sensitivity, Specificity, PVPT, PVNT) Disease + Disease − Total Test + TP FP TP + FP (those who test positive) Test − FN TN FN + TN (those who test negative) Total TP + FN FP + TN N (those with disease) (those w/out disease) SEN = (TP) / (those with disease) [note: TP = (SEN)(TP + FN)] = (TP) / (TP + FN) SPEC = (TN) / (those without disease) [note: TN = (SPEC)(FP + TN)] = (TN) / (TN + FP) PVP = (TP) / (those who test positive) = (TP) / (TP + FP) PVN = (TN) / (those who test negative) = (TN) / (TN + FN) True prevalence = (TP + FN) / N [also known as “prior probability”] Bayesian equivalents for PVP and PVN are presented in the text. Macintosh HD:Users:buddygerstman:Dropbox:eks:formula_sheet.doc Page 5 of 7 TEN COMMANDMENTS FOR DEALING WITH CONFOUNDING Source: EPIB-601 McGill University, Montreal, Canada [email protected], http://www.teachepi.org/documents/courses/Ten%20Commandments%20for%20Dealing%20with%20Con founding.pdf I. Always worry about confounding in your research, especially at the design/protocol stage. Try to use design elements (e.g. randomization) that will help reduce potential confounding. II. Prior to the study, review the literature and consider the underlying causal mechanisms (e.g. draw causal diagrams such as directed acyclic graphs [DAGs]). Then make sure you collect data on all potential confounders; otherwise you will not be able to adjust for them in your analyses. III. Know your field or collaborate with an expert who does! Subject-matter knowledge is important to recognize (e.g. draw causal diagrams) and adjust for confounding. IV. Use a priori and data-based methods to check if the potential confounders are indeed confounders that should be adjusted for. V. Use stratified analyses and multivariable methods to handle confounding at the analysis stage. Choose the multivariate model that best suits the type of data (e.g. dichotomous vs. continuous) you collected and the design you employed (e.g. case- control vs. cohort). VI. Do not adjust for covariates that may be intermediate causes (on the causal pathway between the exposure and disease). Do not adjust for covariates that may not be genuine confounders. And beware of time-varying covariates that will need special approaches. VII. Use matching with great caution. Use analytic methods that are appropriate for the design used; for example, if matching was done, use methods that take matching into account (e.g. conditional logistic regression, matched pairs analyses). VIII. Always consider effect measure modification, but perform and interpret subgroup analyses with caution. The subgroup analysis should be one of a small number of hypotheses tested, and the hypothesis should precede rather than follow the analysis (i.e. subgroups must be pre-specified). IX. Always remember that adjustment for confounding can be inadequate due to residual confounding because of unmeasured confounders, misclassification of confounders, and inadequate adjustment procedures (e.g. model misspecification, categorization of continuous covariates). X. If conventional methods prove to be inadequate, consider using newer approaches such as propensity scores, matched sampling, instrumental variables and marginal Macintosh HD:Users:buddygerstman:Dropbox:eks:formula_sheet.doc Page 6 of 7 structural models. However, make sure you work with statisticians who understand these new methods (not many do). When all else fails, pray! If prayer fails, consider changing professions!! Macintosh HD:Users:buddygerstman:Dropbox:eks:formula_sheet.doc Page 7 of 7 .

FORMULAS from EPIDEMIOLOGY KEPT SIMPLE (3E) Chapter 3: Epidemiologic Measures

Descriptive Statistics (Part 2): Interpreting Study Results

Meta-Analysis of Proportions

Von Mises' Frequentist Approach to Probability

Autocorrelation and Frequency-Resolved Optical Gating Measurements Based on the Third Harmonic Generation in a Gaseous Medium

FDA Oncology Experience in Innovative Adaptive Trial Designs

Incidence and Secondary Transmission of SARS-Cov-2 Infections in Schools

Statistical Guidance on Reporting Results from Studies Evaluating Diagnostic Tests Document Issued On: March 13, 2007

Estimated HIV Incidence and Prevalence in the United States

CHAPTER 3 Comparing Disease Frequencies

Fundamental Frequency Detection

Understanding Relative Risk, Odds Ratio, and Related Terms: As Simple As It Can Get Chittaranjan Andrade, MD

Outcome Reporting Bias in COVID-19 Mrna Vaccine Clinical Trials