A Joint Spatio-Temporal Model of Opioid Associated Deaths and Treatment Admissions in Ohio

Total Page:16

File Type:pdf, Size:1020Kb

A Joint Spatio-Temporal Model of Opioid Associated Deaths and Treatment Admissions in Ohio A JOINT SPATIO-TEMPORAL MODEL OF OPIOID ASSOCIATED DEATHS AND TREATMENT ADMISSIONS IN OHIO BY YIXUAN JI A Thesis Submitted to the Graduate Faculty of WAKE FOREST UNIVERSITY GRADUATE SCHOOL OF ARTS AND SCIENCES in Partial Fulfillment of the Requirements for the Degree of MASTER OF ARTS Mathematics and Statistics May 2019 Winston-Salem, North Carolina Approved By: Staci Hepler, Ph.D., Advisor Miaohua Jiang, Ph.D., Chair Robert Erhardt, Ph.D. Acknowledgments First and foremost, I would like to express my sincere gratitude and appreciation to my advisor, Dr. Staci A. Hepler. Without your continuous support and inspiration, I could not have achieved this thesis research. Thank you so much for your patience, motivation, enthusiasm, and immense knowledge, which made this research even more enjoyable. In addition to my advisor, I would also like to thank Dr. Miaohua Jiang and Dr. Robert Erhardt, for the constant encouragement and advice, and for serving on my thesis committee. Furthermore, this thesis benefited greatly from various courses that I took, including Bayesian statistics from Dr. Hepler, factor analysis and Markov Chain from Dr. Berenhaut, poisson linear model from Dr. Erhardt's Generalized Linear Model class, R programming skills from Dr. Nicole Dalzell and also reasoning skills from Dr. John Gemmer's Real Analysis class. Additionally, I would like to extend my thanks to Dr. Ellen Kirkman, Dr. Stephen Robinson and Dr. Sarah Raynor for their special attention and support to me, as an international student. Last but not least, I would like to thank my family for their sincere mental support throughout my life. Moreover, I appreciate the help and companionship from all my fellow graduate students, faculty and staff in the Mathematics and Statistics Department. ii Table of Contents Acknowledgments . ii Abstract . v List of Figures . vi List of Tables . vii Chapter 1 Introduction . 1 1.1 Opioid Misuse in Ohio . .1 1.2 Data Collection . .2 1.3 Overview Of Research . .7 Chapter 2 Background Information. 10 2.1 Introduction to Bayesian Statistics . 10 2.2 Markov Chain Monte Carlo . 13 2.2.1 Metropolis-Hastings Algorithm . 15 2.2.2 Gibbs Sampler . 16 2.3 Spatial Statistics . 18 2.3.1 CAR Model . 19 2.3.2 Spatio-temporal . 20 Chapter 3 Models . 22 3.1 Models . 22 3.2 Priors and Computation . 24 3.3 Model Selection . 27 Chapter 4 Conclusions . 29 4.1 Model Comparison . 29 4.2 Effects on Death Rates . 32 4.3 Effects on Treatment Rates . 33 4.4 Spatial Factors . 34 4.5 Scaled Loadings . 37 4.6 Independent Error Term . 39 iii Chapter 5 Discussion. 41 5.1 Censored Observations . 41 5.2 Covariates . 42 Bibliography . 45 Appendix A Description of covariates . 51 Appendix B Trace Plots . 53 B.0.1 R code . 53 B.0.2 Trace Plots . 53 Appendix C Relevant R code . 56 C.0.1 Model . 56 C.0.2 Map of Ohio . 61 Curriculum Vitae . 71 iv Abstract Opioid misuse is a major public health issue in the United States and in 2017, Ohio had the second highest age-adjusted drug overdose rate. In this thesis, we consider a joint spatio-temporal model of county-level surveillance data on deaths and treatment admissions using a latent spatial factor model. Our main goal is to estimate a common spatial factor, which offers a summary of the underlying joint burden of opioid misuse across space and time. The result is supposed to provide a valuable tool to allocate resources across the state in a timely manner. keywords: opioid epidemic, factor model, conditional autoregressive, spatio-temporal, resource allocation v List of Figures 1.1 Observed death rates . .4 1.2 Observed treatment rates . .6 1.3 Observed adolescent poverty . .7 4.1 Estimated variance ratio . 31 4.2 Posterior mean rate for death and treatment . 32 4.3 Diagram of the joint spatio-temporal model . 35 4.4 Spatial Factors . 36 4.5 Spatial Loadings . 38 4.6 Error Term D ............................... 39 4.7 Error Term T ............................... 40 B.1 Trace Plot . 55 vi List of Tables 1.1 Diagnostic Codes used to define treatment . .5 4.1 Posterior mean of intercepts β1, β3 and slopes β2, β4 with marginal coefficients eβ2 , eβ4 for death rates and treatment rates respectively. 34 4.2 Posterior standard deviation of intercepts β1, β3 and slopes β2, β4 for death rates and treatment rates respectively. 34 A.1 Covariates . 52 vii Chapter 1: Introduction 1.1 Opioid Misuse in Ohio Poisoning is one of the main causes of unintentional injury deaths in the United States, which are only second to motor vehicle crashes [9]. Almost all poisoning deaths result from drugs, and most drug poisonings are attributed to the abuse of prescription and illegal drugs. Overdose death rates varied according to the different types of drugs. Cocaine and heroin used to be the principal reason. However, deaths related to opoiods began increasing and accounted for a large number of deaths in the beginning of the 21st century [45]. Of note, concomitant use of benzodiaz-epines is implicated in the majority of prescription opioid-related overdose deaths [21]. Opioids are widely used for managing pain from acute injuries, surgeries, and advanced cancer. However, opioid misuse in some circumstances is associated with an elevated risk of early death, legal problems, and infectious diseases such as hepatitis C and human immunodeficiency virus (HIV). Accidental drug overdose, suicide, trauma and human immunodeficiency virus are attributed to the risk of premature mortality related to opioid overdose [11]. Opioid abuse is currently a national public epidemic and conditions on opioid misuse and abuse continue to deteriorate the United States. Opioid-related drug overdoses account for over 60% of drug overdose deaths within the United States, representing a fourfold increase within the past 15 years [7]. Treatment admissions for prescription opioid dependence rised more than fivefold since 2000 [35]. In particular, the drug overdose death rate in Ohio is above the national average. Opioid overdose deaths rearched 42,249 (66.4%) in 2016 (13.3 per 100,000 population), with a 27.9% rate increase from 2015 [37]. Of note, Ohio ranked top five among all states with the highest opioid-related overdose deaths. Ohio had the third highest 1 opioid overdose death rate (32.9 deaths per 100,000) in the country in 2016, which increased from 5.9 deaths per 100,000 in 2009 [14]. Fentanyl is an opioid used as a pain medication. However, it also emerged as a unprecedented threat to public health [17]. There have been numerous reports of inadvertent overuse, intentional misuse, and outbreaks of overdoses with fentanyl. In 2017, fentanyl and related drugs like carfentanil were involved in 70.7 percent of overdose deaths in Ohio; while fentanyl was involved involved in 58.2 percent in 2016, 37.9 percent in 2015, and 19.9 percent in 2014 [28]. The highest rates of affected areas were observed in the Southwestern portion of Ohio [26]. In this thesis, our overall goal is to quantify the burden of the opioid epidemic at the county level. Modeling illicit activity is usually accomplished through survey data. However, survey data at the county level is unavailable or sparse at best. Tra- ditional public health surveys of hard to reach and stigmatized populations are likely to be underreported and prohibitively expensive [29]. Complex designed surveys, like respondent driven sampling [18], are often conducted to deal with this problems but it is not common because of the time intensive nature of this design [31]. In Ohio, surveillance data on opioid associated deaths and treatment admissions are routinely collected at the county level. In this research, we develop a multivariate spatio- temporal factor model to jointly model the available county level surveillance data. The factor model framework synthesises the information in the multiple outcomes, providing an overall measure of the burden of the opioid epididemic at the county level [19]. 1.2 Data Collection In this thesis, we will use surveillance data collected by the state of Ohio routinely: opioid associated deaths and treatment admissions. We obtained these surveillance 2 data to conduct a joint spatio-temporal model of opioid associated deaths and treat- ment admissions for all of Ohio's 88 counties for the most recent available years, 2007- 2016. Death data were collected from the Ohio Public Health Data Warehouse Ohio Resident Mortality Data and these counts are publicly accessible from the Ohio De- partment of Health Website (http://publicapps.odh.ohio.gov/EDW/DataCatalog). All resident deaths where poisoning from any opiate is mentioned on the death cer- tification are included in the death counts. We count death for the county where the decedent resided in Ohio no matter place of death. We searched the underlying cause of death dataset for deaths with an International Classification of Diseases-10 (ICD) code T40.0-T40.4 and T40.6, defined as "poison from any opiate." Population estimates were collected from the Center's for Disease Control and Prevention (CDC) under the category of demographics. Observed county death rates per 100,000 res- idents for each county are shown in Figure 1.1. It is clear that higher death rate across the Southern Ohio from Figure 1.1. We also notice the clustered high death rates concentrated in the Southwestern of Ohio in the latest 5 years while it primarily centered in the Southeastern of Ohio from 2007 to 2011. 3 Figure 1.1: Observed death rates Treatment admission data were collected from the Ohio Department of Mental Health and Drug Addiction Services through a data use agreement. Treatment admis- sion were assessed according to the diagnostic codes listed in Table 1.1 and contained any residential, intensive outpatient, or outpatient treatment for opioid misuse.
Recommended publications
  • September 2010
    THE ISBA BULLETIN Vol. 17 No. 3 September 2010 The official bulletin of the International Society for Bayesian Analysis AMESSAGE FROM THE membership renewal (there will be special provi- PRESIDENT sions for members who hold multi-year member- ships). Peter Muller¨ The Bayesian Nonparametrics Section (IS- ISBA President, 2010 BA/BNP) is already up and running, and plan- [email protected] ning the 2011 BNP workshop. Please see the News from the World section in this Bulletin First some sad news. In August we lost two and our homepage (select “business” and “mee- big Bayesians. Julian Besag passed away on Au- tings”). gust 6, and Arnold Zellner passed away on Au- gust 11. Arnold was one of the founding ISBA ISBA/SBSS Educational Initiative. Jointly presidents and was instrumental to get Bayesian with ASA/SBSS (American Statistical Associa- started. Obituaries on this issue of the Analysis tion, Section on Bayesian Statistical Science) we Bulletin and on our homepage acknowledge Juli- launched a new joint educational initiative. The an and Arnold’s pathbreaking contributions and initiative formalized a long standing history of their impact on the lives of many people in our collaboration of ISBA and ASA/SBSS in related research community. They will be missed dearly! matters. Continued on page 2. ISBA Elections 2010. Please check out the elec- tion statements of the candidates for the upco- In this issue ming ISBA elections. We have an amazing slate ‰ A MESSAGE FROM THE BA EDITOR of candidates. Thanks to the nominating commit- *Page 2 tee, Mike West (chair), Renato Martins Assuncao,˜ Jennifer Hill, Beatrix Jones, Jaeyong Lee, Yasuhi- ‰ 2010 ISBA ELECTION *Page 3 ro Omori and Gareth Robert! ‰ SAVAGE AWARD AND MITCHELL PRIZE 2010 *Page 8 Sections.
    [Show full text]
  • IMS Bulletin 39(4)
    Volume 39 • Issue 4 IMS1935–2010 Bulletin May 2010 Meet the 2010 candidates Contents 1 IMS Elections 2–3 Members’ News: new ISI members; Adrian Raftery; It’s time for the 2010 IMS elections, and Richard Smith; we introduce this year’s nominees who are IMS Collections vol 5 standing for IMS President-Elect and for IMS Council. You can read all the candi- 4 IMS Election candidates dates’ statements, starting on page 4. 9 Amendments to This year there are also amendments Constitution and Bylaws to the Constitution and Bylaws to vote Letter to the Editor 11 on: they are listed The candidate for IMS President-Elect is Medallion Preview: Laurens on page 9. 13 Ruth Williams de Haan Voting is open until June 26, so 14 COPSS Fisher lecture: Bruce https://secure.imstat.org/secure/vote2010/vote2010.asp Lindsay please visit to cast your vote! 15 Rick’s Ramblings: March Madness 16 Terence’s Stuff: And ANOVA thing 17 IMS meetings 27 Other meetings 30 Employment Opportunities 31 International Calendar of Statistical Events The ten Council candidates, clockwise from top left, are: 35 Information for Advertisers Krzysztof Burdzy, Francisco Cribari-Neto, Arnoldo Frigessi, Peter Kim, Steve Lalley, Neal Madras, Gennady Samorodnitsky, Ingrid Van Keilegom, Yazhen Wang and Wing H Wong Abstract submission deadline extended to April 30 IMS Bulletin 2 . IMs Bulletin Volume 39 . Issue 4 Volume 39 • Issue 4 May 2010 IMS members’ news ISSN 1544-1881 International Statistical Institute elects new members Contact information Among the 54 new elected ISI members are several IMS members. We congratulate IMS IMS Bulletin Editor: Xuming He Fellow Jon Wellner, and IMS members: Subhabrata Chakraborti, USA; Liliana Forzani, Assistant Editor: Tati Howell Argentina; Ronald D.
    [Show full text]
  • Causal Diagrams for Interference Elizabeth L
    STATISTICAL SCIENCE Volume 29, Number 4 November 2014 Special Issue on Semiparametrics and Causal Inference Causal Etiology of the Research of James M. Robins ..............................................Thomas S. Richardson and Andrea Rotnitzky 459 Doubly Robust Policy Evaluation and Optimization ..........................Miroslav Dudík, Dumitru Erhan, John Langford and Lihong Li 485 Statistics,CausalityandBell’sTheorem.....................................Richard D. Gill 512 Standardization and Control for Confounding in Observational Studies: A Historical Perspective..............................................Niels Keiding and David Clayton 529 CausalDiagramsforInterference............Elizabeth L. Ogburn and Tyler J. VanderWeele 559 External Validity: From do-calculus to Transportability Across Populations .........................................................Judea Pearl and Elias Bareinboim 579 Nonparametric Bounds and Sensitivity Analysis of Treatment Effects .................Amy Richardson, Michael G. Hudgens, Peter B. Gilbert and Jason P. Fine 596 The Bayesian Analysis of Complex, High-Dimensional Models: Can It Be CODA? .......................................Y.Ritov,P.J.Bickel,A.C.GamstandB.J.K.Kleijn 619 Q-andA-learning Methods for Estimating Optimal Dynamic Treatment Regimes .............. Phillip J. Schulte, Anastasios A. Tsiatis, Eric B. Laber and Marie Davidian 640 A Uniformly Consistent Estimator of Causal Effects under the k-Triangle-Faithfulness Assumption...................................................Peter Spirtes
    [Show full text]
  • Tea: a High-Level Language and Runtime System for Automating
    Tea: A High-level Language and Runtime System for Automating Statistical Analysis Eunice Jun1, Maureen Daum1, Jared Roesch1, Sarah Chasins2, Emery Berger34, Rene Just1, Katharina Reinecke1 1University of Washington, 2University of California, 3University of Seattle, WA Berkeley, CA Massachusetts Amherst, femjun, mdaum, jroesch, [email protected] 4Microsoft Research, rjust, [email protected] Redmond, WA [email protected] ABSTRACT Since the development of modern statistical methods (e.g., Though statistical analyses are centered on research questions Student’s t-test, ANOVA, etc.), statisticians have acknowl- and hypotheses, current statistical analysis tools are not. Users edged the difficulty of identifying which statistical tests people must first translate their hypotheses into specific statistical should use to answer their specific research questions. Almost tests and then perform API calls with functions and parame- a century later, choosing appropriate statistical tests for eval- ters. To do so accurately requires that users have statistical uating a hypothesis remains a challenge. As a consequence, expertise. To lower this barrier to valid, replicable statistical errors in statistical analyses are common [26], especially given analysis, we introduce Tea, a high-level declarative language that data analysis has become a common task for people with and runtime system. In Tea, users express their study de- little to no statistical expertise. sign, any parametric assumptions, and their hypotheses. Tea A wide variety of tools (such as SPSS [55], SAS [54], and compiles these high-level specifications into a constraint satis- JMP [52]), programming languages (e.g., R [53]), and libraries faction problem that determines the set of valid statistical tests (including numpy [40], scipy [23], and statsmodels [45]), en- and then executes them to test the hypothesis.
    [Show full text]
  • December 2000
    THE ISBA BULLETIN Vol. 7 No. 4 December 2000 The o±cial bulletin of the International Society for Bayesian Analysis A WORD FROM already lays out all the elements mere statisticians might have THE PRESIDENT of the philosophical position anything to say to them that by Philip Dawid that he was to continue to could possibly be worth ISBA President develop and promote (to a listening to. I recently acted as [email protected] largely uncomprehending an expert witness for the audience) for the rest of his life. defence in a murder appeal, Radical Probabilism He is utterly uncompromising which revolved around a Modern Bayesianism is doing in his rejection of the realist variant of the “Prosecutor’s a wonderful job in an enormous conception that Probability is Fallacy” (the confusion of range of applied activities, somehow “out there in the world”, P (innocencejevidence) with supplying modelling, data and in his pragmatist emphasis P ('evidencejinnocence)). $ analysis and inference on Subjective Probability as Contents procedures to nourish parts that something that can be measured other techniques cannot reach. and regulated by suitable ➤ ISBA Elections and Logo But Bayesianism is far more instruments (betting behaviour, ☛ Page 2 than a bag of tricks for helping or proper scoring rules). other specialists out with their What de Finetti constructed ➤ Interview with Lindley tricky problems – it is a totally was, essentially, a whole new ☛ Page 3 original way of thinking about theory of logic – in the broad ➤ New prizes the world we live in. I was sense of principles for thinking ☛ Page 5 forcibly struck by this when I and learning about how the had to deliver some brief world behaves.
    [Show full text]
  • The BUGS Project: Evolution, Critique and Future Directions
    STATISTICS IN MEDICINE Statist. Med. 2009; 28:3049–3067 Published online 24 July 2009 in Wiley InterScience (www.interscience.wiley.com) DOI: 10.1002/sim.3680 The BUGS project: Evolution, critique and future directions 1, , 1 2 3 David Lunn ∗ †, David Spiegelhalter ,AndrewThomas and Nicky Best 1Medical Research Council Biostatistics Unit, Institute of Public Health, University Forvie Site, Robinson Way, Cambridge CB2 0SR, U.K. 2School of Mathematics and Statistics, Mathematical Institute, North Haugh, St. Andrews, Fife KY16 9SS, U.K. 3Department of Epidemiology and Public Health, Imperial College London, St. Mary’s Campus, Norfolk Place, London W2 1PG, U.K. SUMMARY BUGS is a software package for Bayesian inference using Gibbs sampling. The software has been instru- mental in raising awareness of Bayesian modelling among both academic and commercial communities internationally, and has enjoyed considerable success over its 20-year life span. Despite this, the software has a number of shortcomings and a principal aim of this paper is to provide a balanced critical appraisal, in particular highlighting how various ideas have led to unprecedented flexibility while at the same time producing negative side effects. We also present a historical overview of the BUGS project and some future perspectives. Copyright 2009 John Wiley & Sons, Ltd. KEY WORDS: BUGS; WinBUGS; OpenBUGS; Bayesian modelling; graphical models 1. INTRODUCTION BUGS 1 is a software package for performing Bayesian inference using Gibbs sampling 2, 3 . The BUGS[ ] project began at the Medical Research Council Biostatistics Unit in Cambridge in[ 1989.] Since that time the software has become one of the most popular statistical modelling packages, with, at the time of writing, over 30000 registered users of WinBUGS (the Microsoft Windows incarnation of the software) worldwide, and an active on-line community comprising over 8000 members.
    [Show full text]
  • Design and Analysis Issues in Family-Based Association
    Emerging Challenges in Statistical Genetics Duncan Thomas University of Southern California Human Genetics in the Big Science Era • “Big Data” – large n and large p and complexity e.g., NIH Biomedical Big Data Initiative (RFA-HG-14-020) • Large n: challenge for computation and data storage, but not conceptual • Large p: many data mining approaches, few grounded in statistical principles • Sparse penalized regression & hierarchical modeling from Bayesian and frequentist perspectives • Emerging –omics challenges Genetics: from Fisher to GWAS • Population genetics & heritability – Mendel / Fisher / Haldane / Wright • Segregation analysis – Likelihoods on complex pedigrees by peeling: Elston & Stewart • Linkage analysis (PCR / microsats / SNPs) – Multipoint: Lander & Green – MCMC: Thompson • Association – TDT, FBATs, etc: Spielman, Laird – GWAS: Risch & Merikangas – Post-GWAS: pathway mining, next-gen sequencing Association: From hypothesis-driven to agnostic research Candidate pathways Candidate Hierarchical GWAS genes models (ht-SNPs) Ontologies Pathway mining MRC BSU SGX Plans Objectives: – Integrating structural and prior information for sparse regression analysis of high dimensional data – Clustering models for exposure-disease associations – Integrating network information – Penalised regression and Bayesian variable selection – Mechanistic models of cellular processes – Statistical computing for large scale genomics data Targeted areas of impact : – gene regulation and immunological response – biomarker based signatures – targeting
    [Show full text]
  • Stan User's Guide 2.27
    Stan User’s Guide Version 2.27 Stan Development Team Contents Overview 9 Part 1. Example Models 11 1. Regression Models 12 1.1 Linear regression 12 1.2 The QR reparameterization 14 1.3 Priors for coefficients and scales 16 1.4 Robust noise models 16 1.5 Logistic and probit regression 17 1.6 Multi-logit regression 19 1.7 Parameterizing centered vectors 21 1.8 Ordered logistic and probit regression 24 1.9 Hierarchical logistic regression 25 1.10 Hierarchical priors 28 1.11 Item-response theory models 29 1.12 Priors for identifiability 32 1.13 Multivariate priors for hierarchical models 33 1.14 Prediction, forecasting, and backcasting 41 1.15 Multivariate outcomes 42 1.16 Applications of pseudorandom number generation 48 2. Time-Series Models 51 2.1 Autoregressive models 51 2.2 Modeling temporal heteroscedasticity 54 2.3 Moving average models 55 2.4 Autoregressive moving average models 58 2.5 Stochastic volatility models 60 2.6 Hidden Markov models 63 3. Missing Data and Partially Known Parameters 70 1 CONTENTS 2 3.1 Missing data 70 3.2 Partially known parameters 71 3.3 Sliced missing data 72 3.4 Loading matrix for factor analysis 73 3.5 Missing multivariate data 74 4. Truncated or Censored Data 77 4.1 Truncated distributions 77 4.2 Truncated data 77 4.3 Censored data 79 5. Finite Mixtures 82 5.1 Relation to clustering 82 5.2 Latent discrete parameterization 82 5.3 Summing out the responsibility parameter 83 5.4 Vectorizing mixtures 86 5.5 Inferences supported by mixtures 87 5.6 Zero-inflated and hurdle models 90 5.7 Priors and effective data size in mixture models 94 6.
    [Show full text]
  • Harrison Zhou
    Volume 39 • Issue 3 IMS1935–2010 Bulletin April 2010 Harrison Zhou: Tweedie Award Harrison Zhou receives 2010 IMS Tweedie New Researcher Award Contents The Institute of Mathematical Statistics has selected Harrison Zhou 1 Tweedie Award: Harrison as the winner of this year’s Tweedie New Researcher Award. Dr Zhou Zhou received his PhD in 2004 from Cornell University, and is 2 Members’ News: Iain currently an Associate Professor at Yale University. Johnstone; Ingram Olkin The IMS Travel Awards Committee selected Dr Zhou for “innovative and significant contributions to the theory and 3 Journal of Privacy and methods of nonparametric function estimation; for outstanding Confidentiality; NSF news Harrison Zhou contributions to high-dimensional statistical inference, including 4 Wisconsin Stat Dept estimation of large covariance matrices and sparse signals.” celebrates 50th birthday Dr Zhou said, “I am very much honored and humbled by this award. I am particu- 5 Meeting report: ICCS-X larly happy that the award is in honor of Richard Tweedie, a scholar who made many 6 Obituary: James F Hannan important contributions to our profession, including his extremely generous mentoring of young researchers. I myself have benefited greatly from the advice of my mentors Report: Zacks mini- 7 Michael Nussbaum, Larry Brown, Tony Cai, Mark Low and David Pollard, who have conference helped me understand some pieces of Le Cam’s work, the inspiration for my own 9 Rick’s Ramblings: How to research.” get your paper cited The IMS Tweedie New Researcher Award will fund Dr. Zhou’s travel to present 10 The Renaissance the Tweedie New Researcher Invited Lecture at the Thirteenth IMS Meeting of New Statistician Researchers in Statistics and Probability, held this year in Vancouver, BC, Canada, from 11 Terence’s Stuff: Firing July 27 to 30.
    [Show full text]
  • Alan Agresti
    Historical Highlights in the Development of Categorical Data Analysis Alan Agresti Department of Statistics, University of Florida UF Winter Workshop 2010 CDA History – p. 1/39 Karl Pearson (1857-1936) CDA History – p. 2/39 Karl Pearson (1900) Philos. Mag. Introduces chi-squared statistic (observed − expected)2 X2 = X expected df = no. categories − 1 • testing values for multinomial probabilities (Monte Carlo roulette runs) • testing fit of Pearson curves • testing statistical independence in r × c contingency table (df = rc − 1) CDA History – p. 3/39 Karl Pearson (1904) Advocates measuring association in contingency tables by approximating the correlation for an assumed underlying continuous distribution • tetrachoric correlation (2 × 2, assuming bivariate normality) X2 2 • contingency coefficient 2 based on for testing q X +n X independence in r × c contingency table • introduces term “contingency” as a “measure of the total deviation of the classification from independent probability.” CDA History – p. 4/39 George Udny Yule (1871-1951) (1900) Philos. Trans. Royal Soc. London (1912) JRSS Advocates measuring association using odds ratio n11 n12 n21 n22 n n n n − n n θˆ = 11 22 Q = 11 22 12 21 = (θˆ − 1)/(θˆ + 1) n12n21 n11n22 + n12n21 “At best the normal coefficient can only be said to give us. a hypothetical correlation between supposititious variables. The introduction of needless and unverifiable hypotheses does not appear to me a desirable proceeding in scientific work.” (1911) An Introduction to the Theory of Statistics (14 editions) CDA History – p. 5/39 K. Pearson, with D. Heron (1913) Biometrika “Unthinking praise has been bestowed on a textbook which can only lead statistical students hopelessly astray.” .
    [Show full text]
  • Invited Conference Speakers
    Medical Research Council Conference on Biostatistics in celebration of the MRC Biostatistics Unit's Centenary Year 24th - 26th March 2014 | Queens' College Cambridge, UK Invited Conference Speakers Professor Tony Ades, School of Social and Community Medicine, University of Bristol Tony Ades is Professor of Public Health Science at the University of Bristol and leads a programme on methods for evidence synthesis in epidemiology and decision modelling. This was originally funded through the MRC Health Services Research Collaboration. With Guobing Lu, Nicky Welton, Sofia DIas, Debbi Caldwell, Malcolm Price, Aicha Goubar and many collaborators in Cambridge and elsewhere, the programme has contributed original research on Network Meta- analysis, multi-parameter synthesis models for the epidemiology of HIV and chlamydia, synthesis for Markov models, expected value of information, and other topics. Tony was a member of the Appraisals Committee at the National Institute for Clinical Excellence (NICE), 2003-2013, and was awarded a lifetime achievement award in 2010 by the Society for Research Synthesis Methodology. Seminar Title: Synthesis of treatment effects, Mappings between outcomes, and test responsiveness Tuesday 25 March, 11.00-12.30 session Abstract: Synthesis of treatment effects, mappings between outcomes, standardisation, and test responsiveness AE Ades, Guobing Lu, Daphne Kounali. School of Social and Community Medicine, University of Bristol Abstract: We shall report on some experiments with a new class of models for synthesis of treatment effect evidence on “similar” outcomes, within and between trials. They are intended particularly for synthesis of patient- and clinician- reported outcomes that are subject to measurement error. These models assume that treatment effects on different outcomes are in fixed, or approximately fixed, ratios across trials.
    [Show full text]
  • Meta-Analysis of Diagnostic Test Data: Modern Statistical Approaches
    Meta-Analysis of Diagnostic Test Data: Modern Statistical Approaches Inaugural Dissertation zur Erlangung des Doktorgrades der Mathematisch-Naturwissenschaftlichen Fakult at¨ der Heinrich-Heine Universit at¨ D usseldorf¨ vorgelegt von Pablo Emilio Verde aus Buenos Aires, Argentinien Juni 2008 Aus der Koordinierungszentrum f ur¨ Klinische Studien Heinrich-Heine Universit at¨ D usseldorf¨ Gedruckt mit der Genehmigung der Mathematisch-Naturwissenschaftlichen Fakult at¨ der Heinrich-Heine-Universit at¨ D usseldorf¨ Referent: Prof. Dr. Christian Ohmann Koreferent: Prof. Dr. Martin Lercher Tag der m undlichen¨ Pr ufung:¨ 08.07.2008 To Isabel, Luc ´ıa, Heide, my Mother and to the memory of my Father. iv Acknowledgments I am very thankful to Christian Ohmann for supporting my work during these years. He has allowed me to work with the intellectual freedom, that I needed to develop and to accomplish this dissertation. I am very grateful to Martin Lercher for welcoming my dissertation, without his support and trust this work would not be possible. This work was partially financed by the Swiss National Science Foundation (51A240-104890). I thank Johannes Siegrist for supporting this work. In the summer of 2007, I had the honor to be invited to teach in the Department of Statistics at Stanford University. During my stay I presented an early version of this work. I am immensely grateful to Brad Efron, Trevor Hastie and Ingram Olkin for discussions and several suggestions to improve my work. I am also grateful to Persi Diaconis for numerous enlightening conversations on Bayesian statistics. My sincere acknowledgement to my professor in Argentina Martin Grondona. He has constantly encouraged me to finalize my dissertation.
    [Show full text]