Tea: a High-Level Language and Runtime System for Automating Statistical Analyses

Total Page:16

File Type:pdf, Size:1020Kb

Tea: a High-Level Language and Runtime System for Automating Statistical Analyses Tea: A High-level Language and Runtime System for Automating Statistical Analyses Eunice Jun, Maureen Daum, Jared Roesch Sarah E. Chasins University of Washington University of California, Berkeley Emery D. Berger Rene Just, Katharina Reinecke University of Massachusetts Amherst University of Washington Figure 1: Overview of Tea’s language constructs and its runtime system’s compilation process. The colors correspond to the annotations provided in Figure 2. ABSTRACT KEYWORDS Though statistical analyses are centered on research ques- statistical analysis, declarative programming language tions and hypotheses, current statistical analysis tools are not. Users must first translate their hypotheses into specific statistical tests and then perform API calls with functions and parameters. To do so accurately requires that users have 1 INTRODUCTION statistical expertise. To lower this barrier to valid, replicable statistical analysis, we introduce Tea1, a high-level declar- The enormous variety of modern quantitative ative language and runtime system. In Tea, users express methods leaves researchers with the nontrivial their study design, any parametric assumptions, and their task of matching analysis and design to the re- hypotheses. Tea compiles these high-level specifications into search question. - Ronald Fisher [11] a constraint satisfaction problem that determines the set of Since the development of modern statistical methods (e.g., valid statistical tests, and then executes them to test the hy- Student’s t-test, ANOVA, etc.), statisticians have acknowl- pothesis. We evaluate Tea using a suite of statistical analyses edged the difficulty of identifying which statistical tests peo- arXiv:1904.05387v1 [cs.PL] 10 Apr 2019 drawn from popular tutorials. We show that Tea generally ple should use to answer their specific research questions. matches the choices of experts while automatically switch- Almost a century later, choosing appropriate statistical tests ing to non-parametric tests when parametric assumptions for evaluating a hypothesis remains a challenge. As a con- are not met. We simulate the effect of mistakes made by non- sequence, errors in statistical analyses are common [20], expert users and show that Tea automatically avoids both especially given that data analysis has become a common false negatives and false positives that could be produced by task for people with little to no statistical expertise. the application of incorrect statistical tests. A wide variety of tools (such as SPSS [46], SAS [45], and JMP [43]), programming languages (R [44]), and libraries 1named after Fisher’s “Lady Tasting Tea” experiment [11] (including numpy [32], scipy [17], and statsmodels [37]), enable people to perform specific statistical tests, but they do not address the fundamental problem that users may not arxiv, April 2019, on-line know which statistical test to perform and how to verify that 2019. https://doi.org/0000001.0000001 specific assumptions about their data hold. In fact, all of these tools place the burden of valid, replica- limitations and future work and how Tea is different from ble statistical analyses on the user and demand deep knowl- related work. We conclude by providing information on how edge of statistics. Users not only have to identify their re- to use Tea. search questions, hypotheses, and domain assumptions, but also must select statistical tests for their hypotheses (e.g., 2 USAGE SCENARIO Student’s t-test or one-way ANOVA). For each statistical This section describes how an analyst who has no statistical test, users must be aware of the statistical assumptions each background can use Tea to answer their research questions. test makes about the data (e.g., normality or equal variance We use as an example analyst a historical criminologist who between groups) and how to check for them, which requires wants to determine how imprisonment differed across re- additional statistical tests (e.g., Levene’s test for equal vari- gions of the US in 19602. Figure 2 shows the Tea code for ance), which themselves may demand further assumptions this example. about the data. This entire process requires significant knowl- The analyst specifies the data file’s path in Tea. Tea han- edge about statistical tests and their preconditions, as well dles loading and storing the data set for the duration of the as the ability to perform the tests and verify their precondi- analysis session. The analyst does not have to worry about tions. This cognitively demanding process can easily lead to transforming the data in any way. mistakes. The analyst asks if the probability of imprisonment was This paper presents Tea, a high-level declarative language higher in southern states than in non-southern states. The for automating statistical test selection and execution that analyst identifies two variables that could help them answer abstracts the details of statistical analysis from the users. Tea this question: the probability of imprisonment (‘Prob’) and captures users’ hypotheses and domain knowledge, trans- geographic location (‘So’). for non-southern. Using Tea, the lates this information into a constraint satisfaction problem, analyst defines the geographic location as a dichotomous identifies all valid statistical tests to evaluate a hypothesis, nominal variable where ‘1’ indicates a southern state and and executes the tests. Figure 1 illustrates Tea’s compilation ‘0’ indicates a non-southern state, and indicates that the process. Tea’s higher-level, declarative nature aims to lower probability of imprisonment is a numeric data type (ratio) the barrier to valid, replicable analyses. with a range between 0 and 1. We have designed Tea to integrate directly into common The analyst then specifies their study design, defining the data analysis workflows for users who have minimal pro- study type to be ‘observational study’ (rather than experi- gramming experience. Tea is implemented as an open-source mental study) and defining the independent variable to be the Python library, so programmers can use Tea wherever they geographic location and the outcome (dependent) variable use Python, including within Python notebooks. to be the probability of imprisonment. In addition, Tea is flexible. Its abstraction of the analy- Based on their prior research, the analyst knows that the sis process and use of a constraint solver to select tests is probability of imprisonment in southern and non-southern designed to support its extension to emerging statistical states is normally distributed. The analyst provides an as- methods, such as Bayesian analysis. Currently, Tea supports sumptions clause to Tea in which they specify this domain frequentist Null Hypothesis Significance Testing (NHST). knowledge. They also specify an acceptable Type I error rate The paper makes the following contributions: (probability of finding a false positive result), more collo- • Tea, a novel domain-specific language (DSL) for auto- quially known as the ‘significance threshold’α ( = :05) that matically selecting and executing statistical analyses is acceptable in criminology. If the analyst does not have based on users’ hypotheses and domain knowledge assumptions or forgets to provide assumptions, Tea will use ( Section 4), the default of α = :05. • the Tea runtime system, which formulates statistical The analyst hypothesizes that southern states will have test selection as a maximum constraint satisfaction a higher probability of imprisonment than non-southern problem ( Section 5), and states. The analyst directly expresses this hypothesis in Tea. • an initial evaluation showing that Tea can express and Note that at no point does the analyst indicate which statistical execute common NHST statistical tests ( Section 6). tests should be performed. From this point on, Tea operates entirely automatically. We start with a usage scenario that provides an overview When the analyst runs their Tea program, Tea checks proper- of Tea (Section 2). We discuss the concerns about statistics in ties of the data and finds that Student’s t-test is appropriate. the HCI community that shaped Tea’s design (Section 3), the implementation of Tea’s programming language (Section 4), the implementation of Tea’s runtime system (Section 5), and 2The example is taken from [8] and [39]. The data set comes as part of the the evaluation of Tea as a whole (Section 6). We then discuss MASS package in R. Tea executes the Student’s t-test and non-parametric alter- natives, such as the Mann-Whitney U test, which provide alternative, consistent results. Tea generates a table of results from executing the tests, ordered by their power (i.e., results from the parametric t- test will be listed first given that it has higher power than the non-parametric equivalent). Based on this output, the analyst concludes that their hypothesis—that the probabil- ity of imprisonment was higher in southern states than in non-southern states in 1960—is supported. The results from alternative statistical tests support this conclusion, so the analyst can be confident in their assessment. The analyst can now share their Tea program with col- leagues. Other researchers can easily see what assumptions the analyst made and what the intended hypothesis was (since these are explicitly stated in the Tea program), and reproduce the exact results using Tea. 3 DESIGN CONSIDERATIONS In designing Tea’s language and runtime system, we consid- ered best practices for conducting statistical analyses and Figure 2: Sample Tea program specification
Recommended publications
  • September 2010
    THE ISBA BULLETIN Vol. 17 No. 3 September 2010 The official bulletin of the International Society for Bayesian Analysis AMESSAGE FROM THE membership renewal (there will be special provi- PRESIDENT sions for members who hold multi-year member- ships). Peter Muller¨ The Bayesian Nonparametrics Section (IS- ISBA President, 2010 BA/BNP) is already up and running, and plan- [email protected] ning the 2011 BNP workshop. Please see the News from the World section in this Bulletin First some sad news. In August we lost two and our homepage (select “business” and “mee- big Bayesians. Julian Besag passed away on Au- tings”). gust 6, and Arnold Zellner passed away on Au- gust 11. Arnold was one of the founding ISBA ISBA/SBSS Educational Initiative. Jointly presidents and was instrumental to get Bayesian with ASA/SBSS (American Statistical Associa- started. Obituaries on this issue of the Analysis tion, Section on Bayesian Statistical Science) we Bulletin and on our homepage acknowledge Juli- launched a new joint educational initiative. The an and Arnold’s pathbreaking contributions and initiative formalized a long standing history of their impact on the lives of many people in our collaboration of ISBA and ASA/SBSS in related research community. They will be missed dearly! matters. Continued on page 2. ISBA Elections 2010. Please check out the elec- tion statements of the candidates for the upco- In this issue ming ISBA elections. We have an amazing slate ‰ A MESSAGE FROM THE BA EDITOR of candidates. Thanks to the nominating commit- *Page 2 tee, Mike West (chair), Renato Martins Assuncao,˜ Jennifer Hill, Beatrix Jones, Jaeyong Lee, Yasuhi- ‰ 2010 ISBA ELECTION *Page 3 ro Omori and Gareth Robert! ‰ SAVAGE AWARD AND MITCHELL PRIZE 2010 *Page 8 Sections.
    [Show full text]
  • IMS Bulletin 39(4)
    Volume 39 • Issue 4 IMS1935–2010 Bulletin May 2010 Meet the 2010 candidates Contents 1 IMS Elections 2–3 Members’ News: new ISI members; Adrian Raftery; It’s time for the 2010 IMS elections, and Richard Smith; we introduce this year’s nominees who are IMS Collections vol 5 standing for IMS President-Elect and for IMS Council. You can read all the candi- 4 IMS Election candidates dates’ statements, starting on page 4. 9 Amendments to This year there are also amendments Constitution and Bylaws to the Constitution and Bylaws to vote Letter to the Editor 11 on: they are listed The candidate for IMS President-Elect is Medallion Preview: Laurens on page 9. 13 Ruth Williams de Haan Voting is open until June 26, so 14 COPSS Fisher lecture: Bruce https://secure.imstat.org/secure/vote2010/vote2010.asp Lindsay please visit to cast your vote! 15 Rick’s Ramblings: March Madness 16 Terence’s Stuff: And ANOVA thing 17 IMS meetings 27 Other meetings 30 Employment Opportunities 31 International Calendar of Statistical Events The ten Council candidates, clockwise from top left, are: 35 Information for Advertisers Krzysztof Burdzy, Francisco Cribari-Neto, Arnoldo Frigessi, Peter Kim, Steve Lalley, Neal Madras, Gennady Samorodnitsky, Ingrid Van Keilegom, Yazhen Wang and Wing H Wong Abstract submission deadline extended to April 30 IMS Bulletin 2 . IMs Bulletin Volume 39 . Issue 4 Volume 39 • Issue 4 May 2010 IMS members’ news ISSN 1544-1881 International Statistical Institute elects new members Contact information Among the 54 new elected ISI members are several IMS members. We congratulate IMS IMS Bulletin Editor: Xuming He Fellow Jon Wellner, and IMS members: Subhabrata Chakraborti, USA; Liliana Forzani, Assistant Editor: Tati Howell Argentina; Ronald D.
    [Show full text]
  • Tea: a High-Level Language and Runtime System for Automating
    Tea: A High-level Language and Runtime System for Automating Statistical Analysis Eunice Jun1, Maureen Daum1, Jared Roesch1, Sarah Chasins2, Emery Berger34, Rene Just1, Katharina Reinecke1 1University of Washington, 2University of California, 3University of Seattle, WA Berkeley, CA Massachusetts Amherst, femjun, mdaum, jroesch, [email protected] 4Microsoft Research, rjust, [email protected] Redmond, WA [email protected] ABSTRACT Since the development of modern statistical methods (e.g., Though statistical analyses are centered on research questions Student’s t-test, ANOVA, etc.), statisticians have acknowl- and hypotheses, current statistical analysis tools are not. Users edged the difficulty of identifying which statistical tests people must first translate their hypotheses into specific statistical should use to answer their specific research questions. Almost tests and then perform API calls with functions and parame- a century later, choosing appropriate statistical tests for eval- ters. To do so accurately requires that users have statistical uating a hypothesis remains a challenge. As a consequence, expertise. To lower this barrier to valid, replicable statistical errors in statistical analyses are common [26], especially given analysis, we introduce Tea, a high-level declarative language that data analysis has become a common task for people with and runtime system. In Tea, users express their study de- little to no statistical expertise. sign, any parametric assumptions, and their hypotheses. Tea A wide variety of tools (such as SPSS [55], SAS [54], and compiles these high-level specifications into a constraint satis- JMP [52]), programming languages (e.g., R [53]), and libraries faction problem that determines the set of valid statistical tests (including numpy [40], scipy [23], and statsmodels [45]), en- and then executes them to test the hypothesis.
    [Show full text]
  • December 2000
    THE ISBA BULLETIN Vol. 7 No. 4 December 2000 The o±cial bulletin of the International Society for Bayesian Analysis A WORD FROM already lays out all the elements mere statisticians might have THE PRESIDENT of the philosophical position anything to say to them that by Philip Dawid that he was to continue to could possibly be worth ISBA President develop and promote (to a listening to. I recently acted as [email protected] largely uncomprehending an expert witness for the audience) for the rest of his life. defence in a murder appeal, Radical Probabilism He is utterly uncompromising which revolved around a Modern Bayesianism is doing in his rejection of the realist variant of the “Prosecutor’s a wonderful job in an enormous conception that Probability is Fallacy” (the confusion of range of applied activities, somehow “out there in the world”, P (innocencejevidence) with supplying modelling, data and in his pragmatist emphasis P ('evidencejinnocence)). $ analysis and inference on Subjective Probability as Contents procedures to nourish parts that something that can be measured other techniques cannot reach. and regulated by suitable ➤ ISBA Elections and Logo But Bayesianism is far more instruments (betting behaviour, ☛ Page 2 than a bag of tricks for helping or proper scoring rules). other specialists out with their What de Finetti constructed ➤ Interview with Lindley tricky problems – it is a totally was, essentially, a whole new ☛ Page 3 original way of thinking about theory of logic – in the broad ➤ New prizes the world we live in. I was sense of principles for thinking ☛ Page 5 forcibly struck by this when I and learning about how the had to deliver some brief world behaves.
    [Show full text]
  • The BUGS Project: Evolution, Critique and Future Directions
    STATISTICS IN MEDICINE Statist. Med. 2009; 28:3049–3067 Published online 24 July 2009 in Wiley InterScience (www.interscience.wiley.com) DOI: 10.1002/sim.3680 The BUGS project: Evolution, critique and future directions 1, , 1 2 3 David Lunn ∗ †, David Spiegelhalter ,AndrewThomas and Nicky Best 1Medical Research Council Biostatistics Unit, Institute of Public Health, University Forvie Site, Robinson Way, Cambridge CB2 0SR, U.K. 2School of Mathematics and Statistics, Mathematical Institute, North Haugh, St. Andrews, Fife KY16 9SS, U.K. 3Department of Epidemiology and Public Health, Imperial College London, St. Mary’s Campus, Norfolk Place, London W2 1PG, U.K. SUMMARY BUGS is a software package for Bayesian inference using Gibbs sampling. The software has been instru- mental in raising awareness of Bayesian modelling among both academic and commercial communities internationally, and has enjoyed considerable success over its 20-year life span. Despite this, the software has a number of shortcomings and a principal aim of this paper is to provide a balanced critical appraisal, in particular highlighting how various ideas have led to unprecedented flexibility while at the same time producing negative side effects. We also present a historical overview of the BUGS project and some future perspectives. Copyright 2009 John Wiley & Sons, Ltd. KEY WORDS: BUGS; WinBUGS; OpenBUGS; Bayesian modelling; graphical models 1. INTRODUCTION BUGS 1 is a software package for performing Bayesian inference using Gibbs sampling 2, 3 . The BUGS[ ] project began at the Medical Research Council Biostatistics Unit in Cambridge in[ 1989.] Since that time the software has become one of the most popular statistical modelling packages, with, at the time of writing, over 30000 registered users of WinBUGS (the Microsoft Windows incarnation of the software) worldwide, and an active on-line community comprising over 8000 members.
    [Show full text]
  • Design and Analysis Issues in Family-Based Association
    Emerging Challenges in Statistical Genetics Duncan Thomas University of Southern California Human Genetics in the Big Science Era • “Big Data” – large n and large p and complexity e.g., NIH Biomedical Big Data Initiative (RFA-HG-14-020) • Large n: challenge for computation and data storage, but not conceptual • Large p: many data mining approaches, few grounded in statistical principles • Sparse penalized regression & hierarchical modeling from Bayesian and frequentist perspectives • Emerging –omics challenges Genetics: from Fisher to GWAS • Population genetics & heritability – Mendel / Fisher / Haldane / Wright • Segregation analysis – Likelihoods on complex pedigrees by peeling: Elston & Stewart • Linkage analysis (PCR / microsats / SNPs) – Multipoint: Lander & Green – MCMC: Thompson • Association – TDT, FBATs, etc: Spielman, Laird – GWAS: Risch & Merikangas – Post-GWAS: pathway mining, next-gen sequencing Association: From hypothesis-driven to agnostic research Candidate pathways Candidate Hierarchical GWAS genes models (ht-SNPs) Ontologies Pathway mining MRC BSU SGX Plans Objectives: – Integrating structural and prior information for sparse regression analysis of high dimensional data – Clustering models for exposure-disease associations – Integrating network information – Penalised regression and Bayesian variable selection – Mechanistic models of cellular processes – Statistical computing for large scale genomics data Targeted areas of impact : – gene regulation and immunological response – biomarker based signatures – targeting
    [Show full text]
  • Stan User's Guide 2.27
    Stan User’s Guide Version 2.27 Stan Development Team Contents Overview 9 Part 1. Example Models 11 1. Regression Models 12 1.1 Linear regression 12 1.2 The QR reparameterization 14 1.3 Priors for coefficients and scales 16 1.4 Robust noise models 16 1.5 Logistic and probit regression 17 1.6 Multi-logit regression 19 1.7 Parameterizing centered vectors 21 1.8 Ordered logistic and probit regression 24 1.9 Hierarchical logistic regression 25 1.10 Hierarchical priors 28 1.11 Item-response theory models 29 1.12 Priors for identifiability 32 1.13 Multivariate priors for hierarchical models 33 1.14 Prediction, forecasting, and backcasting 41 1.15 Multivariate outcomes 42 1.16 Applications of pseudorandom number generation 48 2. Time-Series Models 51 2.1 Autoregressive models 51 2.2 Modeling temporal heteroscedasticity 54 2.3 Moving average models 55 2.4 Autoregressive moving average models 58 2.5 Stochastic volatility models 60 2.6 Hidden Markov models 63 3. Missing Data and Partially Known Parameters 70 1 CONTENTS 2 3.1 Missing data 70 3.2 Partially known parameters 71 3.3 Sliced missing data 72 3.4 Loading matrix for factor analysis 73 3.5 Missing multivariate data 74 4. Truncated or Censored Data 77 4.1 Truncated distributions 77 4.2 Truncated data 77 4.3 Censored data 79 5. Finite Mixtures 82 5.1 Relation to clustering 82 5.2 Latent discrete parameterization 82 5.3 Summing out the responsibility parameter 83 5.4 Vectorizing mixtures 86 5.5 Inferences supported by mixtures 87 5.6 Zero-inflated and hurdle models 90 5.7 Priors and effective data size in mixture models 94 6.
    [Show full text]
  • A Joint Spatio-Temporal Model of Opioid Associated Deaths and Treatment Admissions in Ohio
    A JOINT SPATIO-TEMPORAL MODEL OF OPIOID ASSOCIATED DEATHS AND TREATMENT ADMISSIONS IN OHIO BY YIXUAN JI A Thesis Submitted to the Graduate Faculty of WAKE FOREST UNIVERSITY GRADUATE SCHOOL OF ARTS AND SCIENCES in Partial Fulfillment of the Requirements for the Degree of MASTER OF ARTS Mathematics and Statistics May 2019 Winston-Salem, North Carolina Approved By: Staci Hepler, Ph.D., Advisor Miaohua Jiang, Ph.D., Chair Robert Erhardt, Ph.D. Acknowledgments First and foremost, I would like to express my sincere gratitude and appreciation to my advisor, Dr. Staci A. Hepler. Without your continuous support and inspiration, I could not have achieved this thesis research. Thank you so much for your patience, motivation, enthusiasm, and immense knowledge, which made this research even more enjoyable. In addition to my advisor, I would also like to thank Dr. Miaohua Jiang and Dr. Robert Erhardt, for the constant encouragement and advice, and for serving on my thesis committee. Furthermore, this thesis benefited greatly from various courses that I took, including Bayesian statistics from Dr. Hepler, factor analysis and Markov Chain from Dr. Berenhaut, poisson linear model from Dr. Erhardt's Generalized Linear Model class, R programming skills from Dr. Nicole Dalzell and also reasoning skills from Dr. John Gemmer's Real Analysis class. Additionally, I would like to extend my thanks to Dr. Ellen Kirkman, Dr. Stephen Robinson and Dr. Sarah Raynor for their special attention and support to me, as an international student. Last but not least, I would like to thank my family for their sincere mental support throughout my life.
    [Show full text]
  • Harrison Zhou
    Volume 39 • Issue 3 IMS1935–2010 Bulletin April 2010 Harrison Zhou: Tweedie Award Harrison Zhou receives 2010 IMS Tweedie New Researcher Award Contents The Institute of Mathematical Statistics has selected Harrison Zhou 1 Tweedie Award: Harrison as the winner of this year’s Tweedie New Researcher Award. Dr Zhou Zhou received his PhD in 2004 from Cornell University, and is 2 Members’ News: Iain currently an Associate Professor at Yale University. Johnstone; Ingram Olkin The IMS Travel Awards Committee selected Dr Zhou for “innovative and significant contributions to the theory and 3 Journal of Privacy and methods of nonparametric function estimation; for outstanding Confidentiality; NSF news Harrison Zhou contributions to high-dimensional statistical inference, including 4 Wisconsin Stat Dept estimation of large covariance matrices and sparse signals.” celebrates 50th birthday Dr Zhou said, “I am very much honored and humbled by this award. I am particu- 5 Meeting report: ICCS-X larly happy that the award is in honor of Richard Tweedie, a scholar who made many 6 Obituary: James F Hannan important contributions to our profession, including his extremely generous mentoring of young researchers. I myself have benefited greatly from the advice of my mentors Report: Zacks mini- 7 Michael Nussbaum, Larry Brown, Tony Cai, Mark Low and David Pollard, who have conference helped me understand some pieces of Le Cam’s work, the inspiration for my own 9 Rick’s Ramblings: How to research.” get your paper cited The IMS Tweedie New Researcher Award will fund Dr. Zhou’s travel to present 10 The Renaissance the Tweedie New Researcher Invited Lecture at the Thirteenth IMS Meeting of New Statistician Researchers in Statistics and Probability, held this year in Vancouver, BC, Canada, from 11 Terence’s Stuff: Firing July 27 to 30.
    [Show full text]
  • Invited Conference Speakers
    Medical Research Council Conference on Biostatistics in celebration of the MRC Biostatistics Unit's Centenary Year 24th - 26th March 2014 | Queens' College Cambridge, UK Invited Conference Speakers Professor Tony Ades, School of Social and Community Medicine, University of Bristol Tony Ades is Professor of Public Health Science at the University of Bristol and leads a programme on methods for evidence synthesis in epidemiology and decision modelling. This was originally funded through the MRC Health Services Research Collaboration. With Guobing Lu, Nicky Welton, Sofia DIas, Debbi Caldwell, Malcolm Price, Aicha Goubar and many collaborators in Cambridge and elsewhere, the programme has contributed original research on Network Meta- analysis, multi-parameter synthesis models for the epidemiology of HIV and chlamydia, synthesis for Markov models, expected value of information, and other topics. Tony was a member of the Appraisals Committee at the National Institute for Clinical Excellence (NICE), 2003-2013, and was awarded a lifetime achievement award in 2010 by the Society for Research Synthesis Methodology. Seminar Title: Synthesis of treatment effects, Mappings between outcomes, and test responsiveness Tuesday 25 March, 11.00-12.30 session Abstract: Synthesis of treatment effects, mappings between outcomes, standardisation, and test responsiveness AE Ades, Guobing Lu, Daphne Kounali. School of Social and Community Medicine, University of Bristol Abstract: We shall report on some experiments with a new class of models for synthesis of treatment effect evidence on “similar” outcomes, within and between trials. They are intended particularly for synthesis of patient- and clinician- reported outcomes that are subject to measurement error. These models assume that treatment effects on different outcomes are in fixed, or approximately fixed, ratios across trials.
    [Show full text]
  • Meta-Analysis of Diagnostic Test Data: Modern Statistical Approaches
    Meta-Analysis of Diagnostic Test Data: Modern Statistical Approaches Inaugural Dissertation zur Erlangung des Doktorgrades der Mathematisch-Naturwissenschaftlichen Fakult at¨ der Heinrich-Heine Universit at¨ D usseldorf¨ vorgelegt von Pablo Emilio Verde aus Buenos Aires, Argentinien Juni 2008 Aus der Koordinierungszentrum f ur¨ Klinische Studien Heinrich-Heine Universit at¨ D usseldorf¨ Gedruckt mit der Genehmigung der Mathematisch-Naturwissenschaftlichen Fakult at¨ der Heinrich-Heine-Universit at¨ D usseldorf¨ Referent: Prof. Dr. Christian Ohmann Koreferent: Prof. Dr. Martin Lercher Tag der m undlichen¨ Pr ufung:¨ 08.07.2008 To Isabel, Luc ´ıa, Heide, my Mother and to the memory of my Father. iv Acknowledgments I am very thankful to Christian Ohmann for supporting my work during these years. He has allowed me to work with the intellectual freedom, that I needed to develop and to accomplish this dissertation. I am very grateful to Martin Lercher for welcoming my dissertation, without his support and trust this work would not be possible. This work was partially financed by the Swiss National Science Foundation (51A240-104890). I thank Johannes Siegrist for supporting this work. In the summer of 2007, I had the honor to be invited to teach in the Department of Statistics at Stanford University. During my stay I presented an early version of this work. I am immensely grateful to Brad Efron, Trevor Hastie and Ingram Olkin for discussions and several suggestions to improve my work. I am also grateful to Persi Diaconis for numerous enlightening conversations on Bayesian statistics. My sincere acknowledgement to my professor in Argentina Martin Grondona. He has constantly encouraged me to finalize my dissertation.
    [Show full text]
  • IMS Bulletin 2
    Volume 39 • Issue 10 IMS1935–2010 Bulletin December 2010 Editor’s farewell message Contents Xuming He writes: My term as Editor ends with this very issue. Ten times a year, I 1 Editor’s farewell worried about whether the Bulletin would go out on schedule, both electronic and print versions, and whether enough of our members would find the contents interest- 2 Members’ News: C R Rao; P K Sen; Adrian Smith; Igor ing and informative. Those worries however never turned into headaches. In fact, I Vajda had something to look forward to each time, thanks to our excellent editorial team and enthusiastic contributions from our members. No wonder I have become a little 3 IMS Awards nostalgic as I complete my four years of editorship. 4 Other Awards The Bulletin provides a place to remember the past. We publish obituaries of our 5–7 Obituaries: Julian Besag; Fellows, members, and significant friends of the IMS. Our society is built upon on Hermann Witting; Dan Brunk their legacy. This year, we celebrated the 75th anniversary of the IMS by a number of special articles contributed by some of our senior members, who told us how 7 NSF-CBMS lecturers important the IMS has been to their professional lives. If you missed any, these issues 9 How to Publish: IMS-CUP of the Bulletin can be found at http://bulletin.imstat.org. As Editor, my heart also beats book series with the younger members of our profession. From the profile of new faculty at the 10 Terence’s Stuff: Enduring North Carolina State University in the January/February 2007 issue to our report on values the CAREER awards to four junior faculty members at Georgia Tech in the last issue, I 11 IMS meetings hope that we have made a point that the Bulletin is also a place for the future.
    [Show full text]