Multiple Testing in Statistical Analysis of Systems-Based Information Retrieval Experiments
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Sequential Analysis Tests and Confidence Intervals
D. Siegmund Sequential Analysis Tests and Confidence Intervals Series: Springer Series in Statistics The modern theory of Sequential Analysis came into existence simultaneously in the United States and Great Britain in response to demands for more efficient sampling inspection procedures during World War II. The develop ments were admirably summarized by their principal architect, A. Wald, in his book Sequential Analysis (1947). In spite of the extraordinary accomplishments of this period, there remained some dissatisfaction with the sequential probability ratio test and Wald's analysis of it. (i) The open-ended continuation region with the concomitant possibility of taking an arbitrarily large number of observations seems intol erable in practice. (ii) Wald's elegant approximations based on "neglecting the excess" of the log likelihood ratio over the stopping boundaries are not especially accurate and do not allow one to study the effect oftaking observa tions in groups rather than one at a time. (iii) The beautiful optimality property of the sequential probability ratio test applies only to the artificial problem of 1985, XI, 274 p. testing a simple hypothesis against a simple alternative. In response to these issues and to new motivation from the direction of controlled clinical trials numerous modifications of the sequential probability ratio test were proposed and their properties studied-often Printed book by simulation or lengthy numerical computation. (A notable exception is Anderson, 1960; see III.7.) In the past decade it has become possible to give a more complete theoretical Hardcover analysis of many of the proposals and hence to understand them better. ▶ 119,99 € | £109.99 | $149.99 ▶ *128,39 € (D) | 131,99 € (A) | CHF 141.50 eBook Available from your bookstore or ▶ springer.com/shop MyCopy Printed eBook for just ▶ € | $ 24.99 ▶ springer.com/mycopy Order online at springer.com ▶ or for the Americas call (toll free) 1-800-SPRINGER ▶ or email us at: [email protected]. -
Trial Sequential Analysis: Novel Approach for Meta-Analysis
Anesth Pain Med 2021;16:138-150 https://doi.org/10.17085/apm.21038 Review pISSN 1975-5171 • eISSN 2383-7977 Trial sequential analysis: novel approach for meta-analysis Hyun Kang Received April 19, 2021 Department of Anesthesiology and Pain Medicine, Chung-Ang University College of Accepted April 25, 2021 Medicine, Seoul, Korea Systematic reviews and meta-analyses rank the highest in the evidence hierarchy. However, they still have the risk of spurious results because they include too few studies and partici- pants. The use of trial sequential analysis (TSA) has increased recently, providing more infor- mation on the precision and uncertainty of meta-analysis results. This makes it a powerful tool for clinicians to assess the conclusiveness of meta-analysis. TSA provides monitoring boundaries or futility boundaries, helping clinicians prevent unnecessary trials. The use and Corresponding author Hyun Kang, M.D., Ph.D. interpretation of TSA should be based on an understanding of the principles and assump- Department of Anesthesiology and tions behind TSA, which may provide more accurate, precise, and unbiased information to Pain Medicine, Chung-Ang University clinicians, patients, and policymakers. In this article, the history, background, principles, and College of Medicine, 84 Heukseok-ro, assumptions behind TSA are described, which would lead to its better understanding, imple- Dongjak-gu, Seoul 06974, Korea mentation, and interpretation. Tel: 82-2-6299-2586 Fax: 82-2-6299-2585 E-mail: [email protected] Keywords: Interim analysis; Meta-analysis; Statistics; Trial sequential analysis. INTRODUCTION termined number of patients was reached [2]. A systematic review is a research method that attempts Sequential analysis is a statistical method in which the fi- to collect all empirical evidence according to predefined nal number of patients analyzed is not predetermined, but inclusion and exclusion criteria to answer specific and fo- sampling or enrollment of patients is decided by a prede- cused questions [3]. -
Detecting Reinforcement Patterns in the Stream of Naturalistic Observations of Social Interactions
Portland State University PDXScholar Dissertations and Theses Dissertations and Theses 7-14-2020 Detecting Reinforcement Patterns in the Stream of Naturalistic Observations of Social Interactions James Lamar DeLaney 3rd Portland State University Follow this and additional works at: https://pdxscholar.library.pdx.edu/open_access_etds Part of the Developmental Psychology Commons Let us know how access to this document benefits ou.y Recommended Citation DeLaney 3rd, James Lamar, "Detecting Reinforcement Patterns in the Stream of Naturalistic Observations of Social Interactions" (2020). Dissertations and Theses. Paper 5553. https://doi.org/10.15760/etd.7427 This Thesis is brought to you for free and open access. It has been accepted for inclusion in Dissertations and Theses by an authorized administrator of PDXScholar. Please contact us if we can make this document more accessible: [email protected]. Detecting Reinforcement Patterns in the Stream of Naturalistic Observations of Social Interactions by James Lamar DeLaney 3rd A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Psychology Thesis Committee: Thomas Kindermann, Chair Jason Newsom Ellen A. Skinner Portland State University i Abstract How do consequences affect future behaviors in real-world social interactions? The term positive reinforcer refers to those consequences that are associated with an increase in probability of an antecedent behavior (Skinner, 1938). To explore whether reinforcement occurs under naturally occuring conditions, many studies use sequential analysis methods to detect contingency patterns (see Quera & Bakeman, 1998). This study argues that these methods do not look at behavior change following putative reinforcers, and thus, are not sufficient for declaring reinforcement effects arising in naturally occuring interactions, according to the Skinner’s (1938) operational definition of reinforcers. -
HO Hartley Publications by Year
APPENDIX H.O. Hartley Publications by Year (1935-1980) BOOKS Pearson, ES; Hartley, HO. Biomtrika Tables for Statisticians, Vol. I. Biometrika Publications, Cambridge University Press, Bentley House, London, 1954 Greenwood, JA; Hartley, HO. Guide to Tables in Mathematical Statistics, Princeton University Press, Princeton, New Jersey, 1961 Pearson, ES; Hartley, HO. Biomtrika Tables for Statisticians, Vol. II. Biometrika Publications, Cambridge University Press, Bentley House, London, 1972 PEER- REVIEWED PUBLISHED MANUSCRIPTS 1935 *Hirshchfeld, HO. A Connection between Correlation and Contingency. Proceedings of the Cambridge Philosophical Society (Math. Proc.) 31: 520-524 1936 *Hirschfeld, HO. A Generalization of Picard’s Method of Successive Approximation. Proceedings of the Cambridge Philosophical Society 32: 86-95 *Hirschfeld, HO. Continuation of Differentiable Functions through the Plane. The Quarterly Journal of Mathematics, Oxford Series, 7: 1-15 Wishart, J; *Hirschfeld, HO. A Theorem Concerning the Distribution of Joins between Line Segments. Journal of the London Mathematical Society, 11: 227-235 1937 *Hirschfeld, HO. The Distribution of the Ratio of Covariance Estimates in Two Samples Drawn from Normal Bivariate Populations. Biometrika 29: 65-79 *Manuscripts dated from 1935 to 1937 are listed under born name of Hirschfeld, Hermann Otto. Manuscripts dated from 1938 -1980 are listed under changed name of Hartley, Herman Otto. 1938 Hunter, H; Hartley, HO. Relation of Ear Survival to the Nitrogen Content of Certain Varieties of Barley – With a Statistical Study. Journal of Agricultural Science 28: 472-502 Hartley, HO. Studentization and Large Sample Theory. Supplement to the Journal of the Royal Statistical Society 5: 80-88 1940 Hartley, HO. Testing the Homogeneity of a Set of Variances. -
A Simple but Accurate Excel User-Defined Function to Calculate Studentized Range Quantiles for Tukey Tests
A Simple but Accurate Excel United States Department of User-Defined Function to Agriculture Calculate Studentized Range Agricultural Quantiles for Tukey Tests Research Service K. Thomas Klasson Southern Regional Research Center Technical Report June 2019 Revised February 2020 The Agricultural Research Service (ARS) is the U.S. Department of Agriculture's chief scientific in-house research agency. Our job is finding solutions to agricultural problems that affect Americans every day from field to table. ARS conducts research to develop and transfer solutions to agricultural problems of high national priority and provide information access and dissemination of its research results. The U.S. Department of Agriculture (USDA) prohibits discrimination in all its programs and activities on the basis of race, color, national origin, age, disability, and where applicable, sex, marital status, familial status, parental status, religion, sexual orientation, genetic information, political beliefs, reprisal, or because all or part of an individual's income is derived from any public assistance program. (Not all prohibited bases apply to all programs.) Persons with disabilities who require alternative means for communication of program information (Braille, large print, audiotape, etc.) should contact USDA's TARGET Center at (202) 720-2600 (voice and TDD). To file a complaint of discrimination, write to USDA, Director, Office of Civil Rights, 1400 Independence Avenue, S.W., Washington, D.C. 20250-9410, or call (800) 795-3272 (voice) or (202) 720-6382 (TDD). USDA is an equal opportunity provider and employer. K. Thomas Klasson is a Supervisory Chemical Engineer at USDA-ARS, Southern Regional Research Center, 1100 Robert E. Lee Boulevard, New Orleans, LA 70124; email: [email protected] A Simple but Accurate Excel User-Defined Function to Calculate Studentized Range Quantiles for Tukey Tests K. -
Studentized Range Upper Quantiles
A Table of Upper Quantile Points of the Studentized Range Distribution This is a table of upper quantile points of the studentized range distribution, values that are required by several statistical techniques including the Tukey wsd and the Newman-Keuls multiple comparison procedures. The table entries are given to four significant digits, in the style of the classic table for this distribution (Harter, 1960; Harter & Clemm, 1959). 2 Suppose that X(1) and X(r) are the smallest and largest order statistics in r independent N(µ, σ ) random variables. That is, X(1) and X(r) are the minimum and maximum values in a simple random sample of size r from the normal distribution with mean µ and variance σ2. The (externally) studentized range is a variable of the form X − X Q = (r) (1) S 2 2 2 where νS /σ is a χ (ν) random variable that is independent of X(r) − X(1). The p-th quantile qp(r; ν)ofthe distribution of Q satisfies Pr(Q ≤ qp(r; ν)) = p; where 0 <p<1: That is, the area to the right of qp(r; ν) under the density function of Q is 1 − p. This table includes 4944 values of qp(r; ν), at all combinations of the following parameters: • p =0:50; 0:75; 0:90; 0:95; 0:975; 0:99; 0:995; 0:999 • r = 2(1)20; 30; 40(20)100 • ν = 1(1)20; 24; 30; 40; 60; 120; ∞ except that no values are provided for ν =1atp =0:50 or p =0:75. -
Sequential Analysis Exercises
Sequential analysis Exercises : 1. What is the practical idea at origin of sequential analysis ? 2. In which cases sequential analysis is appropriate ? 3. What are the supposed advantages of sequential analysis ? 4. What does the values from a three steps sequential analysis in the following table are for ? nb accept refuse grains e xamined 20 0 5 40 2 7 60 6 7 5. Do I have alpha and beta risks when I use sequential analysis ? ______________________________________________ 1 5th ISTA seminar on statistics S Grégoire August 1999 SEQUENTIAL ANALYSIS.DOC PRINCIPLE OF THE SEQUENTIAL ANALYSIS METHOD The method originates from a practical observation. Some objects are so good or so bad that with a quick look we are able to accept or reject them. For other objects we need more information to know if they are good enough or not. In other words, we do not need to put the same effort on all the objects we control. As a result of this we may save time or money if we decide for some objects at an early stage. The general basis of sequential analysis is to define, -before the studies-, sampling schemes which permit to take consistent decisions at different stages of examination. The aim is to compare the results of examinations made on an object to control <-- to --> decisional limits in order to take a decision. At each stage of examination the decision is to accept the object if it is good enough with a chosen probability or to reject the object if it is too bad (at a chosen probability) or to continue the study if we need more information before to fall in one of the two above categories. -
Sequential and Multiple Testing for Dose-Response Analysis
Drug Information Journal, Vol. 34, pp. 591±597, 2000 0092-8615/2000 Printed in the USA. All rights reserved. Copyright 2000 Drug Information Association Inc. SEQUENTIAL AND MULTIPLE TESTING FOR DOSE-RESPONSE ANALYSIS WALTER LEHMACHER,PHD Professor, Institute for Medical Statistics, Informatics and Epidemiology, University of Cologne, Germany MEINHARD KIESER,PHD Head, Department of Biometry, Dr. Willmar Schwabe Pharmaceuticals, Karlsruhe, Germany LUDWIG HOTHORN,PHD Professor, Department of Bioinformatics, University of Hannover, Germany For the analysis of dose-response relationship under the assumption of ordered alterna- tives several global trend tests are available. Furthermore, there are multiple test proce- dures which can identify doses as effective or even as minimally effective. In this paper it is shown that the principles of multiple comparisons and interim analyses can be combined in flexible and adaptive strategies for dose-response analyses; these procedures control the experimentwise error rate. Key Words: Dose-response; Interim analysis; Multiple testing INTRODUCTION rate in a strong sense) are necessary. The global and multiplicity aspects are also re- THE FIRST QUESTION in dose-response flected by the actual International Confer- analysis is whether a monotone trend across ence for Harmonization Guidelines (1) for dose groups exists. This leads to global trend dose-response analysis. Appropriate order tests, where the global level (familywise er- restricted tests can be confirmatory for dem- ror rate in a weak sense) is controlled. In onstrating a global trend and suitable multi- clinical research, one is additionally inter- ple procedures can identify a recommended ested in which doses are effective, which dose. minimal dose is effective, or which dose For the global hypothesis, there are sev- steps are effective, in order to substantiate eral suitable trend tests. -
Application of Range in the Overall Analysis of Variance
TOE APPLICATION OF RANGE IN TOE OTERALL ANALYSIS OF VARIANCE lor BARREL LEE EKLUND B, S,, Kansas State Qilvtersltj, I96I1 A MASTER'S REPORT submitted in partial fulfillment of the requirements for the degree MASTER OF SCIENCE Department of Statistics KANSAS STATE UNIVERSITT Manhattan, Kansas 1966 Approved by: Jig^^^&e^;css^}^Li<<>:<^ LP M^ ii r y. TABLE OF CONTENTS INTRODUCTION 1 HISTORICAL REVIEW AND RATIONALE 1 Developments in the use of Range • • 1 ^proximation to the Distribution of Mean Range. ••• 2 Distribution of the Range Ratio. ....•..•••.••.•. U niscussion of Sanple Range •• .......... U COMPLETELY RANDOMIZED DE?IGN 5 Theoretical Basis. • ....•.•... $ Illustration of Procedure 6 RANDOMIZED COMPLETE BLOCK DESIGN 7 Distaribution Theory for the Randomized Block Analysis by Range . 7 'Studentized' Range Test as a Substitute for the F-test 8 Illustration of Procedure. ..•.......•....•••. 9 COMPLETELY RANDOMIZED DESIGN WITH CELL REPLICATION AND FACTORIAL ARRANGEMENT OF TREATI-IENTS 11 P^ed Model 12 Random Ifodel 13 Illustration of Procedure , 11^ RANDOMIZED CCMPLE'lTi; BLOCK DESIGN WITH FACTORIAL ARRANGEMENT OF TREATIIEOTS I7 Theoretical Basis. I7 Illustration of Procedure. ............ I9 SPLIT-PLOT DESIGN 21 Hiaeoretical Basis 21 Illustration of Procedure , 23 COMPLETKLY RANDOMIZED DESIGN WITH UNEQUAL Cmi. HffiQUENCIES 26 iii MULTIPLE COMPARISONS 28 Multiple Q-test 28 Stepwise Reduction of the q-test • .....30 POWER OF THE RANQE TEST IN EIEMENTARY DESIGNS .30 Random Model. .•• ••••••• 31 Fixed Model • 33 SUI-IMARY 33 ACKNCKLEDGEMENT 3$ REFERENCES • 36 APPENDIX ••• ..*37 ) THE APPLICAnON OF RANGE IN THE OVERALL ANALYSIS OF VARIANCE INTRODUCTION One of the best known estimators of the variation within a sample is the sajiple range. -
Dataset Decay and the Problem of Sequential Analyses on Open Datasets
FEATURE ARTICLE META-RESEARCH Dataset decay and the problem of sequential analyses on open datasets Abstract Open data allows researchers to explore pre-existing datasets in new ways. However, if many researchers reuse the same dataset, multiple statistical testing may increase false positives. Here we demonstrate that sequential hypothesis testing on the same dataset by multiple researchers can inflate error rates. We go on to discuss a number of correction procedures that can reduce the number of false positives, and the challenges associated with these correction procedures. WILLIAM HEDLEY THOMPSON*, JESSEY WRIGHT, PATRICK G BISSETT AND RUSSELL A POLDRACK Introduction sequential procedures correct for the latest in a In recent years, there has been a push to make non-simultaneous series of tests. There are sev- the scientific datasets associated with published eral proposed solutions to address multiple papers openly available to other researchers sequential analyses, namely a-spending and a- (Nosek et al., 2015). Making data open will investing procedures (Aharoni and Rosset, allow other researchers to both reproduce pub- 2014; Foster and Stine, 2008), which strictly lished analyses and ask new questions of existing control false positive rate. Here we will also pro- datasets (Molloy, 2011; Pisani et al., 2016). pose a third, a-debt, which does not maintain a The ability to explore pre-existing datasets in constant false positive rate but allows it to grow new ways should make research more efficient controllably. and has the potential to yield new discoveries Sequential correction procedures are harder *For correspondence: william. (Weston et al., 2019). to implement than simultaneous procedures as [email protected] The availability of open datasets will increase they require keeping track of the total number over time as funders mandate and reward data Competing interests: The of tests that have been performed by others. -
The Combination of Statistical Tests of Significance Wendell Carl Smith Iowa State University
Iowa State University Capstones, Theses and Retrospective Theses and Dissertations Dissertations 1974 The combination of statistical tests of significance Wendell Carl Smith Iowa State University Follow this and additional works at: https://lib.dr.iastate.edu/rtd Part of the Statistics and Probability Commons Recommended Citation Smith, Wendell Carl, "The ombc ination of statistical tests of significance " (1974). Retrospective Theses and Dissertations. 5120. https://lib.dr.iastate.edu/rtd/5120 This Dissertation is brought to you for free and open access by the Iowa State University Capstones, Theses and Dissertations at Iowa State University Digital Repository. It has been accepted for inclusion in Retrospective Theses and Dissertations by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. INFORMATION TO USERS This material was produced from a microfilm copy of the original document. While the most advanced technological means to photograph and reproduce this document have been used, the quality is heavily dependent upon the quality of the original submitted. The following explanation of techniques is provided to help you understand markings or patterns which may appear on this reproduction. 1. The sign or "target" for pages apparently lacking from the document photographed is "Missing Page(s)". If it was possible to obtain the missing page(s) or section, they are spliced into the film along with adjacent pages. This may have necessitated cutting thru an image and duplicating adjacent pages to insure you complete continuity. 2. When an image on the film is obliterated with a large round black mark, it is an indication that the photographer suspected that the copy may have moved during exposure and thus cause a blurred image. -
773593683.Pdf
LAKE AREA CHANGE IN ALASKAN NATIONAL WILDLIFE REFUGES: MAGNITUDE, MECHANISMS, AND HETEROGENEITY A DISSERTATION Presented to the Faculty of the University of Alaska Fairbanks In Partial Fulfillment of the Requirements for the Degree of DOCTOR OF PHILOSOPHY By Jennifer Roach, B.S. Fairbanks, Alaska December 2011 iii Abstract The objective of this dissertation was to estimate the magnitude and mechanisms of lake area change in Alaskan National Wildlife Refuges. An efficient and objective approach to classifying lake area from Landsat imagery was developed, tested, and used to estimate lake area trends at multiple spatial and temporal scales for ~23,000 lakes in ten study areas. Seven study areas had long-term declines in lake area and five study areas had recent declines. The mean rate of change across study areas was -1.07% per year for the long-term records and -0.80% per year for the recent records. The presence of net declines in lake area suggests that, while there was substantial among-lake heterogeneity in trends at scales of 3-22 km a dynamic equilibrium in lake area may not be present. Net declines in lake area are consistent with increases in length of the unfrozen season, evapotranspiration, and vegetation expansion. A field comparison of paired decreasing and non-decreasing lakes identified terrestrialization (i.e., expansion of floating mats into open water with a potential trajectory towards peatland development) as the mechanism for lake area reduction in shallow lakes and thermokarst as the mechanism for non-decreasing lake area in deeper lakes. Consistent with this, study areas with non-decreasing trends tended to be associated with fine-grained soils that tend to be more susceptible to thermokarst due to their higher ice content and a larger percentage of lakes in zones with thermokarst features compared to study areas with decreasing trends.