Transformed Tests for Homogeneity of Variances and Means

Total Page:16

File Type:pdf, Size:1020Kb

Transformed Tests for Homogeneity of Variances and Means TRANSFORMED TESTS FOR HOMOGENEITY OF VARIANCES AND MEANS Md. Khairul Islam A Dissertation Submitted to the Graduate College of Bowling Green State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY August 2006 Committee: Hanfeng Chen, Advisor Michael Zickar Graduate Faculty Representative Arjun K. Gupta Truc T. Nguyen Junfeng Shang ii ABSTRACT Hanfeng Chen, Advisor The analysis of variance (ANOVA) is one of the most important and useful techniques for variety of fields such as agriculture, sociology and medicine for comparing different groups or treatments with respect to their means. A set of assumptions such as normal error distribution, homogeneity of variances and independence of observations, has to be made to employ an F test for equality of the treatment means. It is now well established that the violation of the assumption of homogeneity of variances can have severe effects on the inference of the population means, especially in the case of unequal sample sizes. In fact, the conventional ANOVA F provides generally poor control over both Type I and Type II error rates under a wide variety of variance heterogeneity conditions. Therefore, the problem of homogeneity of variances has to be settled before conducting an ANOVA. While a good number of tests are available for testing homogeneity of variances, Bartlett test and four versions of Levene tests are still popular for testing homogeneity of variances in the case of one-way ANOVA setting. It is evident that the Bartlett test is not as robust as Levene tests against the violation of the normality assumption. On the other hand, Levene tests are less powerful than the Bartlett test. In this dissertation, we proposed a transformed version of Bartlett test where the transformation is intended to achieve the normality of the data and independence of the observations to some extent. It is evident from the simulation that the transformed Bartlett test is more robust than the untransformed Bartlett test against the iii violation of the normality assumption. It also follows that the transformed Bartlett test is a balance between the Bartlett test and the four versions of the Levene tests in terms of Type I error rate and power concern. While, the estimation of location parameter is of concern, a modified version of trimmed mean has been proposed as an alternative to trimmed mean when the distribution is skewed or contains outliers. It is evident from the simulation study that modified trimmed mean outperform trimmed mean in terms of coverage probability of interval estimate. It is also evident that the test based on the modified version of trimmed mean is suitable for estimating Type I error rate and is more powerful than the tests based on the mean and the trimmed mean in the presence of the outliers. Finally, an alternative method of estimating transformation parameter has been pro- posed by using the chi-squared goodness of fit criteria which could be useful for testing equality of two population means. iv ACKNOWLEDGMENTS I would like to thank Professor Hanfeng Chen, my supervisor, for his thoughtful suggestions, guidance and constant support during this research. I would also like to express my thanks and gratitude to the members of my committee, Professors Arjun K. Gupta, Truc T. Nguyen, Michael Zickar and Junfeng Shang, for their time and advice. I am grateful to the Department of Mathematics and Statistics for providing me teaching assistantship and a wonderful research environment. My special thanks go to Mary Busdeker, Marcia Seubert and Cyndi Patterson for their assistance and cooperation. I am thankful to Sherwin and Mike for their cooperation and accompaniment during my stay at Bowling Green State University. I certainly have to thank my elder brother Md. Azhar Ali Miah and my sister-in-law who inspired me throughout my education. Without their sacrifice this work could never have come to existence. I also extend my gratitude to my parents-in-law and my children, Sadia and Towfique, for their love and patience what I badly needed to complete my work. Finally, my sincere and deepest gratitude goes to my wife Tanweer Shapla for her encouragement and support throughout my graduate study. Bowling Green, Ohio Md. Khairul Islam August, 2006 v TABLE OF CONTENTS CHAPTER 1: INTRODUCTION AND SUMMARY 1 1.1 Introduction.................................... 1 1.2 TheClassicalModel ............................... 2 1.3 The Violation of Assumptions in Classical Model . ........ 3 1.4 DevelopingtheTransformation . .... 4 1.5 The Box-Cox Transformation and Some Alternative Versions......... 9 1.6 Estimation of λ with Box-Cox Transformed Model . 11 1.7 ThesisSummary ................................. 15 CHAPTER2:TESTINGHOMOGENEITYOFVARIANCES 17 2.1 Introduction.................................... 17 2.2 TestsforHomogeneityofVariances . ..... 20 2.2.1 Bartlett χ2 test.............................. 20 2.2.2 Levene1test ............................... 22 2.2.3 Levene2test ............................... 23 2.2.4 Levene3test ............................... 24 2.2.5 Levene4test ............................... 24 2.2.6 Jackknifetest ............................... 24 2.2.7 BoxTest.................................. 27 2.3 NewTransformedTestProposed . .. 28 vi 2.3.1 Thechoiceoftransformation. .. 31 2.3.2 TransformedBartletttest . 33 2.4 ApplicationstoRealLifeSituations . ....... 33 2.4.1 Example1:Two-samplecase . 34 2.4.2 Example 2: Three-sample case . 36 2.4.3 Example3:Four-samplecase . 39 2.5 MonteCarloStudy ................................ 42 2.5.1 Simulationtechnique . 42 2.5.2 Simulation results and discussions . ..... 43 CHAPTER 3:ESTIMATION BY MODIFIED TRIMMED MEAN 55 3.1 Introduction.................................... 55 3.2 SampleQuantile ................................. 56 3.3 TrimmedMean .................................. 56 3.4 ModifiedTrimmedMean ............................. 57 3.5 Properties of Modified Trimmed Mean . ... 57 3.5.1 Someusefulresults ............................ 57 3.5.2 Asymptotic distribution of modified trimmed mean . ...... 58 3.6 Applications.................................... 60 3.7 MonteCarloStudy ................................ 63 3.7.1 Simulationtechnique . 63 3.7.2 Resultsanddiscussion . 65 CHAPTER4:TRANSFORMEDTESTSFORMEANS 72 vii 4.1 Introduction.................................... 72 4.2 Box-Cox Transformation for Positive Random Variables . ........... 73 4.2.1 Transformed trimmed t-test ....................... 75 4.2.2 Transformed modified trimmed t-test .................. 75 4.3 MonteCarloStudy ................................ 76 4.3.1 Simulationtechnique . 76 4.3.2 Simulationsresults . 77 4.3.3 Concludingremarks............................ 78 CHAPTER 5: TRANSFORMATION BY GOODNESS OF FIT 83 5.1 Introduction.................................... 83 5.2 Estimation of λ byGoodnessofFitTest . .. 85 5.2.1 Motivationforgoodnessoffittest . ... 85 5.2.2 χ2 goodness of fit in estimation of λ ................... 85 5.3 ApplicationstoReallife . ... 87 5.4 SimulationandDiscussion . ... 91 REFERENCES 95 Appendix A: CODE FOR HOMOGENEITY OF VARIANCES 109 A.1 Computation of Power in Three-sample Case . ...... 109 Appendix B: CODE FOR ESTIMATION AND TESTING FOR MEANS 114 B.1 Coverage Probability and Mean Length of CI . ..... 114 B.2 Computation of Power in Two-sample t Tests ................. 115 viii LIST OF TABLES 2.1 Survival times for bile duct cancer patients . ......... 34 2.2 Some descriptive statistics for data in Table 2.1 . .......... 35 2.3 Results of Lilliefors test for data in Table 2.1 . .......... 35 2.4 Different test statistics with corresponding p-values for data in Table 2.1 . 36 2.5 Lifetimes of steel specimens tested at different stress levels .......... 37 2.6 Skewness and sample variances for data in Table 2.5 . ......... 38 2.7 Summary statistics by Lilliefors test for data in Table 2.5........... 38 2.8 Results of different tests with corresponding p-values for data in Table 2.5 . 39 2.9 Times (in hours) between successive failures of air conditioning equipment by fourBoeing720aircrafts . 40 2.10 Some descriptive statistics for data in Table 2.9 . ........... 40 2.11 Summary statistics by Lilliefors test for data in Table 2.9........... 41 2.12 Different tests with corresponding p-valuesfordatainTable2.9 . 42 2.13 Characteristics of distributions used in simulations ............... 44 2.14 Estimated testing size for two-sample cases . .......... 46 2.15 Estimated testing power for variance ratio 1:4 . .......... 48 2.16 Estimated testing size and power for K = 3 and LN(0, 0.5).......... 50 2.17 Estimated testing size and power for K = 3 and B(6, 1.5)........... 51 2.18 Estimated testing size and power for K = 3 and G(2, 1)............ 52 2.19 Estimated testing size and power for K = 3 and Exp(1)............ 53 ix 2 2.20 Estimated testing size and power for K = 3 and χ5 .............. 54 3.1 Lifetimes of transistors in an accelerated life test . ............. 60 3.2 The 95% confidence intervals for transistor data . ......... 61 3.3 Thetimeofappearanceofcarcinomainrats . ...... 62 3.4 The 95% confidence intervals for carcinoma data . ........ 63 3.5 Some characteristics of the distributions used in simulations ......... 64 3.6 Simulated coverage probability and mean length of intervalestimators . 66 3.7 Simulated coverage probability and mean length of interval estimators with constantoutlierinthepopulation . ... 69 4.1 Estimatedsizeofdifferenttests . ..... 79 4.2 Estimatedpowerofdifferenttests . ..... 80 4.3 Estimated
Recommended publications
  • Assessing Heterogeneity in Meta-Analysis: Q Statistic Or I2 Index? Tania Huedo-Medina University of Connecticut, [email protected]
    University of Connecticut OpenCommons@UConn Center for Health, Intervention, and Prevention CHIP Documents (CHIP) 6-1-2006 Assessing heterogeneity in meta-analysis: Q statistic or I2 index? Tania Huedo-Medina University of Connecticut, [email protected] Julio Sanchez-Meca University of Murcia, Spane Fulgencio Marin-Martinez University of Murcia, Spain Juan Botella Autonoma University of Madrid, Spain Follow this and additional works at: https://opencommons.uconn.edu/chip_docs Part of the Psychology Commons Recommended Citation Huedo-Medina, Tania; Sanchez-Meca, Julio; Marin-Martinez, Fulgencio; and Botella, Juan, "Assessing heterogeneity in meta-analysis: Q statistic or I2 index?" (2006). CHIP Documents. 19. https://opencommons.uconn.edu/chip_docs/19 ASSESSING HETEROGENEITY IN META -ANALYSIS: Q STATISTIC OR I2 INDEX? Tania B. Huedo -Medina, 1 Julio Sánchez -Meca, 1 Fulgencio Marín -Martínez, 1 and Juan Botella 2 Running head: Assessing heterogeneity in meta -analysis 2006 1 University of Murcia, Spa in 2 Autónoma University of Madrid, Spain Address for correspondence: Tania B. Huedo -Medina Dept. of Basic Psychology & Methodology, Faculty of Psychology, Espinardo Campus, Murcia, Spain Phone: + 34 968 364279 Fax: + 34 968 364115 E-mail: [email protected] * This work has been supported by Plan Nacional de Investigaci ón Cient ífica, Desarrollo e Innovaci ón Tecnol ógica 2004 -07 from the Ministerio de Educaci ón y Ciencia and by funds from the Fondo Europeo de Desarrollo Regional, FEDER (Proyect Number: S EJ2004 -07278/PSIC). Assessing heterogeneity in meta -analysis 2 ASSESSING HETEROGENEITY IN META -ANALYSIS: Q STATISTIC OR I2 INDEX? Abstract In meta -analysis, the usual way of assessing whether a set of single studies are homogeneous is by means of the Q test.
    [Show full text]
  • Statistical Analysis in JASP
    Copyright © 2018 by Mark A Goss-Sampson. All rights reserved. This book or any portion thereof may not be reproduced or used in any manner whatsoever without the express written permission of the author except for the purposes of research, education or private study. CONTENTS PREFACE .................................................................................................................................................. 1 USING THE JASP INTERFACE .................................................................................................................... 2 DESCRIPTIVE STATISTICS ......................................................................................................................... 8 EXPLORING DATA INTEGRITY ................................................................................................................ 15 ONE SAMPLE T-TEST ............................................................................................................................. 22 BINOMIAL TEST ..................................................................................................................................... 25 MULTINOMIAL TEST .............................................................................................................................. 28 CHI-SQUARE ‘GOODNESS-OF-FIT’ TEST............................................................................................. 30 MULTINOMIAL AND Χ2 ‘GOODNESS-OF-FIT’ TEST. ..........................................................................
    [Show full text]
  • Logistic and Linear Regression Assumptions
    PharmaSUG 2019 - Paper ST-058 Logistic and Linear Regression Assumptions: Violation Recognition and Control Deanna Schreiber-Gregory, Henry M Jackson Foundation Karlen Bader, Henry M Jackson Foundation ABSTRACT Regression analyses are one of the first steps (aside from data cleaning, preparation, and descriptive analyses) in any analytic plan, regardless of plan complexity. Therefore, it is worth acknowledging that the choice and implementation of the wrong type of regression model, or the violation of its assumptions, can have detrimental effects to the results and future directions of any analysis. Considering this, it is important to understand the assumptions of these models and be aware of the processes that can be utilized to test whether these assumptions are being violated. Given that logistic and linear regression techniques are two of the most popular types of regression models utilized today, these are the are the ones that will be covered in this paper. Some Logistic regression assumptions that will reviewed include: dependent variable structure, observation independence, absence of multicollinearity, linearity of independent variables and log odds, and large sample size. For Linear regression, the assumptions that will be reviewed include: linearity, multivariate normality, absence of multicollinearity and auto-correlation, homoscedasticity, and measurement level. This paper is intended for any level of SAS® user. This paper is also written to an audience with a background in theoretical and applied statistics, though the information within will be presented in such a way that any level of statistics/mathematical knowledge will be able to understand the content. INTRODUCTION We can be certain that all parametric tests in a statistical analysis assume some certain characteristics (or assumptions) about the data.
    [Show full text]
  • Comparing the Statistical Tests for Homogeneity of Variances. Zhiqiang Mu East Tennessee State University
    East Tennessee State University Digital Commons @ East Tennessee State University Electronic Theses and Dissertations Student Works 8-2006 Comparing the Statistical Tests for Homogeneity of Variances. Zhiqiang Mu East Tennessee State University Follow this and additional works at: https://dc.etsu.edu/etd Part of the Statistical Theory Commons Recommended Citation Mu, Zhiqiang, "Comparing the Statistical Tests for Homogeneity of Variances." (2006). Electronic Theses and Dissertations. Paper 2212. https://dc.etsu.edu/etd/2212 This Thesis - Open Access is brought to you for free and open access by the Student Works at Digital Commons @ East Tennessee State University. It has been accepted for inclusion in Electronic Theses and Dissertations by an authorized administrator of Digital Commons @ East Tennessee State University. For more information, please contact [email protected]. Comparing the Statistical Tests for Homogeneity of Variances _______________________________ A thesis presented to the faculty of the Department of Mathematics East Tennessee State University In partial fulfillment of the requirements for the degree Master of Science in Mathematical Sciences _______________________________ by Zhiqiang Mu August, 2006 _______________________________ Edith Seier, PhD., Chair Robert Price, PhD. Yali Liu, PhD. Key words: Equality of Variances, Robust, ANOVA, Resampling, Bootstrap Test, Permutation Test ABSTRACT Comparing the Statistical Tests for Homogeneity of Variances by Zhiqiang Mu Testing the homogeneity of variances is an important problem in many applications since statistical methods of frequent use, such as ANOVA, assume equal variances for two or more groups of data. However, testing the equality of variances is a difficult problem due to the fact that many of the tests are not robust against non-normality.
    [Show full text]
  • Distribution-Free Analysis of Homogeneity
    Distribution-free Analysis of Homogeneity Dissertation Max Wornowizki TU Dortmund University Faculty of Statistics 14.08.2015 Supervisor: Prof. Dr. Roland Fried CONTENTS 1 Contents 1 Introduction 4 2 Demixing Empirical Distribution Functions 9 2.1 Motivation . .9 2.2 Problem Definition . 12 2.3 Demixing Algorithm . 18 2.3.1 Main Algorithm . 18 2.3.2 Shrink-Down Algorithm . 22 2.3.3 Shrink-Up Algorithm . 22 2.3.4 Binary Search Algorithm . 24 2.3.5 Normalisation Algorithm . 25 2.4 Theoretical Analysis . 28 2.4.1 Correctness of the Algorithm . 28 2.4.2 Runtime Analysis . 40 2.5 Practical Evaluation . 41 2.5.1 Performance and Runtime . 43 2.5.2 Estimated Shrinkage Factors under the Null Hypothesis . 47 2.5.3 Application to Astrophysical Data . 48 2.5.4 Application to Bioinformatical Data . 51 2.6 Related Methods . 54 3 Two-sample Tests based on Divergences 58 3.1 Divergence Measures . 59 3.2 Divergence Estimation . 61 2 CONTENTS 3.2.1 Density Ratio Estimation . 61 3.2.2 Divergence Estimation using Density Ratio Estimates . 65 3.2.3 Comparison of the Divergence Estimators . 68 3.3 Testing Homogeneity based on Divergences . 71 3.3.1 Divergence-based Tests . 71 3.3.2 Comparison of Divergence-based Tests . 73 3.3.3 Application to Biometrical Data . 79 3.4 Conclusions and Extensions . 80 4 Testing Time Series for a Constant Volatility 84 4.1 Framework . 84 4.2 Test Statistics for Volatility Changes . 86 4.3 Testing for a Global Constant Volatility . 90 4.3.1 Testing Procedure .
    [Show full text]
  • Meta-Analysis: Heterogeneity and Publication Bias
    Department of Health Sciences M.Sc. Module: Systematic Reviews Meta-analysis: heterogeneity and publication bias Martin Bland Professor of Health Statistics University of York http://martinbland.co.uk/msc/ Heterogeneity Galbraith plots Meta-regression Random effects models Publication bias Funnel plots Begg and Eggar tests Trim and fill Selection modelling Meta-regression Heterogeneity Studies differ in terms of – Patients – Interventions – Outcome definitions – Design Clinical heterogeneity – Variation in true treatment or risk factor effects in magnitude or direction Statistical heterogeneity 1 Heterogeneity Statistical heterogeneity may be caused by – clinical differences between studies – methodological differences between studies – unknown study characteristics Even if studies are clinically homogeneous there may be statistical heterogeneity Heterogeneity How to identify statistical heterogeneity Test the null hypothesis that the studies all have the same effect in the population. The test looks at the differences between observed effects for the studies and the pooled effect estimate. Square, divide by variance, sum. This gives a chi-squared test with degrees of freedom = number of studies – 1. Expected chi-squared if null hypothesis true = degrees of freedom. Heterogeneity Test for heterogeneity: 2 = 4.91, df = 2, P=0.086 2 Heterogeneity Heterogeneity not significant No statistical evidence for difference between studies But, test for heterogeneity has low power - the number of studies is usually low - and may fail
    [Show full text]
  • Package 'Homtest'
    Package ‘homtest’ February 20, 2015 Version 1.0-5 Date 2009-03-26 Title Homogeneity tests for Regional Frequency Analysis Author Alberto Viglione Maintainer Alberto Viglione <[email protected]> Description A collection of homogeneity tests described in: Viglione A., Laio F., Claps P. (2007) ``A comparison of homogeneity tests for regional frequency analysis'', Water Resources Research, 43, W03428, doi:10.1029/2006WR005095. More on Regional Frequency Analysis can be found in package nsRFA. Depends stats License GPL (>= 2) URL http://www.idrologia.polito.it/~alviglio Repository CRAN Date/Publication 2012-11-05 10:30:42 NeedsCompilation no R topics documented: annualflows . .2 GEV.............................................2 GUMBEL . .4 HOMTESTS . .6 KAPPA ........................................... 11 Lmoments . 14 Index 16 1 2 GEV annualflows Data-sample Description Total annual flow, expressed in mm, of 47 stations in Piemonte (Italy). Usage annualflows Format Data.frame containing annual flow data of 47 stations. Examples data(annualflows) annualflows summary(annualflows) x <- annualflows["dato"][,] cod <- annualflows["cod"][,] split(x,cod) sapply(split(x,cod),mean) sapply(split(x,cod),median) sapply(split(x,cod),quantile) sapply(split(x,cod),Lmoments) GEV Three parameter generalized extreme value distribution and L- moments Description GEV provides the link between L-moments of a sample and the three parameter generalized extreme value distribution. Usage f.GEV (x, xi, alfa, k) F.GEV (x, xi, alfa, k) invF.GEV (F, xi, alfa, k)
    [Show full text]
  • ANOVA Assumptions
    ANOVA Assumptions “It is the mark of a truly intelligent person to be moved by statistics” George Bernard Shaw (co-founder of the London School of Economics) ANOVA Assumptions 1. The experimental errors of your data are normally distributed 2. Equal variances between treatments Homogeneity of variances Homoscedasticity 3. Independence of samples Each sample is randomly selected and independent Assumption #1: Experimental errors are normally distributed “If I was to repeat my sample repeatedly and calculate the means, those means would be normally distributed.” Determine this by looking at the residuals of your sample: residuals : subtract overall mean from the sample means Residuals FARM 1 C B 푥 A 퐴퐵퐶퐹푎푟푚1 508 514.25 583.25 727.5 푥 푥 푥 푐퐹푎푟푚1 퐵퐹푎푟푚1 퐴퐹푎푟푚1 Calculate residuals in R: res = residuals(lm(YIELD~VARIETY)) One-Way ANOVA model=aov(YIELD~VARIETY) #Build a model with the normal ANOVA command res=model$residuals #Create an object of the residuals of Y Assumption #1: Experimental errors are normally distributed Testing for Normality – Shapiro Wilks Test Tests the hypotheses: 퐻푂: 푑푖푠푡푟푖푏푢푡푖표푛 표푓 푟푒푠푖푑푢푎푙푠 = 푛표푟푚푎푙 푑푖푠푡푟푖푏푢푡푖표푛 퐻푎: 푑푖푠푡푟푖푏푢푡푖표푛 표푓 푟푒푠푖푑푢푎푙푠 ≠ 푛표푟푚푎푙 푑푖푠푡푟푖푏푢푡푖표푛 Non-Significant p-value = NORMAL distribution 푛 2 푎푖= constants generated from the means, variances 푖=1 푎푖푥 푖 and covariances of the order statistics of a sample of 푊 = 푛 2 푖=1 푥푖 − 푥 size n from a normal distribution (complex) 푥 푖 = ordered sample values (x(1) is the smallest) Small values of W are evidence of departure from normality Shapiro-Wilks Test in
    [Show full text]
  • Meta-Analysis Fixed Effect Vs. Random Effects
    Meta-Analysis Fixed effect vs. random effects Michael Borenstein Larry Hedges Hannah Rothstein www.Meta-Analysis.com © 2007 Borenstein, Hedges, Rothstein | 1 Section: Fixed effect vs. random effects models..................................................4 Overview ...........................................................................................................4 Fixed effect model.............................................................................................6 Definition of a combined effect.......................................................................6 Assigning weights to the studies....................................................................6 Random effects model ....................................................................................11 Definition of a combined effect.....................................................................11 Assigning weights to the studies..................................................................12 Fixed effect vs. random effects models ...........................................................22 The concept.................................................................................................22 Definition of a combined effect.....................................................................22 Computing the combined effect ...................................................................23 Extreme effect size in large study................................................................23 Extreme effect size in small study................................................................25
    [Show full text]
  • Non Parametric L-Moment Homogeneity Test
    Fast and direct nonparametric procedures in the L-moment homogeneity test Pierre Masselot1*, Fateh Chebana1, Taha B.M.J. Ouarda1,2 February, 2015 1Centre Eau-Terre-Environnement (ETE), Institut national de la recherche scientifique (INRS), 490 de la Couronne, Québec, QC, G1K 9A9, Canada 2 Institute center for water and environment (iWATER), Masdar institute of science and technology, PO Box 54224, Abu Dhabi, United Arab Emirates * Corresponding author : [email protected] 1 Abstract Regional frequency analysis is an important tool to properly estimate hydrological characteristics at ungauged or partially gauged sites in order to prevent hydrological disasters. The delineation of homogeneous groups of sites is an important first step in order to transfer information and obtain accurate quantile estimates at the target site. The Hosking-Wallis homogeneity test is usually used to test the homogeneity of the selected sites. Despite its usefulness and good power, it presents some drawbacks including the subjective choice of a parametric distribution for the data and a poorly justified rejection threshold. The present paper addresses these drawbacks by integrating nonparametric procedures in the L-moment homogeneity test. To assess the rejection threshold, three resampling methods (permutation, bootstrap and Pólya resampling) are considered. Results indicate that permutation and bootstrap methods perform better than the parametric Hosking-Wallis test in terms of power as well as in time and procedure simplicity. A real-world case study shows that the nonparametric tests agree with the HW test concerning the homogeneity of the volume and the bivariate case while they disagree for the peak case, but that the assumptions of the HW test are not well respected.
    [Show full text]
  • Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy
    Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy Chapter 10 Analysing and Presenting Results Petra Macaskill, Constantine Gatsonis, Jonathan Deeks, Roger Harbord, Yemisi Takwoingi. Version 1.0 Released December 23rd 2010. ©The Cochrane Collaboration Please cite this version as: Macaskill P, Gatsonis C, Deeks JJ, Harbord RM, Takwoingi Y. Chapter 10: Analysing and Presenting Results. In: Deeks JJ, Bossuyt PM, Gatsonis C (editors), Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy Version 1.0. The Cochrane Collaboration, 2010. Available from: http://srdta.cochrane.org/. Saved date and time 23/12/2010 15:08 Jon Deeks 1 | Page Contents 10.1 Introduction ....................................................................................................................................................... 4 10.1.1 Aims of meta-analysis for DTA reviews....................................................................................................... 4 10.1.2 When not to use a meta-analysis in a review ................................................................................................ 5 10.1.3 How does meta-analysis of diagnostic test accuracy differ from meta-analysis of interventions? ................ 5 10.1.4 Questions which can be addressed in DTA analyses .................................................................................... 6 10.1.4.1 What is the accuracy of a test? ...........................................................................................................
    [Show full text]
  • If Assumptions of Normality, Homogeneity of Variance, and Inde- Pendence -Of Errors Are Met
    EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT 1974, 34, 789-799. AN EMPIRICAL COMPARISON OF THE ANOVA F-TEST, NORMAL SCORES TEST AND KRUSKAL-WALLIS TEST UNDER VIOLATION OF ASSUMPTIONS BETTY J. FEIR-WALSH University of South Carolina LARRY E. TOOTHAKER University of Oklahoma The present research compares the ANOVA F-test, the Kruskal- Wallis test, and the normal scores test in terms of empirical alpha and empirical power with samples from the normal distribution and two exponential distributions. Empirical evidence supports the use of the ANOVA F-test even under violation of assumptions when testing hypotheses about means. If the researcher is willing to test hypotheses about medians, the Kruskal-Wallis test was found to be competitive to the F-test. However, in the cases investigated, the normal scores test was not consistently better than the F-test or the Kruskal-Wallis test and could not be recom- mended on the basis of this research. A common problem in applied research is to decide whether or not sample differences in central tendency reflect true differences in parent populations. It is appropriate to use the one-way fixed effects ANOVA F-test for the k-sample case (two or more groups) if assumptions of normality, homogeneity of variance, and inde- pendence -of errors are met. When normality and/or equality of vari- ance are doubtful, current literature recommends the use of non- parametric statistical procedures. Two nonparametric counterparts to F are the Kruskal-Wallis rank test (Kruskal, 1952) and the expected normal scores test, which used normalized observations in the place of ranks (McSweeney and Penfield, 1969).
    [Show full text]