Sweating the Small Stuff: Does Data Cleaning and Testing of Assumptions Really Matter in the 21st Century?
Topic Editor: Jason W. Osborne
Frontiers in Psychology

COPYRIGHT STATEMENT
© Copyright 2007-2013 Frontiers Media SA. All rights reserved.
All content included on this site, such as text, graphics, logos, button icons, images, video/audio clips, downloads, data compilations and software, is the property of or is licensed to Frontiers Media SA (“Frontiers”) or its licensees and/or subcontractors. The copyright in the text of individual articles is the property of their respective authors, subject to a license granted to Frontiers.
The compilation of articles constituting this e-book, as well as all content on this site is the exclusive property of Frontiers. Images and graphics not forming part of user-contributed materials may not be downloaded or copied without permission.
Articles and other user-contributed materials may be downloaded and reproduced subject to any copyright or other notices. No financial payment or reward may be given for any such reproduction except to the author(s) of the article concerned.
As author or other contributor you grant permission to others to reproduce your articles, including any graphics and third-party materials supplied by you, in accordance with the Conditions for Website Use and subject to any copyright notices which you include in connection with your articles and materials.
All copyright, and all rights therein, are protected by national and international copyright laws.
The above represents a summary only. For the full conditions see the Conditions for Authors and the Conditions for Website Use.

ABOUT FRONTIERS
Frontiers is more than just an open-access publisher of scholarly articles: it is a pioneering approach to the world of academia, radically improving the way scholarly research is managed. The grand vision of Frontiers is a world where all people have an equal opportunity to seek, share and generate knowledge. Frontiers provides immediate and permanent online open access to all its publications, but this alone is not enough to realize our grand goals.

FRONTIERS JOURNAL SERIES
The Frontiers Journal Series is a multi-tier and interdisciplinary set of open-access, online journals, promising a paradigm shift from the current review, selection and dissemination processes in academic publishing. All Frontiers journals are driven by researchers for researchers; therefore, they constitute a service to the scholarly community. At the same time, the Frontiers Journal Series operates on a revolutionary invention, the tiered publishing system, initially addressing specific communities of scholars, and gradually climbing up to broader public understanding, thus serving the interests of the lay society, too.

DEDICATION TO QUALITY
Each Frontiers article is a landmark of the highest quality, thanks to genuinely collaborative interactions between authors and review editors, who include some of the world’s best academicians. Research must be certified by peers before entering a stream of knowledge that may eventually reach the public - and shape society; therefore, Frontiers only applies the most rigorous and unbiased reviews.
Frontiers revolutionizes research publishing by freely delivering the most outstanding research, evaluated with no bias from both the academic and social point of view. By applying the most advanced information technologies, Frontiers is catapulting scholarly publishing into a new generation.

WHAT ARE FRONTIERS RESEARCH TOPICS?
Frontiers Research Topics are very popular trademarks of the Frontiers Journals Series: they are collections of at least ten articles, all centered on a particular subject.
With their unique mix of varied contributions from Original Research to Review Articles, Frontiers Research Topics unify the most influential researchers, the latest key findings and historical advances in a hot research area!

Find out more on how to host your own Frontiers Research Topic or contribute to one as an author by contacting the Frontiers Editorial Office: [email protected]

Cover image provided by Ibbl sarl, Lausanne CH
ISSN 1664-8714
ISBN 978-2-88919-155-0
DOI 10.3389/978-2-88919-155-0

Frontiers in Psychology August 2013 | Sweating the Small Stuff: Does data cleaning and testing of assumptions really matter in the 21st century? | 1

SWEATING THE SMALL STUFF: DOES DATA CLEANING AND TESTING OF ASSUMPTIONS REALLY MATTER IN THE 21ST CENTURY?
Topic Editor: Jason W. Osborne, University of Louisville, USA

Modern statistical software makes it easier than ever to do thorough data screening/cleaning and to test assumptions associated with the analyses researchers perform. However, few authors (even in top-tier journals) seem to be reporting data cleaning/screening and testing assumptions associated with the statistical analyses being reported. Few popular textbooks seem to focus on these basics. In the 21st Century, with our complex modern analyses, is data screening and cleaning still relevant? Do outliers or extreme scores matter any more? Does having normally distributed variables improve analyses? Are there new techniques for screening or cleaning data that researchers should be aware of? Are most analyses robust to violations of most assumptions, to the point that researchers really don’t need to pay attention to assumptions any more? My goal for this special issue is to examine this issue with fresh eyes and 21st century methods.
I believe that we can demonstrate that these things do still matter, even when using “robust” methods or non-parametric techniques, and perhaps identify when they matter MOST or in what way they can most substantially affect the results of an analysis. I believe we can encourage researchers to change their habits through evidence-based discussions revolving around these issues. It is possible we can even convince editors of important journals to include these aspects in their evaluation/review criteria, as many journals in the social sciences have done with effect size reporting in recent years. I invite you to join me in demonstrating WHY paying attention to these mundane aspects of quantitative analysis can be beneficial to researchers.

Table of Contents

Is Data Cleaning and the Testing of Assumptions Relevant in the 21st Century?
Jason W. Osborne

Are Assumptions of Well-Known Statistical Techniques Checked, and Why (Not)?
Rink Hoekstra, Henk A. L. Kiers and Addie Johnson

Statistical Conclusion Validity: Some Common Threats and Simple Remedies
Miguel A. García-Pérez

Is Coefficient Alpha Robust to Non-Normal Data?
Yanyan Sheng and Zhaohui Sheng

The Assumption of a Reliable Instrument and Other Pitfalls to Avoid When Considering the Reliability of Data
Kim Nimon, Linda Reichwein Zientek and Robin K. Henson

Replication Unreliability in Psychology: Elusive Phenomena or “Elusive” Statistical Power?
Patrizio E. Tressoldi

Distribution of Variables by Method of Outlier Detection
W. Holmes Finch

Statistical Assumptions of Substantive Analyses Across the General Linear Model: A Mini-Review
Kim F. Nimon

Tools to Support Interpreting Multiple Regression in the Face of Multicollinearity
Amanda Kraha, Heather Turner, Kim Nimon, Linda Reichwein Zientek and Robin K. Henson

A Simple Statistic for Comparing Moderation of Slopes and Correlations
Michael Smithson

Old and New Ideas for Data Screening and Assumption Testing for Exploratory and Confirmatory Factor Analysis
David B. Flora, Cathy LaBrish and R. Philip Chalmers

On the Relevance of Assumptions Associated with Classical Factor Analytic Approaches†
Daniel Kasper and Ali Ünlü

How Predictable are “Spontaneous Decisions” and “Hidden Intentions”? Comparing Classification Results Based on Previous Responses with Multivariate Pattern Analysis of fMRI BOLD Signals
Martin Lages and Katarzyna Jaworska

Using Classroom Data to Teach Students About Data Cleaning and Testing Assumptions
Kevin Cummiskey, Shonda Kuiper and Rodney Sturdivant

EDITORIAL
published: 25 June 2013
doi: 10.3389/fpsyg.2013.00370

Is data cleaning and the testing of assumptions relevant in the 21st century?
Jason W. Osborne*
Educational and Counseling Psychology, University of Louisville, Louisville, Kentucky, USA
*Correspondence: [email protected]
Edited by: Axel Cleeremans, Université Libre de Bruxelles, Belgium

You must understand fully what your assumptions say and what they imply. You must not claim that the “usual assumptions” are acceptable due to the robustness of your technique unless you really understand the implications and limits of this assertion in the context of your application. And you must absolutely never use any statistical method without realizing that you are implic-

By the middle of the 20th century, researchers had assembled some evidence that some minimal violations of some assumptions had minimal effects on error rates under certain circumstances. In other words, if your variances are not exactly identical across all groups, but are relatively close, it is probably acceptable to interpret the results of that