Module 5: Statistical Analyses

Total Page:16

File Type:pdf, Size:1020Kb

Module 5: Statistical Analyses R. Pitt April 5, 2007 Module 5: Statistical Analyses INTRODUCTION .......................................................................................................................... 2 GENERAL STEPS IN THE ANALYSIS OF DATA .................................................................. 2 EXPERIMENTAL DESIGN .................................................................................................................... 3 Sample size................................................................................................................................... 4 Determination of Outliers ............................................................................................................ 5 SELECTION OF STATISTICAL PROCEDURES ........................................................................................ 5 Statistical Power .......................................................................................................................... 5 Comparison Tests......................................................................................................................... 5 Data Associations and Model Building........................................................................................ 7 EXPLORATORY DATA ANALYSES ...................................................................................................... 8 Basic Data Plots........................................................................................................................... 8 Probability Plots .......................................................................................................................... 8 Digidot Plot................................................................................................................................ 13 Scatterplots................................................................................................................................. 14 Grouped Box and Whisker Plots................................................................................................ 17 COMPARING MULTIPLE SETS OF DATA WITH GROUP COMPARISON TESTS...................................... 19 Simple Comparison Tests with Two Groups .............................................................................. 20 Comparisons of Many Groups ................................................................................................... 23 DATA ASSOCIATIONS ...................................................................................................................... 23 Correlation Matrices.................................................................................................................. 24 Hierarchical Cluster Analyses ...................................................................................................26 Principal Component Analyses (PCA) and Factor Analyses..................................................... 29 ANALYSIS OF TRENDS IN RECEIVING WATER INVESTIGATIONS....................................................... 33 Preliminary Evaluations before Trend Analyses are Used........................................................ 34 Statistical Methods Available for Detecting Trends................................................................... 35 Example of Long-Term Trend Analyses for Lake Rönningesjön, Sweden ................................. 35 EXAMPLE STORMWATER DATA ANALYSIS.................................................................... 50 SAMPLING EFFORT AND BASIC DATA PRESENTATIONS ................................................................... 50 SUMMARY OF DATA ........................................................................................................................ 55 Data Summaries......................................................................................................................... 55 EXPLORATORY DATA ANALYSIS OF RAINFALL AND RUNOFF CHARACTERISTICS FOR URBAN AREAS57 EVALUATION OF DATA GROUPINGS AND ASSOCIATIONS................................................................. 64 Exploratory Data Analyses ........................................................................................................ 64 Simple Correlation Analyses...................................................................................................... 68 Complex Correlation Analyses................................................................................................... 72 Model Building........................................................................................................................... 78 “Outliers” and Extreme Observations....................................................................................... 99 STATISTICAL EVALUATION OF A WATER TREATMENT CONTROL DEVICE; THE UPFLOW FILTER ....................................................................................................................................... 107 CONTROLLED EXPERIMENTS ......................................................................................................... 107 ACTUAL STORM EVENT MONITORING........................................................................................... 108 OTHER EXPLORATORY DATA METHODS USED TO EVALUATE STORMWATER CONTROLS.............. 114 EVALUATION OF BACTERIA DECAY COEFFICIENTS FOR FATE ANALYSES ..... 120 FATE MECHANISMS FOR MICROORGANISMS.................................................................................. 120 DECAY RATE CURVES OF LAKE MICROORGANISMS ...................................................................... 125 REFERENCES ........................................................................................................................... 127 APPENDIX A: FACTORIAL ANALYSES EXAMPLES...................................................... 130 EXAMPLES OF AN EXPERIMENTAL DESIGN USING FACTORIAL ANALYSES: SEDIMENT SCOUR....... 130 Introduction.............................................................................................................................. 130 Experimental Design................................................................................................................ 132 Results ...................................................................................................................................... 132 EXAMPLE USING FACTORIAL ANALYSES TO EVALUATE EXISTING DATA: LAKE TUSCALOOSA WATER QUALITY 139 Introduction.............................................................................................................................. 139 Experimental Design for Lake Tuscaloosa .............................................................................. 141 Experimental Design and Factorial Analysis for the North River site .................................... 141 Summary................................................................................................................................... 143 FACTORIAL ANALYSIS USED IN MODELING THE FATES OF POLYCYCLIC AROMATIC HYDROCARBONS (PAHS) AFFECTING TREATABILITY OF STORMWATER................................................................................ 145 Abstract .................................................................................................................................... 145 Introduction.............................................................................................................................. 145 Methodology............................................................................................................................. 146 Results ...................................................................................................................................... 147 Conclusions.............................................................................................................................. 151 Acknowledgements ................................................................................................................... 152 APPENDIX B: EXAMPLES FOR SPECIFIC STATISTICAL TESTS ............................... 153 PROBABILITY PLOT PREPARATION USING EXCEL .......................................................................... 153 COMPARISONS OF TWO SETS OF DATA USING EXCEL .................................................................... 162 Paired Tests: ............................................................................................................................ 162 Independent Tests: ................................................................................................................... 163 EXAMPLE OF ANOVA USING EXCEL............................................................................................. 164 EXAMPLE REGRESSION ANALYSIS USING EXCEL........................................................................... 165 OTHER STATISTICAL TESTS AVAILABLE IN EXCEL........................................................................ 169 WILCOXON RANK-SUM TEST ........................................................................................................ 170 Introduction Statistical analyses are a critical component of research. The analyses that are to be conducted for a specific research activity must be carefully thought out in advance of any data collection and be an integral component of the experimental design activities. This module reviews a number of statistical tests that have been useful for a variety of water quality projects conducted by the author. The field of statistical
Recommended publications
  • Statistical Analysis in JASP
    Copyright © 2018 by Mark A Goss-Sampson. All rights reserved. This book or any portion thereof may not be reproduced or used in any manner whatsoever without the express written permission of the author except for the purposes of research, education or private study. CONTENTS PREFACE .................................................................................................................................................. 1 USING THE JASP INTERFACE .................................................................................................................... 2 DESCRIPTIVE STATISTICS ......................................................................................................................... 8 EXPLORING DATA INTEGRITY ................................................................................................................ 15 ONE SAMPLE T-TEST ............................................................................................................................. 22 BINOMIAL TEST ..................................................................................................................................... 25 MULTINOMIAL TEST .............................................................................................................................. 28 CHI-SQUARE ‘GOODNESS-OF-FIT’ TEST............................................................................................. 30 MULTINOMIAL AND Χ2 ‘GOODNESS-OF-FIT’ TEST. ..........................................................................
    [Show full text]
  • Statistical Analysis in JASP V0.10.0- a Students Guide .Pdf
    2nd Edition JASP v0.10.0 June 2019 Copyright © 2019 by Mark A Goss-Sampson. All rights reserved. This book or any portion thereof may not be reproduced or used in any manner whatsoever without the express written permission of the author except for the purposes of research, education or private study. CONTENTS PREFACE .................................................................................................................................................. 1 USING THE JASP ENVIRONMENT ............................................................................................................ 2 DATA HANDLING IN JASP ........................................................................................................................ 7 JASP ANALYSIS MENU ........................................................................................................................... 10 DESCRIPTIVE STATISTICS ....................................................................................................................... 12 EXPLORING DATA INTEGRITY ................................................................................................................ 21 DATA TRANSFORMATION ..................................................................................................................... 29 ONE SAMPLE T-TEST ............................................................................................................................. 33 BINOMIAL TEST ....................................................................................................................................
    [Show full text]
  • Null Hypothesis Against an Alternative Hypothesis
    Medical Statistics with R Dr. Gulser Caliskan Prof. Giuseppe Verlato Unit of Epidemiology and Medical Statistics Department of Diagnostics and Public Health University of Verona, Italy LESSON 4 INDEX 1. Independent T-Test 2. Wilcoxon Rank-Sum (Mann-Whitney U Test) 3. Paired T-Test 4. Wilcoxon Sign Test 5. ANOVA (Analysis of Varyans) 6. Kruskal Wallis H Test Classical Hypothesis Testing Test of a null hypothesis against an alternative hypothesis. There are five steps, the first four of which should be done before inspecting the data. Step 1. Declare the null hypothesis H0 and the alternative hypothesis H1. Types Of Hypotheses A hypothesis that completely specifies the parameters is called simple. If it leaves some parameter undetermined it is composite. A hypothesis is one-sided if it proposes that a parameter is > some value or < some value; it is two-sided if it simply says the parameter is /= some value. Types of Error Rejecting H0 when it is actually true is called a Type I Error. In biomedical settings it can be considered a false positive. (Null hypothesis says “nothing is happening” but we decide“there is disease”.) Step 2. Specify an acceptable level of Type I error, α, normally 0.05 or 0.01. This is the threshold used in deciding to reject H0 or not. If α = 0.05 and we determine the probability of our data assuming H0 is 0.0001, then we reject H0. The Test Statistic Step 3. Select a test statistic. This is a quantity calculated from the data whose value leads me to reject the null hypothesis or not.
    [Show full text]
  • One-Way Analysis of Variance
    NCSS Statistical Software NCSS.com Chapter 210 One-Way Analysis of Variance Introduction This procedure performs an F-test from a one-way (single-factor) analysis of variance, Welch’s test, the Kruskal- Wallis test, the van der Waerden Normal-Scores test, and the Terry-Hoeffding Normal-Scores test on data contained in either two or more variables or in one variable indexed by a second (grouping) variable. The one- way analysis of variance compares the means of two or more groups to determine if at least one group mean is different from the others. The F-ratio is used to determine statistical significance. The tests are non-directional in that the null hypothesis specifies that all means are equal and the alternative hypothesis simply states that at least one mean is different. This procedure also checks the equal variance (homoscedasticity) assumption using the Levene test, Brown- Forsythe test, Conover test, and Bartlett test. Kinds of Research Questions One of the most common tasks in research is to compare two or more populations (groups). We might want to compare the income level of two regions, the nitrogen content of three lakes, or the effectiveness of four drugs. The first question that arises concerns which aspects (parameters) of the populations we should compare. We might consider comparing the means, medians, standard deviations, distributional shapes (histograms), or maximum values. We base the comparison parameter on our particular problem. One of the simplest comparisons we can make is between the means of two or more populations. If we can show that the mean of one population is different from that of the other populations, we can conclude that the populations are different.
    [Show full text]
  • Statistical Methods in Water Resources
    Techniques of Water-Resources Investigations of the United States Geological Survey Book 4, Hydrologic Analysis and Interpretation Chapter A3 Statistical Methods in Water Resources By D.R. Helsel and R.M. Hirsch U.S. DEPARTMENT OF THE INTERIOR GALE A. NORTON, Secretary U.S. GEOLOGICAL SURVEY Charles G. Groat, Director September 2002 The use of firm, trade, and brand names in this report is for identification purposes only and does not constitute endorsement by the U.S. Geological Survey. Publication available at: http://water.usgs.gov/pubs/twri/twri4a3/ Table of Contents Preface xi Chapter 1 Summarizing Data 1 1.1 Characteristics of Water Resources Data 2 1.2 Measures of Location 3 1.2.1 Classical Measure -- the Mean 3 1.2.2 Resistant Measure -- the Median 5 1.2.3 Other Measures of Location 6 1.3 Measures of Spread 7 1.3.1 Classical Measures 7 1.3.2 Resistant Measures 8 1.4 Measures of Skewness 9 1.4.1 Classical Measure of Skewness 9 1.4.2 Resistant Measure of Skewness 10 1.5 Other Resistant Measures 10 1.6 Outliers 11 1.7 Transformations 12 1.7.1 The Ladder of Powers 12 Chapter 2 Graphical Data Analysis 17 2.1 Graphical Analysis of Single Data Sets 19 2.1.1 Histograms 19 2.1.2 Stem and Leaf Diagrams 20 2.1.3 Quantile Plots 22 2.1.4 Boxplots 24 2.1.5 Probability Plots 26 2.2 Graphical Comparisons of Two or More Data Sets 35 2.2.1 Histograms 35 2.2.2 Dot and Line Plots of Means, Standard Deviations 35 2.2.3 Boxplots 38 2.2.4 Probability Plots 40 2.2.5 Q-Q Plots 41 2.3 Scatterplots and Enhancements 45 ii 2.3.1 Evaluating Linearity 45 2.3.2
    [Show full text]
  • Robustness and Comparative Statistical Power of the Repeated Measures Anova and Friedman Test with Real Data
    ROBUSTNESS AND COMPARATIVE STATISTICAL POWER OF THE REPEATED MEASURES ANOVA AND FRIEDMAN TEST WITH REAL DATA By OPEOLUWA BOLU FADEYI DISSERTATION Submitted to the Graduate School Of Wayne State University, Detroit, Michigan In partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY 2021 MAJOR: EVALUATION AND RESEARCH Approved by: ____________________________________ Advisor Date ____________________________________ ____________________________________ ____________________________________ i DEDICATION To my husband, children, and parents. ii ACKNOWLEDGEMENTS I would like to express my profound gratitude to my dissertation advisor Dr. Shlomo Sawilowsky for his insightful guidance. It is difficult for me to comprehend how he made out time to review my dissertation progressively and thoroughly. His demise was shocking and devastating as my study was concluding. However, he has written his name in gold in my heart. I sincerely appreciate Dr. Barry Markman for all his helpful comments and also for stepping out of retirement to chair my final defense. The course that I took with Dr. Monte Piliawsky impacted the quality of my research significantly. On several instances, Dr. Aguwa reviewed my work and contributed immensely to the richness of my research. I am very grateful to these professors for servin g on my dissertation committee. Also, I will like to express my love and thankfulness to my husband, Johnson, who spent sleepless nights with me proofreading my work. Our children, Wisdom, Delight, and Goodness, that endured a busy mom are much appreciated. I sincerely acknowledge several people that have been a part of my life during the doctoral study who are too numerous to be included. Above all, I want to express my unquantifiable gratitude to God.
    [Show full text]
  • Statistical Analysis in JASP V0.9.2
    2nd Edition JASP v0.9.1 October 2018 Copyright © 2018 by Mark A Goss-Sampson. All rights reserved. This book or any portion thereof may not be reproduced or used in any manner whatsoever without the express written permission of the author except for the purposes of research, education or private study. CONTENTS PREFACE .................................................................................................................................................. 1 USING THE JASP INTERFACE .................................................................................................................... 2 DESCRIPTIVE STATISTICS ......................................................................................................................... 9 EXPLORING DATA INTEGRITY ................................................................................................................ 16 DATA TRANSFORMATION ..................................................................................................................... 23 ONE SAMPLE T-TEST ............................................................................................................................. 27 BINOMIAL TEST ..................................................................................................................................... 30 MULTINOMIAL TEST .............................................................................................................................. 33 CHI-SQUARE ‘GOODNESS-OF-FIT’ TEST............................................................................................
    [Show full text]
  • Doing Better Statistics in Human-Computer Interaction
    DOING BETTER STATISTICS IN HUMAN–COMPUTER INTERACTION Each chapter of this book covers specific topics in statistical analysis, such as robust alternatives to t-tests or how to develop a question- naire. They also address particular questions on these topics that are commonly asked by HCI researchers when planning or completing the analysis of their data. The book presents the current best practice in statistics, drawing on the state-of-the-art literature that is rarely presented in HCI. This is achieved by providing strong arguments that support good statistical analysis without relying on mathematical explanations. It additionally offers some philosophical underpinning for statistics, so that readers can see how statistics fit with experimental design and the fundamental goal of discovering new HCI knowledge. paul cairns is a Reader in Human–Computer Interaction at the University of York, UK, and Scholar-in-Residence for The AbleGamers Charity, which helps people with disabilities combat social isolation by making videogames more accessible. He has taught statistics at all levels of education for nearly twenty years. His particular research interest is in players’ experiences of digital games, and his expertise in experimental and statistical methods was developed through working in this area. DOING BETTER STATISTICS IN HUMAN–COMPUTER INTERACTION PAUL CAIRNS University of York University Printing House, Cambridge cb2 8bs, United Kingdom One Liberty Plaza, 20th Floor, New York, ny 10006, USA 477 Williamstown Road, Port Melbourne, vic 3207, Australia 314–321, 3rd Floor, Plot 3, Splendor Forum, Jasola District Centre, New Delhi – 110025, India 79 Anson Road, #06–04/06, Singapore 079906 Cambridge University Press is part of the University of Cambridge.
    [Show full text]
  • Statistical Methods in Water Resources
    Techniques of Water-Resources Investigations of the United States Geological Survey Book 4, Hydrologic Analysis and Interpretation Chapter A3 Statistical Methods in Water Resources By D.R. Helsel and R.M. Hirsch U.S. DEPARTMENT OF THE INTERIOR GALE A. NORTON, Secretary U.S. GEOLOGICAL SURVEY Charles G. Groat, Director September 2002 The use of firm, trade, and brand names in this report is for identification purposes only and does not constitute endorsement by the U.S. Geological Survey. Publication available at: http://water.usgs.gov/pubs/twri/twri4a3/ Table of Contents Preface xi Chapter 1 Summarizing Data 1 1.1 Characteristics of Water Resources Data 2 1.2 Measures of Location 3 1.2.1 Classical Measure -- the Mean 3 1.2.2 Resistant Measure -- the Median 5 1.2.3 Other Measures of Location 6 1.3 Measures of Spread 7 1.3.1 Classical Measures 7 1.3.2 Resistant Measures 8 1.4 Measures of Skewness 9 1.4.1 Classical Measure of Skewness 9 1.4.2 Resistant Measure of Skewness 10 1.5 Other Resistant Measures 10 1.6 Outliers 11 1.7 Transformations 12 1.7.1 The Ladder of Powers 12 Chapter 2 Graphical Data Analysis 17 2.1 Graphical Analysis of Single Data Sets 19 2.1.1 Histograms 19 2.1.2 Stem and Leaf Diagrams 20 2.1.3 Quantile Plots 22 2.1.4 Boxplots 24 2.1.5 Probability Plots 26 2.2 Graphical Comparisons of Two or More Data Sets 35 2.2.1 Histograms 35 2.2.2 Dot and Line Plots of Means, Standard Deviations 35 2.2.3 Boxplots 38 2.2.4 Probability Plots 40 2.2.5 Q-Q Plots 41 2.3 Scatterplots and Enhancements 45 ii 2.3.1 Evaluating Linearity 45 2.3.2
    [Show full text]
  • Statistical Analysis in JASP
    DOI: 10.6084/m9.figshare.9980744 4th Edition JASP v0.14 2020 Copyright © 2020 by Mark A Goss-Sampson. Licenced as CC BY 4.0 All rights reserved. This book or any portion thereof may not be reproduced or used in any manner whatsoever without the express written permission of the author except for research, education or private study. CONTENTS PREFACE .................................................................................................................................................. 1 USING THE JASP ENVIRONMENT ............................................................................................................ 2 DATA HANDLING IN JASP ........................................................................................................................ 8 JASP ANALYSIS MENU ........................................................................................................................... 11 DESCRIPTIVE STATISTICS ....................................................................................................................... 14 DESCRIPTIVE PLOTS IN JASP .............................................................................................................. 19 SPLITTING DATA FILES ....................................................................................................................... 23 EXPLORING DATA INTEGRITY ................................................................................................................ 25 DATA TRANSFORMATION ....................................................................................................................
    [Show full text]