18.650 – Fundamentals of Statistics
4. Hypothesis Testing


Goals

We have seen the basic notions of hypothesis testing:
- hypotheses $H_0$/$H_1$,
- type 1/type 2 errors, level and power,
- test statistics and rejection regions,
- p-values.

Our tests were based on the CLT (and sometimes Slutsky). This raises several questions:
- What if the data is Gaussian, $\sigma^2$ is unknown and Slutsky does not apply?
- Can we use the asymptotic normality of the MLE?
- What about tests on multivariate parameters $\theta = (\theta_1, \dots, \theta_d)$ (e.g.: $\theta_1 = \theta_2$)?
- What about more complex tests, such as "Does my data follow a Gaussian distribution?"

Parametric hypothesis testing

Clinical trials

Let us go through an example to review the main notions of hypothesis testing.
- Pharmaceutical companies use hypothesis testing to test whether a new drug is effective.
- To do so, they administer the drug to one group of patients (test group) and a placebo to another group (control group).
- We consider testing a drug that is supposed to lower LDL (low-density lipoprotein), a.k.a. "bad cholesterol", among patients with a high level of LDL (above 200 mg/dL).

Notation and modelling
- Let $\Delta_d > 0$ denote the expected decrease of the LDL level (in mg/dL) for a patient who has used the drug.
- Let $\Delta_c > 0$ denote the expected decrease of the LDL level (in mg/dL) for a patient who has used the placebo.
- We want to know whether the drug is more effective than the placebo, i.e. whether $\Delta_d > \Delta_c$.
- We observe two independent samples: $X_1, \dots, X_n \overset{iid}{\sim} \mathcal N(\Delta_d, \sigma_d^2)$ from the test group and $Y_1, \dots, Y_m \overset{iid}{\sim} \mathcal N(\Delta_c, \sigma_c^2)$ from the control group.

Hypothesis testing
- Hypotheses: $H_0: \Delta_d = \Delta_c$ vs. $H_1: \Delta_d > \Delta_c$.
- Since the data is Gaussian by assumption, we do not need the CLT.
- We have $\bar X_n \sim \mathcal N(\Delta_d, \sigma_d^2/n)$ and $\bar Y_m \sim \mathcal N(\Delta_c, \sigma_c^2/m)$.
- Therefore,
$$\frac{\bar X_n - \bar Y_m - (\Delta_d - \Delta_c)}{\sqrt{\sigma_d^2/n + \sigma_c^2/m}} \sim \mathcal N(0, 1).$$

Asymptotic test
- Assume that $m = cn$ and $n \to \infty$.
- Using Slutsky's lemma, we also have
$$\frac{\bar X_n - \bar Y_m - (\Delta_d - \Delta_c)}{\sqrt{\hat\sigma_d^2/n + \hat\sigma_c^2/m}} \xrightarrow[n \to \infty]{(d)} \mathcal N(0, 1),$$
where
$$\hat\sigma_d^2 = \frac{1}{n} \sum_{i=1}^n (X_i - \bar X_n)^2 \quad \text{and} \quad \hat\sigma_c^2 = \frac{1}{m} \sum_{i=1}^m (Y_i - \bar Y_m)^2.$$
- We get the following rejection region at asymptotic level $\alpha$:
$$R = \left\{ \frac{\bar X_n - \bar Y_m}{\sqrt{\hat\sigma_d^2/n + \hat\sigma_c^2/m}} > q_\alpha \right\},$$
where $q_\alpha$ is the $(1 - \alpha)$-quantile of $\mathcal N(0, 1)$.
- This is a one-sided, two-sample test.

Asymptotic test: example
- $n = 70$, $m = 50$, $\bar X_n = 156.4$, $\bar Y_m = 132.7$, $\hat\sigma_d^2 = 5198.4$, $\hat\sigma_c^2 = 3867.0$:
$$\frac{156.4 - 132.7}{\sqrt{5198.4/70 + 3867.0/50}} = 1.57.$$
Since $q_{5\%} = 1.645$, we fail to reject $H_0$ at level 5%.
- We can also compute the p-value: p-value $= \mathbb P(Z > 1.57) = 0.0582$.
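This decision rule is easy to check numerically. Below is a minimal Python sketch (assuming scipy is available; the code is illustrative and not part of the original slides) that takes the observed statistic 1.57 and recovers the quantile comparison and the one-sided p-value:

    from scipy.stats import norm

    z_obs = 1.57    # observed two-sample statistic from the example above
    alpha = 0.05

    q_alpha = norm.ppf(1 - alpha)   # (1 - alpha)-quantile of N(0, 1): about 1.645
    p_value = norm.sf(z_obs)        # P(Z > 1.57): about 0.0582

    # One-sided test: reject H0 when the statistic exceeds q_alpha
    print(f"q = {q_alpha:.3f}, reject H0: {z_obs > q_alpha}, p-value = {p_value:.4f}")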
Small sample size
- What if $n = 20$, $m = 12$?
- We cannot realistically apply Slutsky's lemma.
- We needed it to find the (asymptotic) distribution of quantities of the form
$$\frac{\bar X_n - \mu}{\sqrt{\hat\sigma^2/n}}$$
when $X_1, \dots, X_n \overset{iid}{\sim} \mathcal N(\mu, \sigma^2)$.
- It turns out that this distribution does not depend on $\mu$ or $\sigma$, so we can compute its quantiles.

The χ² distribution

Definition: For a positive integer $d$, the $\chi^2$ (pronounced "kai-squared") distribution with $d$ degrees of freedom is the law of the random variable $Z_1^2 + Z_2^2 + \dots + Z_d^2$, where $Z_1, \dots, Z_d \overset{iid}{\sim} \mathcal N(0, 1)$.

Examples:
- If $Z \sim \mathcal N_d(0, I_d)$, then $\|Z\|_2^2 \sim \chi^2_d$.
- $\chi^2_2 = \mathrm{Exp}(1/2)$.

[Figure: densities of the $\chi^2$ distribution for df = 1, 2, 3, 4, 5, 10 and 20.]

Properties: If $V \sim \chi^2_k$, then
- $\mathbb E[V] = k$,
- $\mathrm{var}[V] = 2k$.

Important example: the sample variance
- Recall that the sample variance is given by
$$S_n = \frac{1}{n} \sum_{i=1}^n (X_i - \bar X_n)^2 = \frac{1}{n} \sum_{i=1}^n X_i^2 - (\bar X_n)^2.$$
- Cochran's theorem states that for $X_1, \dots, X_n \overset{iid}{\sim} \mathcal N(\mu, \sigma^2)$, if $S_n$ is the sample variance, then
  - $\bar X_n \perp S_n$;
  - $\dfrac{n S_n}{\sigma^2} \sim \chi^2_{n-1}$.
- We often prefer the unbiased estimator of $\sigma^2$:
$$\tilde S_n = \frac{n}{n-1} S_n = \frac{1}{n-1} \sum_{i=1}^n (X_i - \bar X_n)^2.$$

Student's T distribution

Definition: For a positive integer $d$, the Student's T distribution with $d$ degrees of freedom (denoted by $t_d$) is the law of the random variable $\dfrac{Z}{\sqrt{V/d}}$, where $Z \sim \mathcal N(0, 1)$, $V \sim \chi^2_d$ and $Z \perp V$ ($Z$ is independent of $V$).

Who was Student?

This distribution was introduced by William Sealy Gosset (1876–1937) in 1908 while he worked for the Guinness brewery in Dublin, Ireland.

Student's T test (one sample, two-sided)
- Let $X_1, \dots, X_n \overset{iid}{\sim} \mathcal N(\mu, \sigma^2)$, where both $\mu$ and $\sigma^2$ are unknown.
- We want to test: $H_0: \mu = 0$ vs. $H_1: \mu \neq 0$.
- Test statistic:
$$T_n = \frac{\sqrt{n}\, \bar X_n}{\sqrt{\tilde S_n}}.$$
- Since $\sqrt{n}\, \bar X_n / \sigma \sim \mathcal N(0, 1)$ (under $H_0$) and $(n-1)\tilde S_n / \sigma^2 \sim \chi^2_{n-1}$ are independent by Cochran's theorem, we have $T_n \sim t_{n-1}$.
- Student's test with (non-asymptotic) level $\alpha \in (0, 1)$:
$$\psi_\alpha = \mathbb 1\{|T_n| > q_{\alpha/2}\},$$
where $q_{\alpha/2}$ is the $(1 - \alpha/2)$-quantile of $t_{n-1}$.

Student's T test (one sample, one-sided)
- We want to test: $H_0: \mu \le \mu_0$ vs. $H_1: \mu > \mu_0$.
- Test statistic:
$$T_n = \frac{\sqrt{n}\, (\bar X_n - \mu_0)}{\sqrt{\tilde S_n}} \sim t_{n-1}$$
under $H_0$.
- Student's test with (non-asymptotic) level $\alpha \in (0, 1)$:
$$\psi_\alpha = \mathbb 1\{T_n > q_\alpha\},$$
where $q_\alpha$ is the $(1 - \alpha)$-quantile of $t_{n-1}$.

Two-sample T-test
- Back to our cholesterol example. What happens for small sample sizes?
- We want to know the distribution of
$$\frac{\bar X_n - \bar Y_m - (\Delta_d - \Delta_c)}{\sqrt{\hat\sigma_d^2/n + \hat\sigma_c^2/m}}.$$
- We have, approximately,
$$\frac{\bar X_n - \bar Y_m - (\Delta_d - \Delta_c)}{\sqrt{\hat\sigma_d^2/n + \hat\sigma_c^2/m}} \sim t_N,$$
where
$$N = \frac{\left( \hat\sigma_d^2/n + \hat\sigma_c^2/m \right)^2}{\dfrac{\hat\sigma_d^4}{n^2(n-1)} + \dfrac{\hat\sigma_c^4}{m^2(m-1)}} \ \ge\ \min(n, m)$$
(Welch–Satterthwaite formula).

Non-asymptotic test
- Example: $n = 70$, $m = 50$, $\bar X_n = 156.4$, $\bar Y_m = 132.7$, $\hat\sigma_d^2 = 5198.4$, $\hat\sigma_c^2 = 3867.0$:
$$\frac{156.4 - 132.7}{\sqrt{5198.4/70 + 3867.0/50}} = 1.57.$$
- Using the shorthand formula $N = \min(n, m) = 50$, we get $q_{5\%} = 1.68$ and p-value $= 0.0614$.
- Using the Welch–Satterthwaite formula,
$$N = \frac{\left( 5198.4/70 + 3867.0/50 \right)^2}{\dfrac{5198.4^2}{70^2 (70-1)} + \dfrac{3867.0^2}{50^2 (50-1)}} = 113.78,$$
which we round down to $N = 113$.
- We get p-value $= 0.0596$.

Discussion
- Advantage of Student's test: it is non-asymptotic and can be run on small samples.
- Drawback of Student's test: it relies on the assumption that the sample is Gaussian (soon we will see how to test this assumption).
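The Welch–Satterthwaite computation above is easy to reproduce numerically. Below is a minimal Python sketch (scipy assumed; illustrative, not part of the original slides) that recovers $N \approx 113.78$ from the summary statistics and evaluates the one-sided p-value of the observed statistic 1.57 under both choices of $N$:

    from scipy.stats import t

    n, m = 70, 50
    var_d, var_c = 5198.4, 3867.0   # sample variances from the example above
    t_obs = 1.57                    # observed test statistic from the example above

    # Welch-Satterthwaite effective degrees of freedom
    a, b = var_d / n, var_c / m
    N = (a + b) ** 2 / (a ** 2 / (n - 1) + b ** 2 / (m - 1))   # about 113.78

    print(f"N = {N:.2f}")
    print(f"p-value (Welch):     {t.sf(t_obs, N):.4f}")            # about 0.0596
    print(f"p-value (shorthand): {t.sf(t_obs, min(n, m)):.4f}")    # about 0.0614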
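For completeness, here is a sketch of the one-sample, two-sided Student test on simulated data (the sample size, mean and seed are arbitrary illustrative choices, not from the slides), comparing the hand-rolled statistic with scipy's built-in ttest_1samp:

    import numpy as np
    from scipy.stats import t, ttest_1samp

    rng = np.random.default_rng(0)
    x = rng.normal(loc=0.5, scale=1.0, size=15)   # small Gaussian sample

    n = len(x)
    S_tilde = x.var(ddof=1)                       # unbiased variance estimator
    T_n = np.sqrt(n) * x.mean() / np.sqrt(S_tilde)

    alpha = 0.05
    q = t.ppf(1 - alpha / 2, df=n - 1)            # (1 - alpha/2)-quantile of t_{n-1}
    print(f"T_n = {T_n:.3f}, reject H0: {abs(T_n) > q}")

    # scipy's built-in version returns the same statistic and a two-sided p-value
    print(ttest_1samp(x, popmean=0.0))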
A test based on the MLE
- Consider an i.i.d. sample $X_1, \dots, X_n$ with statistical model $(E, (\mathbb P_\theta)_{\theta \in \Theta})$, where $\Theta \subseteq \mathbb R^d$ ($d \ge 1$), and let $\theta_0 \in \Theta$ be fixed and given.
- Consider the following hypotheses:
$$H_0: \theta = \theta_0 \quad \text{vs.} \quad H_1: \theta \neq \theta_0.$$
- Let $\hat\theta_n^{\mathrm{MLE}}$ be the MLE. Assume the MLE technical conditions are satisfied.
- If $H_0$ is true, then
$$\sqrt{n}\, I(\hat\theta_n^{\mathrm{MLE}})^{1/2} \left( \hat\theta_n^{\mathrm{MLE}} - \theta_0 \right) \xrightarrow[n \to \infty]{(d)} \mathcal N_d(0, I_d).$$

Wald's test
- Hence,
$$T_n = n \left( \hat\theta_n^{\mathrm{MLE}} - \theta_0 \right)^\top I(\hat\theta_n^{\mathrm{MLE}}) \left( \hat\theta_n^{\mathrm{MLE}} - \theta_0 \right) \xrightarrow[n \to \infty]{(d)} \chi^2_d.$$
- Wald's test with asymptotic level $\alpha \in (0, 1)$:
$$\psi = \mathbb 1\{T_n > q_\alpha\},$$
where $q_\alpha$ is the $(1 - \alpha)$-quantile of $\chi^2_d$ (see tables).
- Remark: Wald's test is also valid if $H_1$ has the form "$\theta > \theta_0$", "$\theta < \theta_0$" or "$\theta = \theta_1$"...

A test based on the log-likelihood
- Consider an i.i.d. sample $X_1, \dots, X_n$ with statistical model $(E, (\mathbb P_\theta)_{\theta \in \Theta})$, where $\Theta \subseteq \mathbb R^d$ ($d \ge 1$).
- Suppose the null hypothesis has the form
$$H_0: (\theta_{r+1}, \dots, \theta_d) = (\theta_{r+1}^{(0)}, \dots, \theta_d^{(0)}),$$
for some fixed and given numbers $\theta_{r+1}^{(0)}, \dots, \theta_d^{(0)}$.
- Let
$$\hat\theta_n = \operatorname*{argmax}_{\theta \in \Theta} \ \ell_n(\theta) \quad \text{(MLE)}$$
and
$$\hat\theta_n^c = \operatorname*{argmax}_{\theta \in \Theta_0} \ \ell_n(\theta) \quad \text{("constrained MLE")},$$
where $\Theta_0 = \{\theta \in \Theta : (\theta_{r+1}, \dots, \theta_d) = (\theta_{r+1}^{(0)}, \dots, \theta_d^{(0)})\}$.

Likelihood ratio test
- Test statistic:
$$T_n = 2 \left( \ell_n(\hat\theta_n) - \ell_n(\hat\theta_n^c) \right).$$
- Wilks' theorem: Assume $H_0$ is true and the MLE technical conditions are satisfied. Then
$$T_n \xrightarrow[n \to \infty]{(d)} \chi^2_{d-r}.$$
- Likelihood ratio test with asymptotic level $\alpha \in (0, 1)$:
$$\psi = \mathbb 1\{T_n > q_\alpha\},$$
where $q_\alpha$ is the $(1 - \alpha)$-quantile of $\chi^2_{d-r}$ (see tables).
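To make Wald's test concrete, here is a minimal sketch for a Bernoulli($p$) model (an illustrative choice, not from the slides), testing $H_0: p = p_0$. Here $d = 1$, the MLE is the sample mean, and the Fisher information is $I(p) = 1/(p(1-p))$:

    import numpy as np
    from scipy.stats import chi2

    rng = np.random.default_rng(0)
    p_true, p0, n = 0.55, 0.5, 1000
    x = rng.binomial(1, p_true, size=n)     # simulated Bernoulli sample

    p_hat = x.mean()                        # MLE of p
    fisher = 1.0 / (p_hat * (1.0 - p_hat))  # I(p) evaluated at the MLE
    T_n = n * (p_hat - p0) ** 2 * fisher    # Wald statistic, ~ chi2_1 under H0

    alpha = 0.05
    q = chi2.ppf(1 - alpha, df=1)           # about 3.841
    print(f"T_n = {T_n:.3f}, reject H0: {T_n > q}, "
          f"p-value = {chi2.sf(T_n, df=1):.4f}")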
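And a matching sketch of the likelihood ratio test for a Gaussian model $\mathcal N(\mu, \sigma^2)$ (again an illustrative choice, not from the slides): $d = 2$ parameters, $H_0$ fixes $\mu = 0$ and leaves $\sigma^2$ free, so $r = 1$ and Wilks' theorem gives a $\chi^2_1$ limit. Plugging the MLEs into the Gaussian log-likelihood, the constant terms cancel and the statistic reduces to $T_n = n \log(\hat\sigma_0^2 / \hat\sigma^2)$:

    import numpy as np
    from scipy.stats import chi2

    rng = np.random.default_rng(1)
    x = rng.normal(loc=0.3, scale=2.0, size=200)   # simulated sample
    n = len(x)

    # Unconstrained MLE: mu_hat = sample mean, var_hat = (1/n) sum (x_i - mean)^2
    var_hat = x.var()                  # numpy's default ddof=0 is the MLE
    # Constrained MLE under H0 (mu = 0): var_hat0 = (1/n) sum x_i^2
    var_hat0 = np.mean(x ** 2)

    # T_n = 2(l_n(MLE) - l_n(constrained MLE)) = n log(var_hat0 / var_hat)
    T_n = n * np.log(var_hat0 / var_hat)

    alpha = 0.05
    q = chi2.ppf(1 - alpha, df=1)      # d - r = 1 degree of freedom
    print(f"T_n = {T_n:.3f}, reject H0: {T_n > q}, "
          f"p-value = {chi2.sf(T_n, df=1):.4f}")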