The Sum of Standardized Residuals: Goodness of Fit Test for Binary Response Model

Total Page:16

File Type:pdf, Size:1020Kb

The Sum of Standardized Residuals: Goodness of Fit Test for Binary Response Model The Sum of Standardized Residuals: Goodness of Fit Test for Binary Response Model Thesis Presented in Partial Fulfillment of the Requirements for the Degree Master of Science in the Graduate School of The Ohio State University By Lu Chen, B.S Graduate Program in Public Health The Ohio State University 2017 Thesis Committee: Grzegorz A. Rempala, Advisor Chuck Song c Copyright by Lu Chen 2017 Abstract Binary response model is frequently used to analyze binary outcome variables [1]. This popularity leads to an increase in statistical research on the model [1]. One area of current research is developing new goodness of fit (GOF) tests to evaluate whether a model fits the data [1]. In this paper, a new GOF test statistic, the sum of standardized residuals (Cn), is proposed and its asymptotic distribution is described by following Windmeijer's idea [2]. In addition, we illustrate, via numerical examples, the practical applications of the asymptotic result to some finite samples of size n and compare Cn's performance with some other currently used statistics. Our results demonstrate that, compared to other statistics, the overall performance of Cn is satisfying and stable, and Cn can be calculated easily and interpreted intuitively, unlike its other competitors. ii Vita 2014 . .B.S. Pharmacy, China Pharmaceutical Univerdsity 2015 to present . School of Public Health, The Ohio State University Fields of Study Major Field: Public Health iii Table of Contents Page Abstract . ii Vita . iii List of Tables . v 1. Introduction . 1 1.1 Some Currently Used Goodness-of-Fit Statistics . 1 1.1.1 Hosmer-Lemeshow Statistic . 1 1.1.2 Stukel Statistic . 3 1.1.3 Unweighted Sum of Squares Statistic . 3 1.1.4 Information Matrix Statistic . 4 2. Sum of Standardized Residual Statistic . 5 2.1 Regularity Assumptions . 5 2.2 The Asymptotic Distribution of Cn .................. 7 3. Numerical Examples . 8 3.0.1 Simulated Covariates . 9 3.0.2 Real Covariates . 11 3.0.3 Summary . 18 Bibliography . 19 Appendices 20 A. Proof of the Asymptotic Distribution of Cn . 20 iv List of Tables Table Page 3.1 Proportion H0 rejected at the α = 0:05 using sample size of 50, 100 and 1000 with 1000 replications using simulated covariates . 10 3.2 Selected variables description in the Boston Marathon data . 11 3.3 Proportion H0 rejected at the α = 0:05 using sample size of 445 with 1000 replications in quadratic models using the Boston Marathon data 13 3.4 Selected variables description in Cleveland data . 14 3.5 Proportion H0 rejected at the α = 0:05 using sample size of 303 with 1000 replications in interaction models using the Cleveland data . 15 3.6 Selected variables description in NHANES data . 16 3.7 Proportion H0 rejected at the α = 0:05 using sample size of 373 with 1000 replications in interaction models using the NHANES data . 17 v Chapter 1: Introduction The binary response model [3] is a popular method for analyzing binary data. In this model, we are interested in pi, where pi is defined as the probability that a binary random variable Yi = 1 given the covariates xi such that > pi = P (Yi = 1jxi) = F (xi β); (1.1) where β is the k-vector of coefficients and F is a distribution function. Some popular choices of F include normal (probit model) or logistic (logit model) distributions. After a model is fitted, we usually want to assess its goodness of fit (GOF). One of the several approaches is to perform a formal GOF test. We will introduce several currently used test statistics in the next section. Also, we will propose a new statistic, the sum of standardized residuals (Cn), and describe its asymptotic distribution in Chapter 2. In Chapter 3, we will illustrate, via numerical examples, the practical applications of the asymptotic result to some finite samples of size n and compare Cn's performance with other statistics. 1.1 Some Currently Used Goodness-of-Fit Statistics 1.1.1 Hosmer-Lemeshow Statistic In 1980, Hosmer and Lemeshow [4] proposed a grouping-based method to measure the goodness of fit for the binary response model. Specifically, the data are grouped 1 0 according to thep ^i s, the estimated values of pi. The observed and expected frequency are calculated for each group, and compared using the Pearson chi-square statistic to obtain g " 2 2 # X (O1k − Ebik) (O0k − Eb0k) HL = + k=1 Eb1k Eb0k th Pck where k denotes the k group, k = 1; 2; 3; :::; g. O1k = j=1 Yi, is the observed th number of events (Yi = 1) in the k group, where ck is the number of covariate th Pck patterns in the k group. Eb1k = j=1 mjp^j is the expected number of events in the th th k group, where mj is the number of observations in j covariate pattern, andp ^j is th Pck the estimated probability for j covariate pattern. Similarly, O0k = j=1(mj −Yj) is th Pck the observed number of non-events (Yi = 0) in the k group, and Eb0k = j=1 mj(1− th p^j) is the expected number of non-events in the k group. Since g = 10 and g = 20 are commonly suggested in practice, we consider using these two setting to perform HL tests in Section 3. With a bit of algebra, it can be shown that g 0 2 X (O1k − nkp¯k) HL = 0 n p¯k(1 − p¯k) k=1 k 0 th where nk is the number of observations in the k group,p ¯k is the average estimated probability in the kth group. When the number of groups g equals to the sample size n, the formula of HL can be further rewritten as n 2 X (Yi − p^i) HL = p^ (1 − p^ ) i=1 i i which suggests the formula for another goodness of fit measure: the sum of squared standardized residuals. 2 1.1.2 Stukel Statistic To perform Stukel GOF test [5], one or two additional covariates are included in the standard logistic model, and this new model is tested versus the original one. For > ^ ^ example, letp ^i = F (xi β) be the original model, where β is the vector of estimated 2 2 coefficients. Let za = gi I(gi ≥ 0) and zb = gi I(gi < 0) be the two covariates,where > ^ > ^ ^ gi = xi β. After including za and zb into the model, it becomesp ^i = F (xi β + βaza + ^ βbzb). Then, the Ward test or the Likelihood Ratio test can be performed to test the null hypothesis that both βa and βb are equal to 0 . 1.1.3 Unweighted Sum of Squares Statistic When some of the expected cell frequencies in the 2 × n contingency table are too small, the weighted chi-squared test for proportions cannot be used. In this case, Copas [6] modified the chi-squared test as n X 2 USS = (Yi − p^i) ; i=1 wherep ^i is defined as before, n is the sample size. Under the null hypothesis, Hosmer et al.[1] have proved that this statistic will have an asymptotic normal distribution when standardized by its mean and the square root of its variance: USS − trace(V^ ) d q −! N(0; 1) Vd ar(USS − trace(V^ )) where ^ ^> ^ ^ ^ Vd ar(USS − trace(V )) = di (I − M)V di with ^ di = 1 − 2^pi 3 ^ V = diag[^pi(1 − p^i)] ^ ^ > ^ −1 > M = V xi(xi V xi) xi 1.1.4 Information Matrix Statistic In 1982, White [7] introduced a general test for model misspecification. The basic idea of this test is to compare two different estimates of the covariance matrix. If these two estimates are the same, there is no lack of fit. The formula for Information Matrix statistic is: n p X X 2 IM = (Yi − p^i)(1 − 2p ^i)xij i=1 j=0 th th where xij is the j predictor for i observation. It has been shown by White that IM has a approximate chi-square distribution with p + 1 degree of freedom under the null hypothesis of the correct model. 4 Chapter 2: Sum of Standardized Residual Statistic As we mentioned in Section 1, the sum of squared standardized residuals is also a goodness of fit measure in binary response model n 2 n X (Yi − p^i) X T = = W : n p^ (1 − p^ ) i i=1 i i i=1 This statistic was proved to be asymptotically following the standard normal dis- tribution when properly standardized [2]. However, the drawback of Tn statistic is that the variances of its individual components Wi depend heavily on pi in such way that extreme values of pi could produce unbalanced variance across the sample. One intuitive way to address this issue is to assign a variance-stabilizing weight to Wi. This yields a new statistic n X Yi − p^i Cn = p : i=1 p^i(1 − p^i) We will analyze the asymptotic distribution of this statistic under the following as- sumptions. 2.1 Regularity Assumptions Following Windmeijer [2] we assume that 5 1. Both the first-order derivative f and second-order derivative f 0 of F are con- tinuous. There exists a positive M < 1 such that 0 < F (z) < 1 and f(z) > 0 for all jzj ≤ M. 2. β 2 B, the parameter space which is a open subset of the Euclidean space of the same dimension k as β. P > 3. There exists a finite number M such that jjxijj ≤ M; 8i 2 and lim xixi N n!1 i is a finite non-singular matrix.
Recommended publications
  • Goodness of Fit: What Do We Really Want to Know? I
    PHYSTAT2003, SLAC, Stanford, California, September 8-11, 2003 Goodness of Fit: What Do We Really Want to Know? I. Narsky California Institute of Technology, Pasadena, CA 91125, USA Definitions of the goodness-of-fit problem are discussed. A new method for estimation of the goodness of fit using distance to nearest neighbor is described. Performance of several goodness-of-fit methods is studied for time-dependent CP asymmetry measurements of sin(2β). 1. INTRODUCTION of the test is then defined as the probability of ac- cepting the null hypothesis given it is true, and the The goodness-of-fit problem has recently attracted power of the test, 1 − αII , is defined as the probabil- attention from the particle physics community. In ity of rejecting the null hypothesis given the alterna- modern particle experiments, one often performs an tive is true. Above, αI and αII denote Type I and unbinned likelihood fit to data. The experimenter Type II errors, respectively. An ideal hypothesis test then needs to estimate how accurately the fit function is uniformly most powerful (UMP) because it gives approximates the observed distribution. A number of the highest power among all possible tests at the fixed methods have been used to solve this problem in the confidence level. In most realistic problems, it is not past [1], and a number of methods have been recently possible to find a UMP test and one has to consider proposed [2, 3] in the physics literature. various tests with acceptable power functions. For binned data, one typically applies a χ2 statistic There is a long-standing controversy about the con- to estimate the fit quality.
    [Show full text]
  • Linear Regression: Goodness of Fit and Model Selection
    Linear Regression: Goodness of Fit and Model Selection 1 Goodness of Fit I Goodness of fit measures for linear regression are attempts to understand how well a model fits a given set of data. I Models almost never describe the process that generated a dataset exactly I Models approximate reality I However, even models that approximate reality can be used to draw useful inferences or to prediction future observations I ’All Models are wrong, but some are useful’ - George Box 2 Goodness of Fit I We have seen how to check the modelling assumptions of linear regression: I checking the linearity assumption I checking for outliers I checking the normality assumption I checking the distribution of the residuals does not depend on the predictors I These are essential qualitative checks of goodness of fit 3 Sample Size I When making visual checks of data for goodness of fit is important to consider sample size I From a multiple regression model with 2 predictors: I On the left is a histogram of the residuals I On the right is residual vs predictor plot for each of the two predictors 4 Sample Size I The histogram doesn’t look normal but there are only 20 datapoint I We should not expect a better visual fit I Inferences from the linear model should be valid 5 Outliers I Often (particularly when a large dataset is large): I the majority of the residuals will satisfy the model checking assumption I a small number of residuals will violate the normality assumption: they will be very big or very small I Outliers are often generated by a process distinct from those which we are primarily interested in.
    [Show full text]
  • Center for Multicultural & Global Mental Health Annual Report
    Center for Multicultural & Global Mental Health Annual Report: 2017-2018 WILLIAM JAMES COLLEGE One Wells Avenue Newton, MA 02459 617.327.6777 [email protected] www.williamjames.edu/cmgmh Center for Multicultural & Global Mental Health Annual Report: 2017-2018 Table of Contents Introduction……………………………………………………………………………………………2 Overview of the Center for Multicultural and Global Mental Health……………..…4 Mission…………………………………………………………………………………….......4 Vision Statement…………………………………………………………………………... 4 Strategic Goals & Objectives…………………………………………….…………….. 4 CMGMH’s Academic Concentrations …………………………………………………......... 6 African & Caribbean Mental Health ……………………..…………………….….... 6 Global Mental Health…………………….…………………………………….……....... 6 Latino Mental Health Program……………………………………………………….... 7 CMGMH’s Programs………………………………………………………………………………….8 The Black Mental Health Graduate Academy………………………..………….....8 Syrian Refugee Project………………………………………………………….……......8 Profile of Students in CMGMH’s Concentrations…………………………………………..9 CMGMH’s Fellows & Scholarship Awardees……..………………………………………...11 017-2018 Annual Report Faculty Presentations………………....…….………………………………………...…………16 Faculty & Student Awards and Recognitions…………………………….……………….18 Service Learning & Cultural Immersion Programs…………………....………………..20 Professional Development & Social Cultural Events……………………...…………...25 WJC in Action: Practicing What We Teach…..………………………………..…….……...32 Fall 2018 Conferences…………………………………………………………………………….35 Get Involved with CMGMH…………………………………………………………………….…36 A Note of
    [Show full text]
  • Chapter 2 Simple Linear Regression Analysis the Simple
    Chapter 2 Simple Linear Regression Analysis The simple linear regression model We consider the modelling between the dependent and one independent variable. When there is only one independent variable in the linear regression model, the model is generally termed as a simple linear regression model. When there are more than one independent variables in the model, then the linear model is termed as the multiple linear regression model. The linear model Consider a simple linear regression model yX01 where y is termed as the dependent or study variable and X is termed as the independent or explanatory variable. The terms 0 and 1 are the parameters of the model. The parameter 0 is termed as an intercept term, and the parameter 1 is termed as the slope parameter. These parameters are usually called as regression coefficients. The unobservable error component accounts for the failure of data to lie on the straight line and represents the difference between the true and observed realization of y . There can be several reasons for such difference, e.g., the effect of all deleted variables in the model, variables may be qualitative, inherent randomness in the observations etc. We assume that is observed as independent and identically distributed random variable with mean zero and constant variance 2 . Later, we will additionally assume that is normally distributed. The independent variables are viewed as controlled by the experimenter, so it is considered as non-stochastic whereas y is viewed as a random variable with Ey()01 X and Var() y 2 . Sometimes X can also be a random variable.
    [Show full text]
  • A Study of Some Issues of Goodness-Of-Fit Tests for Logistic Regression Wei Ma
    Florida State University Libraries Electronic Theses, Treatises and Dissertations The Graduate School 2018 A Study of Some Issues of Goodness-of-Fit Tests for Logistic Regression Wei Ma Follow this and additional works at the DigiNole: FSU's Digital Repository. For more information, please contact [email protected] FLORIDA STATE UNIVERSITY COLLEGE OF ARTS AND SCIENCES A STUDY OF SOME ISSUES OF GOODNESS-OF-FIT TESTS FOR LOGISTIC REGRESSION By WEI MA A Dissertation submitted to the Department of Statistics in partial fulfillment of the requirements for the degree of Doctor of Philosophy 2018 Copyright c 2018 Wei Ma. All Rights Reserved. Wei Ma defended this dissertation on July 17, 2018. The members of the supervisory committee were: Dan McGee Professor Co-Directing Dissertation Qing Mai Professor Co-Directing Dissertation Cathy Levenson University Representative Xufeng Niu Committee Member The Graduate School has verified and approved the above-named committee members, and certifies that the dissertation has been approved in accordance with university requirements. ii ACKNOWLEDGMENTS First of all, I would like to express my sincere gratitude to my advisors, Dr. Dan McGee and Dr. Qing Mai, for their encouragement, continuous support of my PhD study, patient guidance. I could not have completed this dissertation without their help and immense knowledge. I have been extremely lucky to have them as my advisors. I would also like to thank the rest of my committee members: Dr. Cathy Levenson and Dr. Xufeng Niu for their support, comments and help for my thesis. I would like to thank all the staffs and graduate students in my department.
    [Show full text]
  • Testing Goodness Of
    Dr. Wolfgang Rolke University of Puerto Rico - Mayaguez CERN Phystat Seminar 1 Problem statement Hypothesis testing Chi-square Methods based on empirical distribution function Other tests Power studies Running several tests Tests for multi-dimensional data 2 ➢ We have a probability model ➢ We have data from an experiment ➢ Does the data agree with the probability model? 3 Good Model? Or maybe needs more? 4 F: cumulative distribution function 퐻0: 퐹 = 퐹0 Usually more useful: 퐻0: 퐹 ∊ ℱ0 ℱ0 a family of distributions, indexed by parameters. 5 Type I error: reject true null hypothesis Type II error: fail to reject false null hypothesis A: HT has to have a true type I error probability no higher than the nominal one (α) B: probability of committing the type II error (β) should be as low as possible (subject to A) Historically A was achieved either by finding an exact test or having a large enough sample. p value = probability to reject true null hypothesis when repeating the experiment and observing value of test statistic or something even less likely. If method works p-value has uniform distribution. 6 Note above: no alternative hypothesis 퐻1 Different problem: 퐻0: 퐹 = 푓푙푎푡 vs 퐻0: 퐹 = 푙푖푛푒푎푟 → model selection Usually better tests: likelihood ratio test, F tests, BIC etc. Easy to confuse: all GOF papers do power studies, those need specific alternative. Our question: is F a good enough model for data? We want to guard against any alternative. 7 Not again … Actually no, GOF equally important to both (everybody has a likelihood) Maybe more so for Bayesians, no non- parametric methods.
    [Show full text]
  • Measures of Fit for Logistic Regression Paul D
    Paper 1485-2014 SAS Global Forum Measures of Fit for Logistic Regression Paul D. Allison, Statistical Horizons LLC and the University of Pennsylvania ABSTRACT One of the most common questions about logistic regression is “How do I know if my model fits the data?” There are many approaches to answering this question, but they generally fall into two categories: measures of predictive power (like R-square) and goodness of fit tests (like the Pearson chi-square). This presentation looks first at R-square measures, arguing that the optional R-squares reported by PROC LOGISTIC might not be optimal. Measures proposed by McFadden and Tjur appear to be more attractive. As for goodness of fit, the popular Hosmer and Lemeshow test is shown to have some serious problems. Several alternatives are considered. INTRODUCTION One of the most frequent questions I get about logistic regression is “How can I tell if my model fits the data?” Often the questioner is expressing a genuine interest in knowing whether a model is a good model or a not-so-good model. But a more common motivation is to convince someone else--a boss, an editor, or a regulator--that the model is OK. There are two very different approaches to answering this question. One is to get a statistic that measures how well you can predict the dependent variable based on the independent variables. I’ll refer to these kinds of statistics as measures of predictive power. Typically, they vary between 0 and 1, with 0 meaning no predictive power whatsoever and 1 meaning perfect predictions.
    [Show full text]
  • Goodness of Fit of a Straight Line to Data
    5. A linear trend. 10.4 The Least Squares Regression Line LEARNING OBJECTIVES 1. To learn how to measure how well a straight line fits a collection of data. 2. To learn how to construct the least squares regression line, the straight line that best fits a collection of data. 3. To learn the meaning of the slope of the least squares regression line. 4. To learn how to use the least squares regression line to estimate the response variable y in terms of the predictor variablex. Goodness of Fit of a Straight Line to Data Once the scatter diagram of the data has been drawn and the model assumptions described in the previous sections at least visually verified (and perhaps the correlation coefficient r computed to quantitatively verify the linear trend), the next step in the analysis is to find the straight line that best fits the data. We will explain how to measure how well a straight line fits a collection of points by examining how well the line y=12x−1 fits the data set Saylor URL: http://www.saylor.org/books Saylor.org 503 To each point in the data set there is associated an “error,” the positive or negative vertical distance from the point to the line: positive if the point is above the line and negative if it is below theline. The error can be computed as the actual y-value of the point minus the y-value yˆ that is “predicted” by inserting the x-value of the data point into the formula for the line: error at data point (x,y)=(true y)−(predicted y)=y−yˆ The computation of the error for each of the five points in the data set is shown in Table 10.1 "The Errors in Fitting Data with a Straight Line".
    [Show full text]
  • October 2002
    October 2002 Comcast Cup (Corporate) and club competition. PCVRC’s “A” Team placed 2nd and the “B” Team placed 3rd among the seven club teams that were entered. The “A” team consisted of Brian Driscoll, Tom Jermyn, David James, Theresa Cannon, and Kim A Real Classic! Moore. The “B” team consisted of Frank Barbera, Bob Hempton, Chris James, Diane Kukich, and Carla The 2002 Delaware Distance Pastore. The Delaware Running Company took home the Comcast Cup. Classic continued on page 3 by Dave Farren he 20th annual Delaware Distance TClassic was the most successful in years. Running in near-perfect conditions, 250 finishers crossed the finish line. This was the fifth consecu- tive year there was an increase in the number of finishers. The temperature at the start was 64 under overcast skies. Leading the way this year was Mike Leader and eventual winner Mike Digennaro set a blistering pace at Digennaro from the start of the 20th Annual DDC, clocking his first mile in 4:57! Newark, Del., with a time of 48:40. This was the fastest time run on the Riverfront since the race was moved there in 1999. Running about a minute behind was Mike Monagle, the new owner of the Delaware Running Company in Greenville. Leading the women was Kara–Lynn Kerr of Ardmore, Pa., in a time President’s Message page 3 of 56:36. Second among the women was newlywed Club Reminders page 3 Vicki Cauller in 59:02. A total of $750 in prize money was awarded, with the top three men and women and Foot Notes, New Members page 4 the top male and female masters taking home some Runner of the Quarter page 6 cash.
    [Show full text]
  • Using Minitab to Run a Goodness‐Of‐Fit Test 1. Enter the Values of A
    Using Minitab to run a Goodness‐of‐fit Test 1. Enter the values of a qualitative variable under C1. 2. Click on “Stat”, choose “Tables” and then “Chi‐square Goodness of Fit Test (One Variable)”. 3. Click on the circle next to “Categorical data:” and enter C1 in the box to the right. 4. Under “Tests”, click on the circle next to “Equal proportions”. 5. Click on the “Graphs” button and uncheck the boxes next to both types of bar chart. 6. Click on “OK” in that window and on “OK” in the next window. The test results will appear in the “Sessions” window under the heading “Chi‐square Goodness of Fit Test for Categorical Variable: … ”. The test statistic will appear under “Chi‐Sq”. The P‐value will appear under “P‐value”. Example (Navidi & Monk, Elementary Statistics, 2nd edition, #23 p.566): The significance level is 0.01. Since 1 – 0.01 = 0.99, the confidence is 99%. The null hypothesis is that all months are equally likely. The alternative is the months are not all equally likely. The data is shown in the table below. Month Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec Alarms 32 15 37 38 45 48 46 42 34 36 28 26 Open Minitab and enter the alarms values under C1. A portion of the entered data is shown below. ↓ C1 C2 Alarms 1 32 2 15 3 37 4 38 5 45 6 48 7 46 8 42 9 34 10 36 Now click on “Stat” and then choose “Tables” and “Chi‐Square Goodness‐of‐Fit Test (One Variable) …”.
    [Show full text]
  • Goodness-Of-Fit Tests for Parametric Regression Models
    Goodness-of-Fit Tests for Parametric Regression Models Jianqing Fan and Li-Shan Huang Several new tests are proposed for examining the adequacy of a family of parametric models against large nonparametric alternatives. These tests formally check if the bias vector of residuals from parametric ts is negligible by using the adaptive Neyman test and other methods. The testing procedures formalize the traditional model diagnostic tools based on residual plots. We examine the rates of contiguous alternatives that can be detected consistently by the adaptive Neyman test. Applications of the procedures to the partially linear models are thoroughly discussed. Our simulation studies show that the new testing procedures are indeed powerful and omnibus. The power of the proposed tests is comparable to the F -test statistic even in the situations where the F test is known to be suitable and can be far more powerful than the F -test statistic in other situations. An application to testing linear models versus additive models is also discussed. KEY WORDS: Adaptive Neyman test; Contiguous alternatives; Partial linear model; Power; Wavelet thresholding. 1. INTRODUCTION independent observations from a population, Parametric linear models are frequently used to describe Y m4x5 ˜1 ˜ N 401‘ 251 (1.1) the association between a response variable and its predic- D C tors. The adequacy of such parametric ts often arises. Con- ventional methods rely on residual plots against tted values where x is a p-dimensional vector and m4 5 is a smooth regres- sion surface. Let f 4 1 ˆ5 be a given parametri¢ c family. The or a covariate variable to detect if there are any system- ¢ atic departures from zero in the residuals.
    [Show full text]
  • What Critics Say- What Facts
    What Critics of Fluoride Say & What the Facts Say Opponents of water fluoridation make a lot of claims that are at odds with the facts. This document provides examples of what critics say, followed up with what the facts say. For each topic, a “Learn more” link can provide you with more detailed information. 1. Critics Say: “The FDA has never approved fluoride’s use in drinking water.”! THE FACTS: The FDA does not have the authority to regulate fluoride in public drinking water. The Environmental Protection Agency (EPA) performs this role, and it sets firm guidelines for the amount of fluoride. The concentration of fluoride used for water fluoridation is far below the limit established by the EPA. Learn more. 2. Critics Say: “A Harvard study showed that fluoride lowers IQ scores for children.” THE FACTS: It wasn't a Harvard study. It was a group of studies from China and Iran, where water fluoridation isn't even practiced. These studies were seriously flawed for several reasons—mostly because they measured fluoride levels that were far higher than the levels we use for fluoridation in America. A far better study with a much larger sample was published in 2014 by the American Journal of Public Health, and this study showed there was no link at all between fluoride in water and IQ scores. Learn more 3. Critics Say: “We deserve natural water. Nothing should be added to our water supply in order to medicate us.” THE FACTS: Fluoride is a mineral that exists naturally in water supplies. Many U.S.
    [Show full text]