Median of the P Value Under the Alternative Hypothesis

Total Page:16

File Type:pdf, Size:1020Kb

Median of the P Value Under the Alternative Hypothesis Median ofthe p Value Underthe Alternative Hypothesis Bhaskar BHATTACHARYA and Desale HABTZGHI ormoreextreme than, the value actually observed. Thus, if the p valueis lessthan the preferredsigni cance level, then one rejects Dueto absence of anuniversally acceptable magnitude of the H0.Over theyears several authors have attempted to properly TypeI error invarious elds, p valuesare used as a well- explain p values.Gibbons and Pratt (1975) provided interpre- recognizedtool in decision making in all areas of statistical tationand methodology of the p values.Recently, Schervish practice.The distribution of p valuesunder the null hypothe- (1996)treated the p valuesas signi cance probabilities. Discus- sisis uniform. However, under the alternative hypothesis the sions on p valuescan be foundin Blythand Staudte (1997) and distributionof the p valuesis skewed. The expected p value inDollinger, Kulinskaya, and Staudte (1996). However, these (EPV) hasbeen proposed by authorsto beusedas a measureof articlesdo nottreat the p valuesas random. theperformance of thetest. This article proposes the median of The p valuesare based on the test statistics used and hence the p values(MPV) whichis more appropriate for thispurpose. random.The stochastic nature of the p valueswas investigated Weworkout many examples to calculate the MPV’ s directly byDempster and Schatzoff (1965) and Schatzoff (1966) who andalso compare the MPV withthe EPV .Weconsidertesting equalityof distributionsagainst stochastic ordering in themulti- introduced“ expectedsigni cance value.” Recently, Sackrowitz nomialcase and compare the EPV’ s andMPV’ s bysimulation. andSamuel-Cahn (1999) investigated this concept further and Asecondsimulation study for generalcontinuous data is also renamedit as the expected p value(EPV). Underthe null hy- consideredfor twosamples with different test statistics for the pothesis,the p valueshave a uniformdistribution over (0, 1) for samehypotheses. In both cases MPV performs betterthan EPV . anysample size. Thus, under H0, EPV is 1/2 always, and there isno wayto distinguish p valuesderived from largestudies and KEY WORDS: Comparingtest statistics; Expected p value; thosefrom small-scalestudies. Also it would be impossibleun- Median p value;Power; Stochastic order. der H0 todifferentiatebetween studies well powered to detect apositedalternative hypothesis and the underpowered to detect thesame posited alternative value. Incontrast, the distribution of the p valuesunder the alter- 1.INTRODUCTION nativehypothesis is a functionof thesample size and the true parametervalue in the alternative hypothesis. As the p values Thetheory of hypothesistesting depends heavily on thepre- measureevidence against the null hypothesis, it is of interest to specied valueof the signi cance level. T oavoidthe nonunique- investigatethe behavior of the p valuesunder the alternative at nessof the decision of testing the same hypotheses using the varioussample sizes. W ereject H when p valueis smallwhich sametest statistic but different signi cance levels, it is apop- 0 isexpected when H istrue.As notedby Hung,O’ Neill,Bauer, ularchoice to report the p value. The p valueis the smallest 1 andKohne (1997), the distribution of the p valuesunder the levelof signicance at which an experimenterwould reject the alternativeis highlyskewed. The skewness increases with the nullhypothesis on thebasis of theobserved outcome. The user samplesize and the true parameter value under the alternative cancompare his/ herown signi cance level with the p value and reecting the ability to detectthe alternativeby increasingpower makehis/ herown decision. The p valuesare particularlyuseful underthese situations. Hence it ismore appropriate to consider incaseswhen the null hypothesis is well de ned but the alterna- themedian of the value(MPV) insteadof theEPV underthe tiveis not(e.g., composite) so thatT ypeII error considerations p areunclear. In this context to quote Fisher, “ Theactual value of p alternativeas a measureof thecenterof its distributionwhich is obtainablefrom thetable by interpolation indicates the strength themain focus of thisarticle. Applications of thedistribution of ofevidenceagainst the null hypothesis.” the p valuesunder the alternative in thearea of meta-analysisof Wewillconsider tests of theform “Reject H when T c,” severalstudies is consideredby Hunget al.(1997). Studying the 0 ¶ where T isareal-valuedtest statistic computed from datawhen p valueunder the alternative is alsobene cial over the power of atestand is explainedin thenext paragraph. testingthe null hypothesis H0 againstthe alternative H1. The value c isdetermined from theprespeci ed sizerestrictions such Whenseveral test procedures are available for thesame test- that P (T c) = ¬ .Of course,when T isadiscreterandom ingsituation one compares them by meansof power.However, H0 ¶ variable,one needs to adoptrandomization so that all sizes are powercalculations depend on the chosen signi cance levels, possible.If t istheobserved value of T andthedistribution of T andin discrete cases involves randomization. These steps can under H is given by F ( ), then the p valueis givenby 1 F (t) beavoidedby considering MPV’ s. Also as thepower functions 0 0 ¢ ¡ 0 whichis theprobabilityof ndingthe test statistic as extremeas, dependon the chosen signi cance levels, it isdifcult to compare themwhen different power functions use different signi cance levels.On the otherhand, MPV’ s dependonly on thealternative BhaskarBhattacharya is Associate Professor,Department ofMathematics, andnot on the signi cance level. The smaller the valueof MPV , SouthernIllinois University, Carbondale, IL 62901(E-mail: Bhaskar@math. siu.edu).Desale Habtzghiis aGraduate Student,Department ofStatistics, Uni- thestronger the test. The value of an MPV cantell us which versityof Georgia,Athens, GA 30602. alternativean attained p valuebest represents for agivensam- 202The American Statistician, August 2002, V ol.56, No. 3 c 2002American Statistical Association DOI: 10.1198/000313002146 ® 1 plesize. Also, it helpsto knowthe behavior of theMPV’ s for Since F (t) F (t); t,itisseen that F (F ¡ (:5)) :5 which ³ µ 0 8 0 ³ ¶ varyingsample sizes. impliesthat ¬ :5 andequality holds when H is true. The ¤ µ 0 InSection 2, we obtaina generalexpression for theMPV’ s. smallerthe MPV thebetter it istodetectthe alternative. Given Wealsoderive a computationallyfavorable form tobe used thestochastic nature of the p valueunder the alternative, it is laterin thearticle. In Section 3, we considerseveral examples to alsotrue that the MPV issmallerthan the EPV andhence clearly computethe MPV’ s directly.In Section 4, we perform asimu- preferableover the EPV .TheMPV beingsmaller than the EPV lationstudy to compute the MPV’s inthecase of testingagainst producesa smallerindifference region in thesense that higher stochasticordering for multinomialdistributions. W ealsoper- poweris generated closer to the null hypothesis region using form asecondsimulation study for generaldistributions when MPV thanwith using EPV . testingthe same hypotheses with two samples using three test Itis alsopossible to expressthe MPV inanother way. Let T ¹ statisticsof t test,Mann– Whitney– Wilcoxon, and Kolmogorov– F ( ) and,independently, T F ( ).If theobserved value of ³ ¢ ¤ ¹ 0 ¢ Smirnov.In Section 5, we makeconcluding remarks. T is t, then the p valueis simply g(t) = P (T t T = t). The ¤ ¶ j MPV is 2. VALUE ASARANDOM VARIABLE AND ITS p med g(t) = MPV(³ ) = P (T ¤ med T ): (3) MEDIAN ¶ If H0 istrue, then T and T ¤ areidentically distributed, and hence For thetest statistic T withdistribution F0( ) under H0, let theabove probability is .5 for anycontinuous distribution. For 1 ¢ discretedistributions, although the above expressions still hold, F³ ( ) beitsdistribution under H1. Also, let F0¡ ( ) betheinverse ¢ 1 ¢ function of F ( ), so that, F (F ¡ (® )) = ® , for any 0 < ® < 1. MPV willbe slightly higher as P (T ¤ = med T ) > 0. For an 0 ¢ 0 0 Since the p valueis theprobability of observinga moreextreme UMP test,the MPV willbe uniformlyminimal for all ³ values valuethan the observedtest statistic value, as a randomvariable inthe alternative as comparedto theMPV’ s ofanyother test of itcanbe expressedas the same H0 versus H1.For computationalpurposes the above form oftheMPV in(3) isvery useful. X = 1 F0(T ): ¡ 3. EXAMPLES As F (T ) U(0; 1) under H , so is X.Thepower of the test 0 ¹ 0 Thissection considers several examples to calculate the isrelatedto the p value as MPV’s directly. ­ = P³ (X ¬ ) µ Example 1. Let X1; X2; : : : ; Xn bearandomsample of size n 1 2 = 1 F³ (F0¡ (1 ¬ )): (1) from a N(· ; ¼ ) distribution,and we liketo test H : · · ¡ ¡ 0 µ 0 versus H : · > · where ¼ isknown. Using the test statistic Notethe above is also the distribution function of the p value 1 0 T = X anda particularvalue · in H itfollows from (2) that underthe alternative. As T isstochastically larger under the 1 1 alternativethan under the null hypothesis, it followsthat the p pn(· · ) MPV = © 0 ¡ 1 ; (4) valueunder the alternative is stochastically smaller than under ¼ thenull (Lehmann 1986). This explains why the distribution of µ ¶ the p valueis skewedto therightunder the alternative. Hence to where © isthecdf of thestandard normal distribution. It iswell estimatethe center of thedistribution of the p value,the median knownthat for asize ¬ testto achieve power ­ ata specied isa betterchoice than the mean. The median is any value of alternativevalue · 1 thesample size n satis es ¬ = ¬ ¤ whichsatis es 2 2 (z1 ¬ + z­
Recommended publications
  • Hypothesis Testing and Likelihood Ratio Tests
    Hypottthesiiis tttestttiiing and llliiikellliiihood ratttiiio tttesttts Y We will adopt the following model for observed data. The distribution of Y = (Y1, ..., Yn) is parameter considered known except for some paramett er ç, which may be a vector ç = (ç1, ..., çk); ç“Ç, the paramettter space. The parameter space will usually be an open set. If Y is a continuous random variable, its probabiiillliiittty densiiittty functttiiion (pdf) will de denoted f(yy;ç) . If Y is y probability mass function y Y y discrete then f(yy;ç) represents the probabii ll ii tt y mass functt ii on (pmf); f(yy;ç) = Pç(YY=yy). A stttatttiiistttiiicalll hypottthesiiis is a statement about the value of ç. We are interested in testing the null hypothesis H0: ç“Ç0 versus the alternative hypothesis H1: ç“Ç1. Where Ç0 and Ç1 ¶ Ç. hypothesis test Naturally Ç0 § Ç1 = ∅, but we need not have Ç0 ∞ Ç1 = Ç. A hypott hesii s tt estt is a procedure critical region for deciding between H0 and H1 based on the sample data. It is equivalent to a crii tt ii call regii on: a critical region is a set C ¶ Rn y such that if y = (y1, ..., yn) “ C, H0 is rejected. Typically C is expressed in terms of the value of some tttesttt stttatttiiistttiiic, a function of the sample data. For µ example, we might have C = {(y , ..., y ): y – 0 ≥ 3.324}. The number 3.324 here is called a 1 n s/ n µ criiitttiiicalll valllue of the test statistic Y – 0 . S/ n If y“C but ç“Ç 0, we have committed a Type I error.
    [Show full text]
  • IMS Presidential Address(Es) Each Year, at the End of Their Term, the IMS CONTENTS President Gives an Address at the Annual Meeting
    Volume 50 • Issue 6 IMS Bulletin September 2021 IMS Presidential Address(es) Each year, at the end of their term, the IMS CONTENTS President gives an address at the Annual Meeting. 1 IMS Presidential Addresses: Because of the pandemic, the Annual Meeting, Regina Liu and Susan Murphy which would have been at the World Congress in Seoul, didn’t happen, and Susan Murphy’s 2–3 Members’ news : Bin Yu, Marloes Maathuis, Jeff Wu, Presidential Address was postponed to this year. Ivan Corwin, Regina Liu, Anru The 2020–21 President, Regina Liu, gave her Zhang, Michael I. Jordan Address by video at the virtual JSM, which incorpo- 6 Profile: Michael Klass rated Susan’s video Address. The transcript of the two addresses is below. 8 Recent papers: Annals of 2020–21 IMS President Regina Liu Probability, Annals of Applied Probability, ALEA, Probability Regina Liu: It’s a great honor to be the IMS President, and to present this IMS and Mathematical Statistics Presidential address in the 2021 JSM. Like many other things in this pandemic time, this year’s IMS Presidential Address will be also somewhat different from those in the 10 IMS Awards past. Besides being virtual, it will be also given jointly by the IMS Past President Susan 11 IMS London meeting; Olga Murphy and myself. The pandemic prevented Susan from giving her IMS Presidential Taussky-Todd Lecture; Address last year. In keeping with the IMS tradition, I invited Susan to join me today, Observational Studies and I am very pleased she accepted my invitation. Now, Susan will go first with her 12 Sound the Gong: Going 2020 IMS Presidential Address.
    [Show full text]
  • Applied Biostatistics Mean and Standard Deviation the Mean the Median Is Not the Only Measure of Central Value for a Distribution
    Health Sciences M.Sc. Programme Applied Biostatistics Mean and Standard Deviation The mean The median is not the only measure of central value for a distribution. Another is the arithmetic mean or average, usually referred to simply as the mean. This is found by taking the sum of the observations and dividing by their number. The mean is often denoted by a little bar over the symbol for the variable, e.g. x . The sample mean has much nicer mathematical properties than the median and is thus more useful for the comparison methods described later. The median is a very useful descriptive statistic, but not much used for other purposes. Median, mean and skewness The sum of the 57 FEV1s is 231.51 and hence the mean is 231.51/57 = 4.06. This is very close to the median, 4.1, so the median is within 1% of the mean. This is not so for the triglyceride data. The median triglyceride is 0.46 but the mean is 0.51, which is higher. The median is 10% away from the mean. If the distribution is symmetrical the sample mean and median will be about the same, but in a skew distribution they will not. If the distribution is skew to the right, as for serum triglyceride, the mean will be greater, if it is skew to the left the median will be greater. This is because the values in the tails affect the mean but not the median. Figure 1 shows the positions of the mean and median on the histogram of triglyceride.
    [Show full text]
  • STATISTICS in the DATA SCIENCE ERA: a Symposium to Celebrate 50 Years of Statistics at the University of Michigan
    STATISTICS IN THE DATA SCIENCE ERA: A Symposium to Celebrate 50 Years of Statistics at the University of Michigan September 20-21, 2019 Rackham Graduate School 915 E. Washington Street Ann Arbor, MI 48109 A Symposium to Celebrate 50 Years of Statistics at the University of Michigan 1 PROGRAM Friday, September 20, 2019 Saturday, September 21, 2019 9:30-10:30 ...............Coffee and Registration 8:30-9:00 ................ Coffee 10:30-10:50 ..............Opening Remarks Invited Session 2 Chair: Snigdha Panigrahi, University of Michigan Xuming He and Dean Anne Curzan 9:00-9:30 ................ Adam Rothman, University of Minnesota Shrinking Characteristics of Precision Matrix Keynote Session 1 Chair: Ambuj Tewari, University of Michigan Estimators 10:50-11:50 ...............Susan Murphy, Harvard University Online Experimentation with Learning Algorithms 9:30-10:00 ............... Bodhisattva Sen, Columbia University in a Clinical Trial Multivariate Rank-based Distribution-free Nonparametric Testing using Measure 11:50-12:00 ...............Group Photo Transportation PROGRAM 12:00-1:30 ................Lunch and Poster Session 10:00-10:30 ............. Min Qian, Columbia University Personalized Policy Learning using Longitudinal University of Michigan Invited Session 1 Chair: Ziwei Zhu, Mobile Health Data 1:30-2:00 .................Sumanta Basu, Cornell University Large Spectral Density Matrix Estimation by 10:30-11:00 .............. Coffee Break Thresholding Invited Session 3 Chair: Kean Ming Tan, University of Michigan 2:00-2:30 .................Anindya Bhadra, Purdue University 11:00-11:30 ............... Ali Shojaie, University of Washington Horseshoe Regularization for Machine Learning in Incorporating Auxiliary Information into Learning Complex and Deep Models Directed Acyclic Graphs 2:30-3:00 .................Coffee Break 11:30-12:00 ..............
    [Show full text]
  • The Completely Sufficient Statistician (Css)
    Kansas State University Libraries New Prairie Press Conference on Applied Statistics in Agriculture 2002 - 14th Annual Conference Proceedings THE COMPLETELY SUFFICIENT STATISTICIAN (CSS) Ralph G. O'Brien Follow this and additional works at: https://newprairiepress.org/agstatconference Part of the Agriculture Commons, and the Applied Statistics Commons This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License. Recommended Citation O'Brien, Ralph G. (2002). "THE COMPLETELY SUFFICIENT STATISTICIAN (CSS)," Conference on Applied Statistics in Agriculture. https://doi.org/10.4148/2475-7772.1196 This is brought to you for free and open access by the Conferences at New Prairie Press. It has been accepted for inclusion in Conference on Applied Statistics in Agriculture by an authorized administrator of New Prairie Press. For more information, please contact [email protected]. Conference on Applied Statistics in Agriculture Kansas State University Applied Statistics in Agriculture 1 THE COMPLETELY SUFFICIENT STATISTICIAN (CSS) Ralph G. O'Brien Department ofBiostatistics and Epidemiology Cleveland Clinic Foundation, Cleveland, OH 44195 Today's ideal statistical scientist develops and maintains a broad range of technical skills and personal qualities in four domains: (1) numeracy, in mathematics and numerical computing; (2) articulacy and people skills; (3) literacy, in technical writing and in programm,ing; and (4) graphicacy. Yet, many of these components are given short shrift in university statistics programs. Can even the best statistician today really be "completely sufficient?" Introduction Let me start by saying that I was absolutely delighted, grateful, and humbled to be invited to give the workshop and keynote address at this meeting.
    [Show full text]
  • TR Multivariate Conditional Median Estimation
    Discussion Paper: 2004/02 TR multivariate conditional median estimation Jan G. de Gooijer and Ali Gannoun www.fee.uva.nl/ke/UvA-Econometrics Department of Quantitative Economics Faculty of Economics and Econometrics Universiteit van Amsterdam Roetersstraat 11 1018 WB AMSTERDAM The Netherlands TR Multivariate Conditional Median Estimation Jan G. De Gooijer1 and Ali Gannoun2 1 Department of Quantitative Economics University of Amsterdam Roetersstraat 11, 1018 WB Amsterdam, The Netherlands Telephone: +31—20—525 4244; Fax: +31—20—525 4349 e-mail: [email protected] 2 Laboratoire de Probabilit´es et Statistique Universit´e Montpellier II Place Eug`ene Bataillon, 34095 Montpellier C´edex 5, France Telephone: +33-4-67-14-3578; Fax: +33-4-67-14-4974 e-mail: [email protected] Abstract An affine equivariant version of the nonparametric spatial conditional median (SCM) is con- structed, using an adaptive transformation-retransformation (TR) procedure. The relative performance of SCM estimates, computed with and without applying the TR—procedure, are compared through simulation. Also included is the vector of coordinate conditional, kernel- based, medians (VCCMs). The methodology is illustrated via an empirical data set. It is shown that the TR—SCM estimator is more efficient than the SCM estimator, even when the amount of contamination in the data set is as high as 25%. The TR—VCCM- and VCCM estimators lack efficiency, and consequently should not be used in practice. Key Words: Spatial conditional median; kernel; retransformation; robust; transformation. 1 Introduction p s Let (X1, Y 1),...,(Xn, Y n) be independent replicates of a random vector (X, Y ) IR IR { } ∈ × where p > 1,s > 2, and n>p+ s.
    [Show full text]
  • On the Scale Parameter of Exponential Distribution
    Review of the Air Force Academy No.2 (34)/2017 ON THE SCALE PARAMETER OF EXPONENTIAL DISTRIBUTION Anca Ileana LUPAŞ Military Technical Academy, Bucharest, Romania ([email protected]) DOI: 10.19062/1842-9238.2017.15.2.16 Abstract: Exponential distribution is one of the widely used continuous distributions in various fields for statistical applications. In this paper we study the exact and asymptotical distribution of the scale parameter for this distribution. We will also define the confidence intervals for the studied parameter as well as the fixed length confidence intervals. 1. INTRODUCTION Exponential distribution is used in various statistical applications. Therefore, we often encounter exponential distribution in applications such as: life tables, reliability studies, extreme values analysis and others. In the following paper, we focus our attention on the exact and asymptotical repartition of the exponential distribution scale parameter estimator. 2. SCALE PARAMETER ESTIMATOR OF THE EXPONENTIAL DISTRIBUTION We will consider the random variable X with the following cumulative distribution function: x F(x ; ) 1 e ( x 0 , 0) (1) where is an unknown scale parameter Using the relationships between MXXX( ) ; 22( ) ; ( ) , we obtain ()X a theoretical variation coefficient 1. This is a useful indicator, especially if MX() you have observational data which seems to be exponential and with variation coefficient of the selection closed to 1. If we consider x12, x ,... xn as a part of a population that follows an exponential distribution, then by using the maximum likelihood estimation method we obtain the following estimate n ˆ 1 xi (2) n i1 119 On the Scale Parameter of Exponential Distribution Since M ˆ , it follows that ˆ is an unbiased estimator for .
    [Show full text]
  • 5. the Student T Distribution
    Virtual Laboratories > 4. Special Distributions > 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 5. The Student t Distribution In this section we will study a distribution that has special importance in statistics. In particular, this distribution will arise in the study of a standardized version of the sample mean when the underlying distribution is normal. The Probability Density Function Suppose that Z has the standard normal distribution, V has the chi-squared distribution with n degrees of freedom, and that Z and V are independent. Let Z T= √V/n In the following exercise, you will show that T has probability density function given by −(n +1) /2 Γ((n + 1) / 2) t2 f(t)= 1 + , t∈ℝ ( n ) √n π Γ(n / 2) 1. Show that T has the given probability density function by using the following steps. n a. Show first that the conditional distribution of T given V=v is normal with mean 0 a nd variance v . b. Use (a) to find the joint probability density function of (T,V). c. Integrate the joint probability density function in (b) with respect to v to find the probability density function of T. The distribution of T is known as the Student t distribution with n degree of freedom. The distribution is well defined for any n > 0, but in practice, only positive integer values of n are of interest. This distribution was first studied by William Gosset, who published under the pseudonym Student. In addition to supplying the proof, Exercise 1 provides a good way of thinking of the t distribution: the t distribution arises when the variance of a mean 0 normal distribution is randomized in a certain way.
    [Show full text]
  • A R Software for Wavelets and Statistics
    A R Software for Wavelets and Statistics Here is a list and a brief description of some important R packages related to/that make use of wavelets. Such packages are important as they enable ideas to be reproduced, checked, and modified by the scientific community. This is probably just as important as the advertisement of the ideas through scientific papers and books. This is the ‘reproducible research’ view expressed by Buckheit and Donoho (1995). Making software packages freely available is an extremely valuable service to the community and so we choose to acclaim the author(s) of each package by displaying their names! The descriptions are extracted from each package description from CRAN. The dates refer to the latest updates on CRAN, not the original publication date. Please let me know of any omissions. adlift performs adaptive lifting (generalized wavelet transform) for irregu- larly spaced data, see Nunes et al. (2006). Written by Matt Nunes and Marina Knight, University of Bristol, UK, 2006. brainwaver computes a set of scale-dependent correlation matrices between sets of preprocessed functional magnetic resonance imaging data and then performs a ‘small-world’ analysis of these matrices, see Achard et al. (2006). Written by Sophie Achard, Brain Mapping Unit, University of Cambridge, UK, 2007. CVThresh carries out level-dependent cross-validation for threshold selection in wavelet shrinkage, see Kim and Oh (2006). Written by Donghoh Kim, Hongik University and Hee-Seok Oh, Seoul National University, Korea, 2006. DDHFm implements the data-driven Haar–Fisz transform as described in 6.5, see references therein. Written by Efthimios Motakis, Piotr Fryzlewicz and Guy Nason, University of Bristol, Bristol, UK.
    [Show full text]
  • The Probability Lifesaver: Order Statistics and the Median Theorem
    The Probability Lifesaver: Order Statistics and the Median Theorem Steven J. Miller December 30, 2015 Contents 1 Order Statistics and the Median Theorem 3 1.1 Definition of the Median 5 1.2 Order Statistics 10 1.3 Examples of Order Statistics 15 1.4 TheSampleDistributionoftheMedian 17 1.5 TechnicalboundsforproofofMedianTheorem 20 1.6 TheMedianofNormalRandomVariables 22 2 • Greetings again! In this supplemental chapter we develop the theory of order statistics in order to prove The Median Theorem. This is a beautiful result in its own, but also extremely important as a substitute for the Central Limit Theorem, and allows us to say non- trivial things when the CLT is unavailable. Chapter 1 Order Statistics and the Median Theorem The Central Limit Theorem is one of the gems of probability. It’s easy to use and its hypotheses are satisfied in a wealth of problems. Many courses build towards a proof of this beautiful and powerful result, as it truly is ‘central’ to the entire subject. Not to detract from the majesty of this wonderful result, however, what happens in those instances where it’s unavailable? For example, one of the key assumptions that must be met is that our random variables need to have finite higher moments, or at the very least a finite variance. What if we were to consider sums of Cauchy random variables? Is there anything we can say? This is not just a question of theoretical interest, of mathematicians generalizing for the sake of generalization. The following example from economics highlights why this chapter is more than just of theoretical interest.
    [Show full text]
  • LORI MCLEOD, Phd
    LORI MCLEOD, PhD Head, Psychometrics Education PhD, Quantitative Psychology, University of North Carolina, Chapel Hill, NC MA, Quantitative Psychology, University of North Carolina, Chapel Hill, NC BS, Statistics, Mathematics Education (graduated summa cum laude), North Carolina State University, Raleigh, NC Summary of Professional Experience Lori McLeod, PhD, is Head of Psychometrics at RTI-HS. Dr. McLeod is a psychometrician with more than 15 years of experience in instrument development and validation, as well as experience conducting systematic assessments of clinical and economic literature and developing appropriate health outcome strategies. In her Psychometrics role, she has conducted many psychometric evaluations of both paper-and-pencil and computer-administered instruments. These investigations have included the assessment of scale reliability, validity, responsiveness, and work to identify PRO responders. In addition, Dr. McLeod has experience conducting and analyzing data from clinical trials and observational studies, including data to document the burden of disease. Dr. McLeod has published numerous related manuscripts in Applied Psychological Measurement, Pharmacoeconomics, Quality of Life Research, and Psychometrika. She has experience in a wide variety of therapeutic areas, including chronic pain, dermatology, oncology, psychiatry, respiratory, sleep disorders, urology, and sexual dysfunction. Dr. McLeod currently serves as the co-leader of the Psychometric Special Interest Group for the International Society on Quality
    [Show full text]
  • Notes Mean, Median, Mode & Range
    Notes Mean, Median, Mode & Range How Do You Use Mode, Median, Mean, and Range to Describe Data? There are many ways to describe the characteristics of a set of data. The mode, median, and mean are all called measures of central tendency. These measures of central tendency and range are described in the table below. The mode of a set of data Use the mode to show which describes which value occurs value in a set of data occurs most frequently. If two or more most often. For the set numbers occur the same number {1, 1, 2, 3, 5, 6, 10}, of times and occur more often the mode is 1 because it occurs Mode than all the other numbers in the most frequently. set, those numbers are all modes for the data set. If each number in the set occurs the same number of times, the set of data has no mode. The median of a set of data Use the median to show which describes what value is in the number in a set of data is in the middle if the set is ordered from middle when the numbers are greatest to least or from least to listed in order. greatest. If there are an even For the set {1, 1, 2, 3, 5, 6, 10}, number of values, the median is the median is 3 because it is in the average of the two middle the middle when the numbers are Median values. Half of the values are listed in order. greater than the median, and half of the values are less than the median.
    [Show full text]