Extensions to the Kruskal-Wallis Test and a Generalised Median Test

Total Page:16

File Type:pdf, Size:1020Kb

Extensions to the Kruskal-Wallis Test and a Generalised Median Test c Journal of Applied Mathematics Decision Sciences Reprints Available directly from the Editor Printed in New Zealand Extensions to the KruskalWallis Test and a Generalised Median Test with Extensions JCW RAYNER john rayneruoweduau Department of Applied Statistics University of Wol longong Northelds Avenue NSW Australia DJ BEST johnbbiomsyddfstcsiroau CSIRO Mathematical and Information Sciences PO Box North Ryde NSW Australia Abstract The data for the tests considered here may b e presented in twowaycontingency ta bles with all marginal totals xed Weshow that Pearsons test statistic X P for Pearson may P b e partitioned into useful and informative comp onents The rst detects lo cation dierences b e tween the treatments and the subsequent comp onents detect disp ersion and higher order moment dierences For KruskalWallistyp e data when there are no ties the lo cation comp onent is the KruskalWallis test The subsequent comp onents are the extensions Our approach enables us to generalise to when there are ties and to when there is a xed numb er of categories and a large numb er of observations We also prop ose a generalisation of the wellknown median test In this reduces to the usual median test statistic situation the lo cationdetecting rst comp onentofX P when there are only two categories Subsequent comp onents detect higher moment departures from the null hyp othesis of equal treatmen t eects Keywords Categorical data Comp onents Nonparametric tests Orthonormal p olynomial Twoway data Intro duction The idea of decomp osing a test into orthogonal contrasts as in the analysis of variance has long b een appreciated by statisticians as a way of making hyp othesis tests more informative In the authors smo oth go o dness of t work see Rayner and Best a similar approachis pursued Omnibus test statistics are par titioned into smo oth comp onents We dene the comp onents of a test statistic to be asymptotically pairwise indep endent with each asymptotically having the chisquared distribution and suchthat their sum gives the original test statistic ts provide powerful directional tests and permit a convenient and The comp onen informative scrutinyofthe data This approachis applied to Sp earmans test in Best and Rayner and Rayner and Best a Rayner and Best b gaveanoverview of this approach applied to several commonly used nonparametric tests including the Friedman and Durbin tests Data for a generalisation of the median test that we subsequently prop ose and for the KruskalWallis test both with and without ties may be presented in the form of twoway tables with xed marginal totals Wederive the covariance matrix JCW RAYNER AND DJ BEST of entries in such tables and then partition a multiple of X into comp onents that P detect lo cation and higher moment dierences b etween rows For KruskalWallistyp e data when there are no ties the lo cation comp onentis the KruskalWallis test Our approach enables us to generalise to when there are ties and to when there is a xed number of categories and a large number of observations We also prop ose a generalisation of the wellknown median test The lo cation detecting rst comp onentofX reduces to the usual median test statistic P when there are only two categories Using more categories allows comp onents other than this lo cation comp onent to b e calculated These additional comp onents that detect disp ersion and higher moment eects are not available when using the usual median test The structure of this pap er is as follows In the next section the mo del for two way contingency tables with xed marginal totals is given and Pearsons X is P derived as a test statistic for the null hyp othesis of like rows In section three a multiple of X is partitioned into comp onents The material in section and P will b e familiar to many readers but is necessary background for the new work In section four it is shown that when there are no ties the rst comp onent is the usual KruskalWallis statistic The nonlo cation detecting comp onents are our extensions Section ve generalises the treatment to when there are ties Section six intro duces a generalisation of the usual median X test whichisthus identied as a lo cation detecting test the extensions p ermit disp ersion and other eects to b e detected A Mo del and Pearsons X Test Supp ose wehavea twoway table of counts N with i r and j c ij ely n i r and n j c The row and column totals resp ectiv i j are known constants Under the null hyp othesis of simple random sampling the likeliho o d was given byRoy and Mitra as r c r c Y Y Y Y n n n n i j ij i j i j P P in which n n n is the grand total of the observations The mo d i j i j els for tables with just one set of marginal totals xed or only the grand total xed are quite dierent from our mo del in which all row and column totals are xed See Lancaster chapter XI section pp This likeliho o d can b e expressed as a pro duct of extended or multivariate hyp ergeometric probability functions r c Y Y n n n n j ij i C C n n ij i i j taken with resp ect to the To nd moments of the N exp ectations may be ij distribution of the second row conditional on knowledge of the column sums of the EXTENDED KRUSKALWALLIS AND MEDIAN TESTS rst tworows then conditional on the column sums of the rst three rows and so on It suces to know the moments of the extended hyp ergeometric distribution Details are given in the App endix Wend E N n n n i r and j c ij i j T T T T Write N N N i r and N N N so that N is i i ic r the vector of all the cell counts The jointcovariance matrix of N and N is for i j i j n n n n n n r r s i j diag cov N N i j n n n Write f n n j c and j j n n n n r r s R diag n n The covariance matrix of N is cov N fdiag f f f gR where is j i j the direct or Kronecker pro duct See Lancaster for details ab out direct or Kronecker sums and pro ducts Now dene the standardised cell counts q Z N E N E N i r and j c ij ij ij ij T Z Z Z Z Z c r rc I the a by a identity matrix and the a byvector with every element Then a a q cov Z fI f f gR r i j p The matrix fI f f g has r latent ro ots and one latent ro ot zero The r i j latent ro ots of R are dicult to nd in general but their asymptotic limits follow from Lancaster Chapter V Lancaster showed that the quadratic form with vector the standardised cell counts and matrix essentially R is the familiar Pearson go o dness of t statistic with asymptotic distribution Hence the c latent ro ots of R are asymptotically one c times and zero once So under the null hyp othesis of simple random sampling Z has zero mean and covariance matrix cov Z which asymptotically has r c latent ro ots one and the remaining r c latent ro ots zero In the well known and often used classical mo del r and c are xed and the total coun t n The test statistic X is given by P r c X X n n n n i j i j T Z Z N X ij P n n i j Wenow conrm that our mo del leads to this test statistic Supp ose H is orthogonal and diagonalises cov Z Asymptotically we then have JCW RAYNER AND DJ BEST T H covZ H I O r c r c T T where means direct or Kronecker sum Dene Y H Z Now Z Z T Y Y in which Y by the multivariate Central Limit Theorem is asymptotically N I under the null hyp othesis of simple random sampling rc r c r c T T It follows that under the null hyp othesis X Z Z Y Y asymptotically has P the distribution r c Partitioning Pearsons Statistic We now show that X may be partitioned into comp onents the sth of which P detects sth moment departures from the null hyp othesis of similarly distributed rows treatments rc X There is some choice in dening Y The elements Y of Y are suchthat X i i P i the Y asH is not yet fully sp ecied In doing so our aim is to nd Y that can i i e one such partition rst supp ose be easily and usefully interpreted To achiev that fg j g is the set of p olynomials orthonormal on fn n g See the App endix s j for the denitions of the rst two p olynomials and the derivation of subsequent p olynomials This approach results when there are no ties in the rst comp onent b eing the KruskalWallis test Write g for the c byvector with elements g j s s Dene G by p G G G c c in which G is the rc by r matrix s g s g s G s c and s g s c c is also rc by r G c c q T n G Z The elements of Y may be considered in blo cks of Dene Y n r the sth blo ck corresp onding to the p olynomial of order s These blo cks are T T T V in which asymptotically mutually indep endent Write Y V c T T V Y Y V Y Y r c cr cr EXTENDED KRUSKALWALLIS AND MEDIAN TESTS and V all the V are r by so that c s n n T T T T X Z Z Y Y V V V V c P c n n T n X into comp onents V V s c The V are This partitions s s s P n asymptotically mutually indep endent and asymptotically N I so that r r T the V V are asymptotically mutually indep endent Explicitly wehave for s s r s c p p c X n n T A V G Z g j Z s s ij s n n j Because V involves through g a p olynomial of order s the elements of V are s s s p olynomials of order s in the elements of N Under the null hyp othesis E Z but when this is not true E V involves moments up to order s of Z So for s T V detects sth moment departures from the null hyp othesis of s r V s s similarly distributed rows treatments Three instructors assign Instructors Example See Conover p grades in ve categories according to the following
Recommended publications
  • Statistical Analysis in JASP
    Copyright © 2018 by Mark A Goss-Sampson. All rights reserved. This book or any portion thereof may not be reproduced or used in any manner whatsoever without the express written permission of the author except for the purposes of research, education or private study. CONTENTS PREFACE .................................................................................................................................................. 1 USING THE JASP INTERFACE .................................................................................................................... 2 DESCRIPTIVE STATISTICS ......................................................................................................................... 8 EXPLORING DATA INTEGRITY ................................................................................................................ 15 ONE SAMPLE T-TEST ............................................................................................................................. 22 BINOMIAL TEST ..................................................................................................................................... 25 MULTINOMIAL TEST .............................................................................................................................. 28 CHI-SQUARE ‘GOODNESS-OF-FIT’ TEST............................................................................................. 30 MULTINOMIAL AND Χ2 ‘GOODNESS-OF-FIT’ TEST. ..........................................................................
    [Show full text]
  • Mood Median Test Example
    Mood Median Test Example Marcello is veilless: she bracket unobtrusively and twin her demurrage. Deep-fried and shotten Isador rerunning almost peculiarly, though Grady hat his majesties misrepresents. Hiralal is sanative and westernizes prolately as unmentioned Orrin frenzy alternatively and outstand aiblins. The results are significant and conclude that the median crawling ages appear to be equal for the three age group populations. What makes an extension of hotdogs have matched pairs and population distribution is greater? The Council of State Governments. It is shown below grand median test to go into families of mood test does verizon customers used instead. The NPAR1WAY Procedure SAS Support. Bpo as the null hypothesis of proportional and quantified to use to tied observations recorded. Non-parametric tests One Sample Test Wilcoxon Signed-Rank One sample tests I. Thanks for example would prefer to compare to be little difference in terms of a systematic differences is median to a mood median test example. Call access the observations from the reference group. Nonparametric Statistics in HumanComputer Interaction. That test skirts the shape assumption by testing for children different make of centrality. Simulation studies are testing to test? 23 Mood's Median Test YouTube. There were more type A people in the initial sample, a chi square approximation to. Alternative Hypothesis: The population here of the ages of soil with on five types of educational degrees are not all is same. Time Tukey HSD Mean Difference Std. THE MEDIAN TEST A SIGN TEST FOR TWO INDEPENDENT SAMPLES FUNCTION It were give information as to eclipse it is likely no two.
    [Show full text]
  • Chapter 3 Commonly Used Statistical Terms
    CHAPTER 3 COMMONLY USED STATISTICAL TERMS There are many statistics used in social science research and evaluation. The two main areas of statistics are descriptive and inferential. The third class of statistics is design and experimental statistics. Descriptive statistics involve the tabulating, depicting, and describing of col- lections of data. These data may be either quantitative or qualitative. They provide a picture or description of the properties of data collected in order to summarize them into manageable form. Inferential statistics are a formalized body of techniques that infer the properties of a larger collection of data from the inspection of that collection. They build on these statistics as they infer the properties of samples to various populations. Design and analysis statistics were developed for the discovery and confirmation of causal relationships among vari- ables in social science experiments. They use a variety of statistical tests related to aspects such as prediction and hypothesis testing. Experimental analysis is related to As in the previous glossary chapter, if you are looking for a term and it is not here, please send it to my e-mail address, [email protected], with the subject line: “New Statistical Terms Needed.” It will then be added in the next edition. 145 146 Pocket Glossary for Commonly Used Research Terms comparisons, variance, and ultimately testing whether variables are significant between each other. The lat- ter two types of statistics are usually either parametric or nonparametric. The importance of statistics in the research process is sometimes exaggerated. Thus, a highly sophisticated statistical analysis rarely, if ever, compen- sates for a poorly conceived project, a poorly constructed research design, or an inaccurate data collection instru- ment.
    [Show full text]
  • Resistant, Robust and Non-Parametric Techniques for the Analysis of Climate Data: Theory and Examples, Including Applications to Historical Radiosonde Station Data
    INTERNATIONAL JOURNAL OF CLIMATOLOGY, VOL. 16, 1197-1226 (1996) RESISTANT, ROBUST AND NON-PARAMETRIC TECHNIQUES FOR THE ANALYSIS OF CLIMATE DATA: THEORY AND EXAMPLES, INCLUDING APPLICATIONS TO HISTORICAL RADIOSONDE STATION DATA JOHN R. LANZANTE Geophysical Fluid Dynamics LaboratorylNOAA, Princeton University, Princeton, NJ 08542, USA email: [email protected] Received 16 August 1995 Accepted 7 February I996 ABSTRACT Basic traditional parametric statistical techniques are used widely in climatic studies for characterizing the level (central tendency) and variability of variables, assessing linear relationships (including trends), detection of climate change, quality control and assessment, identification of extreme events, etc. These techniques may involve estimation of parameters such as the mean (a measure of location), variance (a measure of scale) and correlatiodregression coefficients (measures of linear association); in addition, it is often desirable to estimate the statistical significance of the difference between estimates of the mean from two different samples as well as the significance of estimated measures of association. The validity of these estimates is based on underlying assumptions that sometimes are not met by real climate data. Two of these assumptions are addressed here: normality and homogeneity (and as a special case statistical stationarity); in particular, contamination from a relatively few ‘outlying values’ may greatly distort the estimates. Sometimes these common techniques are used in order to identify outliers; ironically they may fail because of the presence of the outliers! Alternative techniques drawn from the fields of resistant, robust and non-parametric statistics are usually much less affected by the presence of ‘outliers’ and other forms of non-normality. Some of the theoretical basis for the alternative techniques is presented as motivation for their use and to provide quantitative measures for their performance as compared with the traditional techniques that they may replace.
    [Show full text]
  • PASW Exact Tests
    Exact Tests ™ Cyrus R. Mehta and Nitin R. Patel For more information about SPSS® software products, please visit our WWW site at http://www.spss.com or contact Marketing Department SPSS Inc. 233 South Wacker Drive, 11th Floor Chicago, IL 60606-6307 Tel: (312) 651-3000 Fax: (312) 651-3668 SPSS is a registered trademark. PASW is a registered trademark of SPSS Inc. The SOFTWARE and documentation are provided with RESTRICTED RIGHTS. Use, duplication, or disclosure by the Government is subject to restrictions as set forth in subdivision (c)(1)(ii) of The Rights in Technical Data and Computer Software clause at 52.227-7013. Contractor/manufacturer is SPSS Inc., 233 South Wacker Drive, 11th Floor, Chicago, IL, 60606-6307. General notice: Other product names mentioned herein are used for identification purposes only and may be trademarks of their respective companies. TableLook is a trademark of SPSS Inc. Windows is a registered trademark of Microsoft Corporation. Printed in the United States of America. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the publisher. Preface Exact Tests is a statistical package for analyzing continuous or categorical data by ex- act methods. The goal in Exact Tests is to enable you to make reliable inferences when your data are small, sparse, heavily tied, or unbalanced and the validity of the corre- sponding large sample theory is in doubt. This is achieved by computing exact p values for a very wide class of hypothesis tests, including one-, two-, and K- sample tests, tests for unordered and ordered categorical data, and tests for measures of association.
    [Show full text]
  • A Survey of Nonparametric Tests for the Statistical Analysis of Evolutionary Computational Experiments
    International Journal “Information Theories and Applications”, Vol. 17, Number 1, 2010 49 A SURVEY OF NONPARAMETRIC TESTS FOR THE STATISTICAL ANALYSIS OF EVOLUTIONARY COMPUTATIONAL EXPERIMENTS Rafael Lahoz-Beltra, Carlos Perales-Gravan Abstract: One of the main problems in the statistical analysis of Evolutionary Computation (EC) experiments is the ‘statistical personality’ of data. A main feature of EC algorithms is the sampling of solutions from one generation to the next. Sampling is based on Holland’s schema theory, having a greater probability to be chosen those solutions with best-fitness (or evaluation) values. In consequence, simulation experiments result in biased samples with non-normal, highly skewed, and asymmetric distributions. Furthermore, the main problem arises with the noncompliance of one of the main premises of the central limit theorem, invalidating the statistical analysis based on the average fitness f of the solutions. In this paper, we address a tutorial or ‘How-to’ explaining the basics of the statistical analysis of data in EC. The use of nonparametric tests for comparing two or more medians combined with Exploratory Data Analysis is a good option, bearing in mind that we are only considering two experimental situations that are common in EC practitioners: (i) the performance evaluation of an algorithm and (ii) the multiple experiments comparison. The different approaches are illustrated with different examples (see http://bioinformatica.net/tests/survey.html) selected from Evolutionary Computation and the related field of Artificial Life. Keywords: Evolutionary Computation, Statistical Analysis and Simulation. ACM Classification Keywords: G.3 PROBABILITY AND STATISTICS Conference topic: Evolutionary Computation. Introduction Evolutionary Computation (EC) refers to a class of stochastic optimization algorithms inspired in the evolution of organisms in Nature by means of Darwinian natural selection [Lahoz-Beltra, 2004][Lahoz-Beltra, 2008].
    [Show full text]
  • B.Sc III Statistics 2020-21.Pdf
    SHIVAJI UNIVERSITY, KOLHAPUR. Accredited By NAAC With ‘A’ Grade CHOICE BASED CREDIT SYSTEM Syllabus For Bachelor of Science (Part III) Statistics Syllabus to be implemented from June 2020 onwards 1 Shivaji University, Kolhapur B. Sc. III Statistics Structure of the course To be implemented from June 2020 Theory Semester V Paper. No. Title of the paper Total Marks IX Probability Distributions 50 X Statistical Inference - I 50 XI Design of Experiments 50 XII R-Programming and Quality Management 50 Semester VI Paper. No. Title of the paper Total Marks XIII Probability Theory and Applications 50 XIV Statistical Inference - II 50 XV Sampling Theory 50 XVI Operations Research 50 Practical Marks Paper Total Title of the Practical for Journal Oral No. Marks Practical IV Probability Distributions 32 4 4 40 V Statistical Inference 32 4 4 40 Designs of Experiments and Sampling VI 32 4 4 40 Methods R-Programming, Quality Managementand VII 32 4 4 40 Operations Research A Project Report & Viva -voce 40 - - 40 2 1. Nature of Project (i) Identification of problem where statistical techniques can be used. (ii) Planning and execution of data collection. (iii) The Marking system for the project work is as follows: Data Collection : 8 Marks Use of Statistical Tools : 8 Marks Analysis of Data : 8 Marks Conclusion : 8 Marks Viva on Project : 8 marks ------------------------------------------ Total Marks of Project : 40 marks (iv) Project will be conducted in a group of 5 to 6 students. 2. Nature of Question papers (Theory) COMMON NATURE OF THEORY QUESTION PAPER WILL BE MENTIONED SPERATELY: 3. Nature of practical papers: (i) Each practical question paper must contain Four questions.
    [Show full text]
  • Research Methodology and Statistical Methods.Pmd
    Research Methodology and Statistical Methods Research Methodology and Statistical Methods Morgan Shields www.edtechpress.co.uk Published by ED-Tech Press, 54 Sun Street, Waltham Abbey Essex, United Kingdom, EN9 1EJ © 2019 by ED-Tech Press Reprinted2020 Research Methodology and Statistical Methods Morgan Shields Includes bibliographical references and index. ISBN 978-1-78882-100-1 All rights reserved. No part of this publication may be reproduced, stored in retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without either the prior written permission of the publisher or a license permitting restricted copying in the United Kingdom issued by the Copyright Licensing Agency Ltd., Saffron House, 6-10 Kirby Street, London EC1N 8TS. Trademark Notice: All trademarks used herein are the property of their respective owners. The use of any trademark in this text does not vest in the author or publisher any trademark ownership rights in such trademarks, nor does the use of such trademarks imply any affiliation with or endorsement of this book by such owners. Unless otherwise indicated herein, any third-party trademarks that may appear in this work are the property of their respective owners and any references to third-party trademarks, logos or other trade dress are for demonstrative or descriptive purposes only. Such references are not intended to imply any sponsorship, endorsement, authorization, or promotion of ED-Tech products by the owners of such marks, or any relationship between the owner and ED-Tech Press or its affiliates, authors, licensees or distributors. British Library Cataloguing in Publication Data.
    [Show full text]
  • Ranksum — Equality Tests on Unmatched Data
    Title stata.com ranksum — Equality tests on unmatched data Syntax Menu Description Options for ranksum Options for median Remarks and examples Stored results Methods and formulas References Also see Syntax Wilcoxon rank-sum test ranksum varname if in , by(groupvar) porder Nonparametric equality-of-medians test median varname if in weight , by(groupvar) median options ranksum options Description Main ∗ by(groupvar) grouping variable porder probability that variable for first group is larger than variable for second group median options Description Main ∗ by(groupvar) grouping variable exact perform Fisher’s exact test medianties(below) assign values equal to the median to below group medianties(above) assign values equal to the median to above group medianties(drop) drop values equal to the median from the analysis medianties(split) split values equal to the median equally between the two groups ∗by(groupvar) is required. by is allowed with ranksum and median; see [D] by. fweights are allowed with median; see [U] 11.1.6 weight. Menu ranksum Statistics > Nonparametric analysis > Tests of hypotheses > Wilcoxon rank-sum test median Statistics > Nonparametric analysis > Tests of hypotheses > K-sample equality-of-medians test 1 2 ranksum — Equality tests on unmatched data Description ranksum tests the hypothesis that two independent samples (that is, unmatched data) are from populations with the same distribution by using the Wilcoxon rank-sum test, which is also known as the Mann–Whitney two-sample statistic (Wilcoxon 1945; Mann and Whitney 1947). median performs a nonparametric k-sample test on the equality of medians. It tests the null hypothesis that the k samples were drawn from populations with the same median.
    [Show full text]
  • The P-Value for the Sign Test
    The p-value for the sign test Ana Marta Lisboa Vila Fernandes [email protected] Instituto Superior T´ecnico,Lisboa, Portugal December 2015 Abstract The sign test is a non-parametrical statistical procedure used to evaluate hypothesis about the q-th continuous population quantile. Denoted χq, with 0 < q < 1; the q-th continuous population quantile, on the two-sided sign test the null hypotesis states H0 : χq = χ0 while in the alternative H1 : χq 6= χ0. The test procedure is based on the sign differences between the sample observations and χ0. Considering the statistic Sn that accounts the frequency of the sign (+) on the sample of n observations, it is clear that Sn ∼ Bin(n; p) with p = 1 − q: When the observed value of the statistic sn is not close to [np], the sample favours to the alternative hypothesis. A decision rule can be conducted based on the p- value. Nevertheless, the commonly used p-value formula was originally applied on two-sided tests for statistics with continuous distributions. In the case of discrete or continuous and asymmetric statistics, the usual p-value formula can lead to incorrect values. For the two-sided sign test incoherent p-values can be obtained under particular conditions of the binomial statistic. The main goal of this thesis is to address in which situations the two-sided sign test p-values meaning is not clear. In order to solve this problem, there are some alternatives proposed in the literature and their advantages and disadvantages were analyzed. Also, a new p-value formula was introduced and its efficiency was compared with the alternative methods for the two-sided sign test.
    [Show full text]
  • Some Analytical Assays for the Determination of Bioactivity of Exotic
    Research Article Received: 9 November 2009; Revised: 1 January 2010; Accepted: 5 January 2010 Published online in Wiley Interscience: 22 February 2010 (www.interscience.wiley.com) DOI 10.1002/pca.1207 Some Analytical Assays for the Determination of Bioactivity of Exotic Fruits Shela Gorinstein,a* Ratiporn Haruenkit,b Sumitra Poovarodom,c Suchada Vearasilp,d Pramoj Ruamsuke,e Jacek Namiesnik,f Maria Leontowicz,g Hanna Leontowicz,g Milan Suhajh and Guo Ping Shengi ABSTRACT: Introduction – The consumption of new exotic fruits, with their high nutritional and sensory value, has signifi cantly increased in the past few years. Among the tropical fruits durian (Durio zibethinus Murr.) is less known than mango (Mangifera indica L.) and avocado (Persea americana). It has been shown that durian, mango and avocado possessed high nutritional and bioactive properties, but these data were determined using diff erent methods. In order to obtain reliable results we investigated samples of durian, mango and avocado of the same stage of ripeness and unifi ed methods were used for determination of the antioxidant potential. As far as we know, no results of such comparative investigation of three tropical fruits (durian, mango and avocado) and the use of such tests for phytochemical control have been published. Objective – Lyophilised durian, mango and avocado samples harvested in 2008 in Thailand and Israel were investigated. Methodology – The contents of crude protein, fat, carbohydrate, dietary fi bre, total polyphenols, fl avonoids, tannins and fl avanols were determined by elemental analysis and UV spectroscopy. The presence of polyphenols (fl avonoids and phenolic acids) in the investigated samples was studied by Fourier transform infrared (FT-IR) spectroscopy and three-dimensional fl uo- rometry.
    [Show full text]
  • Frequency Distribution, Cross-Tabulation and Hypothesis Testing Are the Fundamental Building Blocks of Quantitative Data Analysis
    CHAPTER 18 Stage 1 Frequency distribution, Problem definition cross-tabulation and hypothesis testing Stage 2 Research approach developed Objectives After reading this chapter, you should be able to: Stage 3 Research design 1 describe the significance of preliminary data analysis and the developed insights that can be obtained from such an analysis; 2 discuss data analysis associated with frequencies, including Stage 4 measures of location, measures of variability and measures of Fieldwork or data shape; collection 3 explain data analysis associated with cross-tabulations and the associated statistics: chi-square, phi coefficient, contingency Stage 5 coefficient, Cramer’s V and lambda coefficient; Data preparation 4 describe data analysis associated with parametric hypothesis and analysis testing for one sample, two independent samples and paired samples; Stage 6 5 understand data analysis associated with non-parametric Report preparation hypothesis testing for one sample, two independent samples and and presentation paired samples. Frequency distribution, cross-tabulation and hypothesis testing are the fundamental building blocks of quantitative data analysis. They provide insights into the data, guide subsequent analyses and aid the interpretation of results. Chapter 18 • Frequency distribution, cross-tabulation and hypothesis testing Overview Once the data have been prepared for analysis (Chapter 17), the researcher should con- duct basic analyses. This chapter describes basic data analyses, including frequency distribution, cross-tabulation and hypothesis testing. First, we describe the frequency distribution and explain how it provides both an indication of the number of out-of- range, missing or extreme values as well as insights into the central tendency, variability and shape of the underlying distribution.
    [Show full text]