Fixed Vs. Random Effects

Total Page:16

File Type:pdf, Size:1020Kb

Fixed Vs. Random Effects Statistics 203: Introduction to Regression and Analysis of Variance Fixed vs. Random Effects Jonathan Taylor - p. 1/19 Today’s class ● Today’s class ■ Random effects. ● Two-way ANOVA ● Random vs. fixed effects ■ ● When to use random effects? One-way random effects ANOVA. ● Example: sodium content in ■ beer Two-way mixed & random effects ANOVA. ● One-way random effects model ■ ● Implications for model Sattherwaite’s procedure. ● One-way random ANOVA table ● Inference for µ· ● 2 Estimating σµ ● Example: productivity study ● Two-way random effects model ● ANOVA tables: Two-way (random) ● Mixed effects model ● Two-way mixed effects model ● ANOVA tables: Two-way (mixed) ● Confidence intervals for variances ● Sattherwaite’s procedure - p. 2/19 Two-way ANOVA ● Today’s class ■ Second generalization: more than one grouping variable. ● Two-way ANOVA ● Random vs. fixed effects ■ ● When to use random effects? Two-way ANOVA model: observations: ● Example: sodium content in ≤ ≤ ≤ ≤ ≤ ≤ beer (Yijk); 1 i r; 1 j m; 1 k nij : r groups in first ● One-way random effects model grouping variable, m groups ins second and nij samples in ● Implications for model ● One-way random ANOVA (i; j)-“cell”: table ● Inference for µ· 2 ● 2 Estimating σµ Yijk = µ + αi + βj + (αβ)ij + "ijk; "ijk ∼ N(0; σ ): ● Example: productivity study ● Two-way random effects ■ model Constraints: ● ANOVA tables: Two-way ◆ r (random) αi = 0 ● i=1 Mixed effects model ◆ Pm ● Two-way mixed effects model j=1 βj = 0 ● ANOVA tables: Two-way Pm (mixed) ◆ (αβ) = 0; 1 ≤ i ≤ r ● Confidence intervals for j=1 ij variances ◆ Pr ● Sattherwaite’s procedure (αβ)ij = 0; 1 ≤ j ≤ m: Pi=1 - p. 3/19 Random vs. fixed effects ● Today’s class ■ In ANOVA examples we have seen so far, the categorical ● Two-way ANOVA ● Random vs. fixed effects variables are well-defined categories: below average fitness, ● When to use random effects? ● Example: sodium content in long duration, etc. beer ● One-way random effects ■ model In some designs, the categorical variable is “subject”. ● Implications for model ● One-way random ANOVA ■ Simplest example: repeated measures, where more than table ● Inference for µ· one (identical) measurement is taken on the same individual. ● Estimating σ2 µ ■ ● Example: productivity study In this case, the “group” effect αi is best thought of as ● Two-way random effects model random because we only sample a subset of the entire ● ANOVA tables: Two-way (random) population of subjects. ● Mixed effects model ● Two-way mixed effects model ● ANOVA tables: Two-way (mixed) ● Confidence intervals for variances ● Sattherwaite’s procedure - p. 4/19 When to use random effects? ● Today’s class ■ A “group” effect is random if we can think of the levels we ● Two-way ANOVA ● Random vs. fixed effects observe in that group to be samples from a larger population. ● When to use random effects? ● Example: sodium content in ■ beer Example: if collecting data from different medical centers, ● One-way random effects model “center” might be thought of as random. ● Implications for model ● One-way random ANOVA ■ Example: if surveying students on different campuses, table ● Inference for µ· “campus” may be a random effect. ● 2 Estimating σµ ● Example: productivity study ● Two-way random effects model ● ANOVA tables: Two-way (random) ● Mixed effects model ● Two-way mixed effects model ● ANOVA tables: Two-way (mixed) ● Confidence intervals for variances ● Sattherwaite’s procedure - p. 5/19 Example: sodium content in beer ● Today’s class ■ How much sodium is there in North American beer? How ● Two-way ANOVA ● Random vs. fixed effects much does this vary by brand? ● When to use random effects? ● Example: sodium content in ■ beer Observations: for 6 brands of beer, researchers recorded the ● One-way random effects model sodium content of 8 12 ounce bottles. ● Implications for model ● One-way random ANOVA ■ Questions of interest: what is the “grand mean” sodium table ● Inference for µ· content? How much variability is there from brand to brand? ● Estimating σ2 µ ■ ● Example: productivity study “Individuals” in this case are brands, repeated measures are ● Two-way random effects model the 8 bottles. ● ANOVA tables: Two-way (random) ● Mixed effects model ● Two-way mixed effects model ● ANOVA tables: Two-way (mixed) ● Confidence intervals for variances ● Sattherwaite’s procedure - p. 6/19 One-way random effects model ● Today’s class ■ Suppose we take n identical measurements from r subjects. ● Two-way ANOVA ● Random vs. fixed effects ■ ● When to use random effects? Yij ∼ µ· + αi + "ij ; 1 ≤ i ≤ r; 1 ≤ j ≤ n ● Example: sodium content in ■ 2 beer " ∼ N(0; σ ); 1 ≤ i ≤ r; 1 ≤ j ≤ n ● One-way random effects ij model ■ 2 ● Implications for model αi ∼ N(0; σµ); 1 ≤ i ≤ r: ● One-way random ANOVA table ■ ● Inference for µ· We might be interested in the population mean, µ·: CIs, is it ● 2 Estimating σµ zero? etc. ● Example: productivity study ● Two-way random effects ■ Alternatively, we might be interested in the variability across model ● ANOVA tables: Two-way subjects, 2 : CIs, is it zero? (random) σµ ● Mixed effects model ● Two-way mixed effects model ● ANOVA tables: Two-way (mixed) ● Confidence intervals for variances ● Sattherwaite’s procedure - p. 7/19 Implications for model ● Today’s class ■ In random effects model, the observations are no longer ● Two-way ANOVA ● Random vs. fixed effects independent (even if "’s are independent). In fact ● When to use random effects? ● Example: sodium content in beer 0 0 2 0 2 0 ● One-way random effects Cov(Yij; Yi j ) = σµδi;i + σ δj;j : model ● Implications for model ■ ● One-way random ANOVA In more complicated mixed effects models, this makes MLE table ● Inference for µ· more complicated: not only are there parameters in the ● 2 Estimating σµ ● Example: productivity study mean, but in the covariance as well. ● Two-way random effects ■ model In ordinary least squares regression, the only parameter to ● ANOVA tables: Two-way 2 2 (random) estimate is σ because the covariance matrix is σ I. ● Mixed effects model ● Two-way mixed effects model ● ANOVA tables: Two-way (mixed) ● Confidence intervals for variances ● Sattherwaite’s procedure - p. 8/19 One-way random ANOVA table ● Today’s class Source SS df E(MS) ● Two-way ANOVA r 2 Treatments SST R = n Y · − Y ·· r − 1 σ2 + nσ2 ● Random vs. fixed effects Pi=1 “ i ” µ ● When to use random effects? r n − 2 − 2 Error SSE = i=1 j=1(Yij Y i·) (n 1)r σ ● P P Example: sodium content in ■ beer Only change here is the expectation of SST R which reflects ● One-way random effects model randomness of α ’s. ● Implications for model i ● One-way random ANOVA ■ table ANOVA table is still useful to setup tests: the same F ● Inference for µ· ● 2 statistics for fixed or random will work here. Estimating σµ ● Example: productivity study ■ 2 ● Two-way random effects Under H0 : σµ = 0, it is easy to see that model ● ANOVA tables: Two-way (random) MST R ● Mixed effects model ∼ Fr−1;(n−1)r: ● Two-way mixed effects model MSE ● ANOVA tables: Two-way (mixed) ● Confidence intervals for variances ● Sattherwaite’s procedure - p. 9/19 Inference for µ· ● Today’s class ■ ● Two-way ANOVA We know that E(Y ··) = µ·, and can show that ● Random vs. fixed effects ● When to use random effects? 2 2 ● Example: sodium content in nσµ + σ beer Var(Y ··) = : ● One-way random effects rn model ● Implications for model ■ ● One-way random ANOVA Therefore, table ● Inference for µ· Y ·· − µ· ● 2 Estimating σµ ∼ tr−1 ● Example: productivity study SST R ● Two-way random effects q (r−1)rn model ● ANOVA tables: Two-way ■ (random) Why r − 1 degrees of freedom? Imagine we could record an ● Mixed effects model ● Two-way mixed effects model infinite number of observations for each individual, so that ● ANOVA tables: Two-way (mixed) Y i· ! µi. ● Confidence intervals for variances ■ · ● Sattherwaite’s procedure To learn anything about µ we still only have r observations (µ1; : : : ; µr). ■ Sampling more within an individual cannot narrow the CI for µ·. - p. 10/19 2 Estimating σµ ● Today’s class ■ From the ANOVA table ● Two-way ANOVA ● Random vs. fixed effects ● When to use random effects? 2 E(SST R=(r − 1)) − E(SSE=((n − 1)r)) ● Example: sodium content in σ = : beer µ ● One-way random effects n model ● Implications for model ■ Natural estimate: ● One-way random ANOVA table ● Inference for µ· 2 SST R=(r − 1) − SSE=((n − 1)r) ● 2 Estimating σµ Sµ = ● Example: productivity study n ● Two-way random effects model ■ ● ANOVA tables: Two-way Problem: this estimate can be negative! One of the (random) ● Mixed effects model difficulties in random effects model. ● Two-way mixed effects model ● ANOVA tables: Two-way (mixed) ● Confidence intervals for variances ● Sattherwaite’s procedure - p. 11/19 Example: productivity study ● Today’s class ■ Imagine a study on the productivity of employees in a large ● Two-way ANOVA ● Random vs. fixed effects manufacturing company. ● When to use random effects? ● Example: sodium content in ■ beer Company wants to get an idea of daily productivity, and how ● One-way random effects model it depends on which machine an employee uses. ● Implications for model ● ■ One-way random ANOVA Study: take m employees and r machines, having each table ● Inference for µ· employee work on each machine for a total of n days. ● Estimating σ2 µ ■ ● Example: productivity study As these employees are not all employees, and these ● Two-way random effects model machines are not all machines it makes sense to think of ● ANOVA tables: Two-way (random) both the effects of machine and employees (and ● Mixed effects model ● Two-way mixed effects model interactions) as random. ● ANOVA tables: Two-way (mixed) ● Confidence intervals for variances ● Sattherwaite’s procedure - p.
Recommended publications
  • Regression Summary Project Analysis for Today Review Questions: Categorical Variables
    Statistics 102 Regression Summary Spring, 2000 - 1 - Regression Summary Project Analysis for Today First multiple regression • Interpreting the location and wiring coefficient estimates • Interpreting interaction terms • Measuring significance Second multiple regression • Deciding how to extend a model • Diagnostics (leverage plots, residual plots) Review Questions: Categorical Variables Where do those terms in a categorical regression come from? • You cannot use a categorical term directly in regression (e.g. 2(“Yes”)=?). • JMP converts each categorical variable into a collection of numerical variables that represent the information in the categorical variable, but are numerical and so can be used in regression. • These special variables (a.k.a., dummy variables) use only the numbers +1, 0, and –1. • A categorical variable with k categories requires (k-1) of these special numerical variables. Thus, adding a categorical variable with, for example, 5 categories adds 4 of these numerical variables to the model. How do I use the various tests in regression? What question are you trying to answer… • Does this predictor add significantly to my model, improving the fit beyond that obtained with the other predictors? (t-ratio, CI, p-value) • Does this collection of predictors add significantly to my model? Partial-F (Note: the only time you need partial F is when working with categorical variables that define 3 or more categories. In these cases, JMP shows you the partial-F as an “Effect Test”.) • Does my full model explain “more than random variation”? Use the F-ratio from the Anova summary table. Statistics 102 Regression Summary Spring, 2000 - 2 - How do I interpret JMP output with categorical variables? Term Estimate Std Error t Ratio Prob>|t| Intercept 179.59 5.62 32.0 0.00 Run Size 0.23 0.02 9.5 0.00 Manager[a-c] 22.94 7.76 3.0 0.00 Manager[b-c] 6.90 8.73 0.8 0.43 Manager[a-c]*Run Size 0.07 0.04 2.1 0.04 Manager[b-c]*Run Size -0.10 0.04 -2.6 0.01 • Brackets denote the JMP’s version of dummy variables.
    [Show full text]
  • Linear Mixed-Effects Modeling in SPSS: an Introduction to the MIXED Procedure
    Technical report Linear Mixed-Effects Modeling in SPSS: An Introduction to the MIXED Procedure Table of contents Introduction. 1 Data preparation for MIXED . 1 Fitting fixed-effects models . 4 Fitting simple mixed-effects models . 7 Fitting mixed-effects models . 13 Multilevel analysis . 16 Custom hypothesis tests . 18 Covariance structure selection. 19 Random coefficient models . 20 Estimated marginal means. 25 References . 28 About SPSS Inc. 28 SPSS is a registered trademark and the other SPSS products named are trademarks of SPSS Inc. All other names are trademarks of their respective owners. © 2005 SPSS Inc. All rights reserved. LMEMWP-0305 Introduction The linear mixed-effects models (MIXED) procedure in SPSS enables you to fit linear mixed-effects models to data sampled from normal distributions. Recent texts, such as those by McCulloch and Searle (2000) and Verbeke and Molenberghs (2000), comprehensively review mixed-effects models. The MIXED procedure fits models more general than those of the general linear model (GLM) procedure and it encompasses all models in the variance components (VARCOMP) procedure. This report illustrates the types of models that MIXED handles. We begin with an explanation of simple models that can be fitted using GLM and VARCOMP, to show how they are translated into MIXED. We then proceed to fit models that are unique to MIXED. The major capabilities that differentiate MIXED from GLM are that MIXED handles correlated data and unequal variances. Correlated data are very common in such situations as repeated measurements of survey respondents or experimental subjects. MIXED extends repeated measures models in GLM to allow an unequal number of repetitions.
    [Show full text]
  • Application of Random-Effects Probit Regression Models
    Journal of Consulting and Clinical Psychology Copyright 1994 by the American Psychological Association, Inc. 1994, Vol. 62, No. 2, 285-296 0022-006X/94/S3.00 Application of Random-Effects Probit Regression Models Robert D. Gibbons and Donald Hedeker A random-effects probit model is developed for the case in which the outcome of interest is a series of correlated binary responses. These responses can be obtained as the product of a longitudinal response process where an individual is repeatedly classified on a binary outcome variable (e.g., sick or well on occasion t), or in "multilevel" or "clustered" problems in which individuals within groups (e.g., firms, classes, families, or clinics) are considered to share characteristics that produce similar responses. Both examples produce potentially correlated binary responses and modeling these per- son- or cluster-specific effects is required. The general model permits analysis at both the level of the individual and cluster and at the level at which experimental manipulations are applied (e.g., treat- ment group). The model provides maximum likelihood estimates for time-varying and time-invari- ant covariates in the longitudinal case and covariates which vary at the level of the individual and at the cluster level for multilevel problems. A similar number of individuals within clusters or number of measurement occasions within individuals is not required. Empirical Bayesian estimates of per- son-specific trends or cluster-specific effects are provided. Models are illustrated with data from
    [Show full text]
  • 10.1 Topic 10. ANOVA Models for Random and Mixed Effects References
    10.1 Topic 10. ANOVA models for random and mixed effects References: ST&DT: Topic 7.5 p.152-153, Topic 9.9 p. 225-227, Topic 15.5 379-384. There is a good discussion in SAS System for Linear Models, 3rd ed. pages 191-198. Note: do not use the rules for expected MS on ST&D page 381. We provide in the notes below updated rules from Chapter 8 from Montgomery D.C. (1991) “Design and analysis of experiments”. 10. 1. Introduction The experiments discussed in previous chapters have dealt primarily with situations in which the experimenter is concerned with making comparisons among specific factor levels. There are other types of experiments, however, in which one wish to determine the sources of variation in a system rather than make particular comparisons. One may also wish to extend the conclusions beyond the set of specific factor levels included in the experiment. The purpose of this chapter is to introduce analysis of variance models that are appropriate to these different kinds of experimental objectives. 10. 2. Fixed and Random Models in One-way Classification Experiments 10. 2. 1. Fixed-effects model A typical fixed-effect analysis of an experiment involves comparing treatment means to one another in an attempt to detect differences. For example, four different forms of nitrogen fertilizer are compared in a CRD with five replications. The analysis of variance takes the following familiar form: Source df Total 19 Treatment (i.e. among groups) 3 Error (i.e. within groups) 16 And the linear model for this experiment is: Yij = µ + i + ij.
    [Show full text]
  • Using Survey Data Author: Jen Buckley and Sarah King-Hele Updated: August 2015 Version: 1
    ukdataservice.ac.uk Using survey data Author: Jen Buckley and Sarah King-Hele Updated: August 2015 Version: 1 Acknowledgement/Citation These pages are based on the following workbook, funded by the Economic and Social Research Council (ESRC). Williamson, Lee, Mark Brown, Jo Wathan, Vanessa Higgins (2013) Secondary Analysis for Social Scientists; Analysing the fear of crime using the British Crime Survey. Updated version by Sarah King-Hele. Centre for Census and Survey Research We are happy for our materials to be used and copied but request that users should: • link to our original materials instead of re-mounting our materials on your website • cite this as an original source as follows: Buckley, Jen and Sarah King-Hele (2015). Using survey data. UK Data Service, University of Essex and University of Manchester. UK Data Service – Using survey data Contents 1. Introduction 3 2. Before you start 4 2.1. Research topic and questions 4 2.2. Survey data and secondary analysis 5 2.3. Concepts and measurement 6 2.4. Change over time 8 2.5. Worksheets 9 3. Find data 10 3.1. Survey microdata 10 3.2. UK Data Service 12 3.3. Other ways to find data 14 3.4. Evaluating data 15 3.5. Tables and reports 17 3.6. Worksheets 18 4. Get started with survey data 19 4.1. Registration and access conditions 19 4.2. Download 20 4.3. Statistics packages 21 4.4. Survey weights 22 4.5. Worksheets 24 5. Data analysis 25 5.1. Types of variables 25 5.2. Variable distributions 27 5.3.
    [Show full text]
  • Analysis of Variance with Categorical and Continuous Factors: Beware the Landmines R. C. Gardner Department of Psychology Someti
    Analysis of Variance with Categorical and Continuous Factors: Beware the Landmines R. C. Gardner Department of Psychology Sometimes researchers want to perform an analysis of variance where one or more of the factors is a continuous variable and the others are categorical, and they are advised to use multiple regression to perform the task. The intent of this article is to outline the various ways in which this is normally done, to highlight the decisions with which the researcher is faced, and to warn that the various decisions have distinct implications when it comes to interpretation. This first point to emphasize is that when performing this type of analysis, there are no means to be discussed. Instead, the statistics of interest are intercepts and slopes. More on this later. Models. To begin, there are a number of approaches that one can follow, and each of them refers to a different model. Of primary importance, each model tests a somewhat different hypothesis, and the researcher should be aware of precisely which hypothesis is being tested. The three most common models are: Model I. This is the unique sums of squares approach where the effects of each predictor is assessed in terms of what it adds to the other predictors. It is sometimes referred to as the regression approach, and is identified as SSTYPE3 in GLM. Where a categorical factor consists of more than two levels (i.e., more than one coded vector to define the factor), it would be assessed in terms of the F-ratio for change when those levels are added to all other effects in the model.
    [Show full text]
  • Chapter 17 Random Effects and Mixed Effects Models
    Chapter 17 Random Effects and Mixed Effects Models Example. Solutions of alcohol are used for calibrating Breathalyzers. The following data show the alcohol concentrations of samples of alcohol solutions taken from six bottles of alcohol solution randomly selected from a large batch. The objective is to determine if all bottles in the batch have the same alcohol concentrations. Bottle Concentration 1 1.4357 1.4348 1.4336 1.4309 2 1.4244 1.4232 1.4213 1.4256 3 1.4153 1.4137 1.4176 1.4164 4 1.4331 1.4325 1.4312 1.4297 5 1.4252 1.4261 1.4293 1.4272 6 1.4179 1.4217 1.4191 1.4204 We are not interested in any difference between the six bottles used in the experiment. We therefore treat the six bottles as a random sample from the population and use a random effects model. If we were interested in the six bottles, we would use a fixed effects model. 17.3. One Random Factor 17.3.1. The Random‐Effects One‐way Model For a completely randomized design, with v treatments of a factor T, the one‐way random effects model is , :0, , :0, , all random variables are independent, i=1, ..., v, t=1, ..., . Note now , and are called variance components. Our interest is to investigate the variation of treatment effects. In a fixed effect model, we can test whether treatment effects are equal. In a random effects model, instead of testing : ..., against : treatment effects not all equal, we use : 0,: 0. If =0, then all random effects are 0.
    [Show full text]
  • A Computationally Efficient Method for Selecting a Split Questionnaire Design
    Iowa State University Capstones, Theses and Creative Components Dissertations Spring 2019 A Computationally Efficient Method for Selecting a Split Questionnaire Design Matthew Stuart Follow this and additional works at: https://lib.dr.iastate.edu/creativecomponents Part of the Social Statistics Commons Recommended Citation Stuart, Matthew, "A Computationally Efficient Method for Selecting a Split Questionnaire Design" (2019). Creative Components. 252. https://lib.dr.iastate.edu/creativecomponents/252 This Creative Component is brought to you for free and open access by the Iowa State University Capstones, Theses and Dissertations at Iowa State University Digital Repository. It has been accepted for inclusion in Creative Components by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. A Computationally Efficient Method for Selecting a Split Questionnaire Design Matthew Stuart1, Cindy Yu1,∗ Department of Statistics Iowa State University Ames, IA 50011 Abstract Split questionnaire design (SQD) is a relatively new survey tool to reduce response burden and increase the quality of responses. Among a set of possible SQD choices, a design is considered as the best if it leads to the least amount of information loss quantified by the Kullback-Leibler divergence (KLD) distance. However, the calculation of the KLD distance requires computation of the distribution function for the observed data after integrating out all the missing variables in a particular SQD. For a typical survey questionnaire with a large number of categorical variables, this computation can become practically infeasible. Motivated by the Horvitz-Thompson estima- tor, we propose an approach to approximate the distribution function of the observed in much reduced computation time and lose little valuable information when comparing different choices of SQDs.
    [Show full text]
  • Describing Data: the Big Picture Descriptive Statistics Community
    The Big Picture Describing Data: Categorical and Quantitative Variables Population Sampling Sample Statistical Inference Exploratory Data Analysis Descriptive Statistics Community Coalitions (n = 175) In order to make sense of data, we need ways to summarize and visualize it. Summarizing and visualizing variables and relationships between two variables is often known as exploratory data analysis (also known as descriptive statistics). The type of summary statistics and visualization methods to use depends on the type of variables being analyzed (i.e., categorical or quantitative). One Categorical Variable Frequency Table “What is your race/ethnicity?” A frequency table shows the number of cases that fall into each category: White Black “What is your race/ethnicity?” Hispanic Asian Other White Black Hispanic Asian Other Total 111 29 29 2 4 175 Display the number or proportion of cases that fall into each category. 1 Proportion Proportion The sample proportion (̂) of directors in each category is White Black Hispanic Asian Other Total 111 29 29 2 4 175 number of cases in category pˆ The sample proportion of directors who are white is: total number of cases 111 ̂ .63 63% 175 Proportion and percent can be used interchangeably. Relative Frequency Table Bar Chart A relative frequency table shows the proportion of cases that In a bar chart, the height of the bar corresponds to the fall in each category. number of cases that fall into each category. 120 111 White Black Hispanic Asian Other 100 .63 .17 .17 .01 .02 80 60 40 All the numbers in a relative frequency table sum to 1.
    [Show full text]
  • Collecting Data
    Statistics deal with the collection, presentation, analysis and interpretation of data.Insurance (of people and property), which now dominates many aspects of our lives, utilises statistical methodology. Social scientists, psychologists, pollsters, medical researchers, governments and many others use statistical methodology to study behaviours of populations. You need to know the following statistical terms. A variable is a characteristic of interest in each element of the sample or population. For example, we may be interested in the age of each of the seven dwarfs. An observation is the value of a variable for one particular element of the sample or population, for example, the age of the dwarf called Bashful (= 619 years). A data set is all the observations of a particular variable for the elements of the sample, for example, a complete list of the ages of the seven dwarfs {685, 702,498,539,402,685, 619}. Collecting data Census The population is the complete set of data under consideration. For example, a population may be all the females in Ireland between the ages of 12 and 18, all the sixth year students in your school or the number of red cars in Ireland. A census is a collection of data relating to a population. A list of every item in a population is called a sampling frame. Sample A sample is a small part of the population selected. A random sample is a sample in which every member of the population has an equal chance of being selected. Data gathered from a sample are called statistics. Conclusions drawn from a sample can then be applied to the whole population (this is called statistical inference).
    [Show full text]
  • Questionnaire Analysis Using SPSS
    Questionnaire design and analysing the data using SPSS page 1 Questionnaire design. For each decision you make when designing a questionnaire there is likely to be a list of points for and against just as there is for deciding on a questionnaire as the data gathering vehicle in the first place. Before designing the questionnaire the initial driver for its design has to be the research question, what are you trying to find out. After that is established you can address the issues of how best to do it. An early decision will be to choose the method that your survey will be administered by, i.e. how it will you inflict it on your subjects. There are typically two underlying methods for conducting your survey; self-administered and interviewer administered. A self-administered survey is more adaptable in some respects, it can be written e.g. a paper questionnaire or sent by mail, email, or conducted electronically on the internet. Surveys administered by an interviewer can be done in person or over the phone, with the interviewer recording results on paper or directly onto a PC. Deciding on which is the best for you will depend upon your question and the target population. For example, if questions are personal then self-administered surveys can be a good choice. Self-administered surveys reduce the chance of bias sneaking in via the interviewer but at the expense of having the interviewer available to explain the questions. The hints and tips below about questionnaire design draw heavily on two excellent resources. SPSS Survey Tips, SPSS Inc (2008) and Guide to the Design of Questionnaires, The University of Leeds (1996).
    [Show full text]
  • Which Statistical Test
    Which Statistical Test Using this aid Understand the definition of terms used in the aid. Start from Data in Figure 1 and go towards the right to select the test you want depending on the data you have. Then go to the relevant instructions (S1, S2, S3, S4, S5, S6, S7, S8, S9 or S10) to perform the test in SPSS. The tests are mentioned below. S1: Normality Test S2: Parametric One-Way ANOVA Test S3: Nonparametric Tests for Several Independent Variables S4: General Linear Model Univariate Analysis S5: Parametric Paired Samples T Test S6: Nonparametric Two Related Samples Test S7: Parametric Independent Samples T Tests S8: Nonparametric Two Independent Samples T Tests S9: One-Sample T Test S10: Crosstabulation (Chi-Square) Test Definition of Terms Data type is simply the various ways that you use numbers to collect data for analysis. For example in nominal data you assign numbers to certain words e.g. 1=male and 2=female. Or rock can be classified as 1=sedimentary, 2=metamorphic or 3=igneous. The numbers are just labels and have no real meaning. The order as well does not matter. For ordinal data the order matters as they describe order e.g. 1st, 2nd 3rd. They can also be words such as ‘bad’, ‘medium’, and ‘good’. Both nominal and ordinal data are also refer to as categorical data. Continuous data refers to quantitative measurement such as age, salary, temperature, weight, height. It is good to understand type of data as they influence the type of analysis that you can do.
    [Show full text]