Measures of Central Tendency and Measures of Dispersion
SLWK 381 Foundations of Social Work Research II Rachel C. Casey, MSW Agenda
● Review Homework #2 ● Learn about measures of central tendency ● Practice 1 ● Break ● Learn about measures of dispersion ● Practice 2 ● Instructions for Homework #3 Measures of Central Tendency
Measures of Central Tendency = Descriptive statistics that use a single number to summarize data level by describing what is typical in a distribution ● Mode ● Median ● Mean Levels of Measurement Chart
Level of measurement determines which measures of central tendency can be used
Nominal Ordinal Interval Ratio Mode X X X X Median X X X Mean X X Mode
● Appropriate for data at the nominal level and above ● Refers to the category with the largest frequency in the distribution ● Distributions can have more than one mode ○ Bimodal ○ Multimodal ● Does not provide information about variation in scores
How to find the mode: Identify the most frequently reported response Mode: Example
How to find the mode: Identify the most frequently reported response
TELEVISION SHOW ƒ The Bachelorette 4 Scandal 6 Mode = Scandal Brooklyn 99 4
Modern Family 3
Law & Order: SVU 3 Median
● Appropriate for data at the ordinal level and above ● Refers to the score that divides the distribution into two equal parts with half of the cases above it and half the cases below it ● Not affected by outlier scores
How to find the median: Arrange responses from lowest to highest If N is odd, the median is the number in the middle of the list If N is even, the median is the average of the two middle numbers Median: Example
How to find the median: Arrange responses from lowest to highest If N is odd, the median is the number in the middle of the list If N is even, the median is the average of the two middle numbers
Example 1: 1, 2, 3, 4, 5, 6, 7, 8, 9 Median = 5
Example 2: 10, 35, 27, 9, 18, 21 9, 10, 18, 21, 27, 35 (18+21)/2 = 19 Median = 19 Mean
● Appropriate for data at the interval/ratio level ● Refers to the arithmetic average obtained by adding up all the scores and dividing by the number of cases
“sigma” = summation “X bar” = sample mean
How to find the mean: X̄ = ∑X / N
“N” = number of cases Mean: Example
How to find the mean: X̄ = ∑X / N
Example 1: 1, 2, 3, 4, 5, 6, 7, 8, 9 X̄=(1+2+3+4+5+6+7+8+9)/ 9 Mean = 5 Example 2: 10, 35, 27, 9, 18, 21 X̄=(10+35+27+9+18+21)/ 6 Mean = 20 Practice 1 : Measures of Central Tendency
● Refer to the “raw data” in this Google Spreadsheet. ● For each variable in the data set, determine which measures of central tendency are appropriate to use and calculate them. ● Report your answers on this Google Form. Using SPSS to Calculate Measures of Central Tendency Mean: Influenced by Outliers
Outlier = a score that is very different from most others
Example: 10, 35, 27, 9, 18, 21, 578 X̄=(10+35+27+9+18+21+578)/ 7 Mean = 99.7
Sometimes, the researcher will make a judgment call to exclude outliers when calculating the mean 10, 35, 27, 9, 18, 21 X̄=(10+35+27+9+18+21)/ 6 Mean = 20 Measures of Dispersion
Measures of Central Tendency = Descriptive statistics that indicate how much variability exists in the distribution of a particular variable ● Range ● Interquartile Range ● Variance ● Standard Deviation Levels of Measurement Chart
Measures of dispersion can only be calculated for variables at the interval/ratio level
Nominal Ordinal Interval Ratio Range X X Variance X X Standard X X Deviation Range
● Appropriate for data at the interval/ratio level ● Refers to the total number of possible values between the minimum and maximum values in a distribution
How to find the range: Maximum Score - Minimum Score*
Example:
10, 35, 27, 9, 18, 21 35 - 9 = 26 Range = 26
*Rubin text identifies a slightly different formula for calculating the range, but this one is most common Range: Influenced by Outliers
● Like the mean, the range will be highly influenced by outliers
How to find the range: Maximum Score - Minimum Score
Example: Range = 569 10, 35, 27, 9, 18, 21, 578 578 - 9 = 569
10, 35, 27, 9, 18, 21 35 - 9 = 26 Range = 26 Interquartile Range
● Range based on intermediate scores, not extreme scores
○ Addresses issue of outliers ● Refers to the difference between the values of the upper quartile and lower quartile when the data is ordered and divided into four quartiles
For this class, you don’t need to know how to calculate the interquartile range. It is important to understand it as a concept. Variance
● Appropriate for data at the interval/ratio level ● Refers to the average variability in the distribution of a variable
○ In technical terms, refers to the average of the squared deviations from the mean
2 How to find the variance: Variance = ∑(X - X̄) / (N-1)
Don’t freak out! We’ll break this down! Calculating the Variance
2 How to find the variance: Variance = ∑(X - X̄) / (N-1)
2 X X̄ (X - X̄) (X - X̄) 9 20 -11 121 10 20 -10 100
18 20 -2 4 ∑(X - X̄)2 = 121+100+4+1+49+225 = 500 21 20 1 1 500/(N-1) = 500/(6-1) = 500/5 = 100 27 20 7 49 35 20 15 225 Variance = 100 Standard Deviation
● Appropriate for data at the interval/ratio level ● Refers to the average variability in the distribution of a variable measured in the same units of measurement as the original data
○ In technical terms, refers to square root of the variance
How to find the standard deviation: 2 Standard Deviation = [∑(X - X̄) / (N-1)] = Variance Calculating the Standard Deviation
2 How to find the standard deviation: [∑(X - X̄) / (N-1)]
X X̄ (X - X̄) (X - X̄)2 ∑(X - X̄)2 = 121+100+4+1+49+225 = 500 9 20 -11 121 10 20 -10 100 500/(N-1) = 500/(6-1) = 500/5 = 100
18 20 -2 4 100 = 10 21 20 1 1
27 20 7 49 Standard Deviation = 10
35 20 15 225 Why do we care about Standard Deviation?
● Standard deviation shows us how “spread out” the data is
○ Do the study participants have similar or varying experiences? ● The larger the standard deviation, the more variability in the data
Shafer, K., Fielding, B., & Wendt, D. (2017). Similarities and differences in the influence of paternal and maternal depression on adolescent well-being. Social Work Research, 41(2). Practice 2 : Measures of Dispersion
● Refer to the “raw data” in this Google Spreadsheet. ● Calculate the range, variance and standard deviation for the variables “Age” and “Weight.” ● Report your answers on this Google Form. Using SPSS to Calculate Measures of Dispersion