The Sampling Distribution of the Mean
Total Page:16
File Type:pdf, Size:1020Kb
THE DISTRIBUTION OF SAMPLE MEANS
Inferential statistics:
Generalize from a sample to a population
Statistics vs. Parameters
Why?
Population not often possible
Limitation:
Sample won’t precisely reflect population
Samples from same population vary
“sampling variability”
Sampling error = discrepancy between sample statistic and population parameter
1 Extend z-scores and normal curve to SAMPLE MEANS rather than individual scores
How well will a sample describe a population?
What is probability of selecting a sample that has a certain mean?
Sample size will be critical
Larger samples are more representative
Larger samples = smaller error
2 THE DISTRIBUTION OF SAMPLE MEANS
Population of 4 scores: 2 4 6 8 = 5
4 random samples (n = 2):
X 1= 4 X 3 = 5
X 2 = 6 X 4 = 3
X is rarely exactly
Most X a little bigger or smaller than
Most X will cluster around
Extreme low or high values of X are relatively rare
With larger n, X s will cluster closer to µ (the DSM will have smaller error, smaller variance)
3 A Distribution of Sample Means
X = 4 X = 5 X = 6
Figure 7-3 (p. 205) The distribution of sample means for n = 2. This distribution shows the 16 sample means obtained by taking all possible random samples of size n=2 that can be drawn from the population of 4 scores (see Table 7.1 in text). The known population mean from which these samples were drawn is µ = 5.
4 THE DISTRIBUTION OF SAMPLE MEANS
A distribution of sample means (X ) All possible random samples of size n A distribution of a statistic (not raw scores)
“Sampling Distribution” of X
Probability of getting an X , given known and Important properties
(1) Mean (2) Standard Deviation (3) Shape
5 PROPERTIES OF THE DSM
Mean?
X =
Called expected value of X
X is an unbiased estimate of Standard Deviation?
Any X can be viewed as a deviation from
X = Standard Error of the Mean
X = n
Variability of X around
Special type of standard deviation, type of “error”
Average amount by which X deviates from
6 Less error = better, more reliable, estimate of population parameter
X influenced by two things:
(1) Sample size (n)
Larger n = smaller standard errors
Note: when n = 1 X =
as “starting point” for X,
X gets smaller as n increases
(2) Variability in population ()
Larger = larger standard errors
Note: X = M
7 Figure 7-7 (p.215) The distribution of sample means for random samples of size (a) n = 1, (b) n = 4, and (c) n = 100 obtained from a normal population with µ = 80 and σ = 20. Notice that the size of the standard error decreases as the sample size increases.
8 Shape of the DSM?
Central Limit = DSM will approach a normal dist’n Theorem as n approaches infinity
Very important!
True even when raw scores NOT normal!
True regardless of or
What about sample size?
(1) If raw scores ARE normal, any n will do
(2) If raw scores NOT normal, n must be “sufficiently large”
For most distributions n 30
9 Why are Sampling Distributions important?
Tells us probability of getting X , given &
Distribution of a STATISTIC rather than raw scores
Theoretical probability distribution
Critical for inferential statistics!
Allows us to estimate likelihood of making an error when generalizing from sample to popl’n
Standard error = variability due to chance
Allows us to estimate population parameters
Allows us to compare differences between sample means – due to chance or to experimental treatment?
Sampling distribution is the most fundamental concept underlying all statistical tests
10 WORKING WITH THE
DISTRIBUTION OF SAMPLE MEANS
. If we assume DSM is normal . If we know & . We can use Normal Curve & Unit Normal Table!
z = X x
Example #1: = 80 = 12
What is probability of getting X 86 if n = 9?
11 Example #1b: = 80 = 12
What if we change n =36
What is probability of getting X 86
12 Example #2:
= 80 = 12
What X marks the point beyond which sample means are likely to occur only 5% of the time? (n = 9)
13 Homework problems: Chapter 7: 3, 10, 11, 17
14