Correlations & Confidence Intervals
Total Page:16
File Type:pdf, Size:1020Kb
M140 Introducing Statistics • This tutorial will begin at 7:30pm for about 70 minutes. Correlations & • I will be recording this presentation so please let me know if you have any questions or concerns about this. • Mics will be muted until the end of the tutorial, Confidence when I will also stop recording. • Feel free to use the Chat Box at any time! • I will email slides out after the tutorial and post Intervals them on my OU blog. • Things you might need today: Dr Jason Verrall • M140 Book 4 [email protected] • Calculator 07311 188 800 • Note-taking equipment • Drink of your choice Tutorials are enhanced by your interaction Please vote in the polls and ask questions! M140 Introducing Statistics Correlations & Confidence Intervals • Scatter Plots & Correlations • r values • Correlation & Causation • Confidence Intervals Computer Book has • Prediction Intervals detailed instructions! • Plus Minitab examples But you don’t need Minitab running today 2 Scatter Plots Scatter Plots & Correlations 1) Which is the positive relationship? A Variable 2 Variable Variable 1 B Variable 2 Variable Variable 1 3 Scatter Plots Scatter Plots & Correlations 1) Which is the positive relationship? A Variable 2 Variable Variable 1 B Variable 2 Variable Variable 1 4 Scatter Plots Scatter Plots & Correlations Scatterplots are notoriously tricky to interpret unless very clear, particularly the strength of any relationship 5 Scatter Plots Scatter Plots & Correlations Scatterplots are notoriously tricky to interpret unless very clear, particularly the strength of any relationship How can we improve this? 6 Scatter Plots Scatter Plots & Correlations We can use a measure of correlation to Scatterplots are notoriously tricky to objectively assess the direction and interpret unless very clear, particularly strength of any relationship between the strength of any relationship variables How can we improve this? 7 Scatter Plots Scatter Plots & Correlations We can use a measure of correlation to Scatterplots are notoriously tricky to objectively assess the direction and interpret unless very clear, particularly strength of any relationship between the strength of any relationship variables How can we improve this? Pearson product- moment correlation coefficient 8 Correlation Coefficient Book 4 pp.98-107 Pearson’s Correlation Coefficient, r 9 Correlation Coefficient Book 4 pp.98-107 Pearson’s Correlation Coefficient, r Covariance between X & Y variables 10 Correlation Coefficient Book 4 pp.98-107 Pearson’s Correlation Coefficient, r Covariance between X & Y variables Standard deviation of each variable 11 Correlation Coefficient Book 4 pp.98-107 Pearson’s Correlation Coefficient, r Covariance between X Difference between & Y variables each point and the variable mean Standard deviation of each variable 12 Correlation Coefficient Book 4 pp.98-107 Pearson’s Correlation Coefficient, r Covariance between X Difference between & Y variables each point and the variable mean Standard deviation of X Standard deviation of & Y (n is the same for each variable both variables and cancels out) 13 Correlation Coefficient Book 4 pp.98-107 Pearson’s Correlation Coefficient, r Covariance between X Difference between r is always between & Y variables each point and the -1 and 1 variable mean Standard deviation of X Standard deviation of & Y (n is the same for each variable both variables and cancels out) 14 Correlation Coefficient Book 4 pp.98-107 Worked Example of r pp. 104-105 15 Correlation Coefficient Book 4 pp.98-107 Worked Example of r pp. 104-105 Quantity Value S x 508 S y 355.6 S x2 37008.1 S y2 18600.62 S xy 26030.33 16 Correlation Coefficient Book 4 pp.98-107 Worked Example of r pp. 104-105 Quantity Value S x 508 S y 355.6 S x2 37008.1 S y2 18600.62 S xy 26030.33 17 Correlation Coefficient Book 4 pp.98-107 Worked Example of r pp. 104-105 Quantity Value S x 508 S y 355.6 S x2 37008.1 S y2 18600.62 S xy 26030.33 Quantity Value S (푥 − 푥̅) 141.814 S (푦 − 푦) 536.140 S(푥 − 푥̅)(푦 − 푦) 223.930 18 Correlation Coefficient Book 4 pp.98-107 Worked Example of r 2) What is r? Quantity Value S (푥 − 푥̅) 141.814 S (푦 − 푦) 536.140 S(푥 − 푥̅)(푦 − 푦) 223.930 19 Correlation Coefficient Book 4 pp.98-107 Worked Example of r 2) What is r? 223.93 223.93 푟 = = 141.814 × 536.14 275.74 = ퟎ. ퟖퟏퟐ Quantity Value S (푥 − 푥̅) 141.814 S (푦 − 푦) 536.140 S(푥 − 푥̅)(푦 − 푦) 223.930 20 Correlation Coefficient Book 4 pp.98-107 Worked Example of r 2) What is r? 223.93 223.93 푟 = = 141.814 × 536.14 275.74 = ퟎ. ퟖퟏퟐ Quantity Value S (푥 − 푥̅) 141.814 S (푦 − 푦) 536.140 S(푥 − 푥̅)(푦 − 푦) 223.930 21 Correlation Coefficient Samples of r 22 Correlation Coefficient Samples of r https://en.wikipedia.org/wiki/Pearson_correlation_coefficient#/media/File:Correlation_examples2.svg 23 Correlation Coefficient Samples of r 24 Correlation Coefficient Samples of r 25 Correlation Coefficient Samples of r 26 Correlation Coefficient Samples of r Pearson Correlation is best for linear relationships but can help with other relationships 27 Correlation Coefficient 3) For which graph will Samples of r correlation be informative about strength of relationship? A B 28 Correlation Coefficient 3) For which graph will Samples of r correlation be informative about strength of relationship? A Non-linear relationship and correlation unlikely to be helpful B Relationship is ‘more’ linear and correlation more likely to be informative (although not definitive) 29 Minitab Comp Bk pp. 81-91 Correlations in Minitab 30 Minitab Comp Bk pp. 81-91 Correlations in Minitab Stat → Basic Statistics → Correlation.. 1. Open Correlation dialogue box 31 Minitab Comp Bk pp. Correlations in Minitab Stat → Basic Statistics → Correlation.. 1. Open Correlation dialogue box 32 Minitab Comp Bk pp. 81-91 Correlations in Minitab Stat → Basic Statistics → Correlation.. 1. Open Correlation dialogue box 2. Select the variable pair to be analysed 33 Minitab Comp Bk pp. 81-91 Correlations in Minitab Stat → Basic Statistics → Correlation.. 1. Open Correlation dialogue box 2. Select the variable pair to be analysed 34 Minitab Comp Bk pp. 81-91 Correlations in Minitab Stat → Basic Statistics → Correlation.. 1. Open Correlation dialogue box 2. Select the variable pair to be analysed 3. Select Pearson in Options 35 Minitab Comp Bk pp. 81-91 Correlations in Minitab Stat → Basic Statistics → Correlation.. 1. Open Correlation dialogue box 2. Select the variable pair to be analysed 3. Select Pearson in Options 36 Minitab Comp Bk pp. 81-91 Correlations in Minitab Stat → Basic Statistics → Correlation.. 1. Open Correlation dialogue box 2. Select the variable pair to be analysed 3. Select Pearson in Options 4. Review output 37 Minitab Comp Bk pp. 81-91 Correlations in Minitab Stat → Basic Statistics → Correlation.. 1. Open Correlation dialogue box 2. Select the variable pair to be analysed 3. Select Pearson in Options 4. Review output 38 Correlation & Causation Does the Nose Cause the Tail? 39 Correlation & Causation Does the Nose Cause the Tail? 40 Correlation & Causation Does the Nose Cause the Tail? A strong relationship does not indicate that A causes B 41 Correlation & Causation Does the Nose Cause the Tail? A strong relationship does It only indicates that A and not indicate that A causes B are associated B 42 Correlation & Causation Does the Nose Cause the Tail? Further analysis is A strong relationship does necessary to confirm It only indicates that A and not indicate that A causes causation, including B are associated B considering the mechanism of action 43 Confidence Intervals Book 4 pp. 129-138 Confidence Intervals 44 Confidence Intervals Book 4 pp. 129-138 Confidence Intervals We use statistics such as the mean and median to summarise a group of data Sample mean = 푥̅ 45 Confidence Intervals Book 4 pp. 129-138 Confidence Intervals Population mean = 휇 We use statistics such as the mean and median to summarise a group of data Sample mean = 푥̅ However, we might want to calculate properties about a population of data which we don’t have 46 Confidence Intervals Book 4 pp. 129-138 Confidence Intervals Population mean = 휇 Point estimates We use statistics such as the mean and median to summarise a group of data Sample mean = 푥̅ However, we might want to calculate properties about a population of data which we don’t have 47 Confidence Intervals Book 4 pp. 129-138 Confidence Intervals Population mean = 휇 Point estimates We use statistics such as the mean and median to summarise a group of data Sample mean = 푥̅ However, we might want to calculate properties about a population of data which we don’t have We can use 푥̅ to estimate 휇 48 Confidence Intervals Book 4 pp. 129-138 Confidence Intervals Population mean = 휇 Point estimates We use statistics such as the mean and median to summarise a group of data Sample mean = 푥̅ However, we might want to calculate properties about a population of data which we don’t have How can the uncertainty about this We can use 푥̅ to estimate 휇 estimate be quantified? 49 Confidence Intervals Book 4 pp. 129-138 Confidence Intervals Population mean = 휇 Point estimates We use statistics such as the mean and median to summarise a group of data Sample mean = 푥̅ However, we might want to calculate properties about a population of data which we don’t have How can the Confidence Intervals! uncertainty about this We can use 푥̅ to estimate 휇 estimate be quantified? 50 Confidence Intervals Book 4 pp. 129-138 Confidence Intervals Population mean = 20 Sample mean = 15 51 Confidence Intervals Book 4 pp.