Statistical Tips for Interpreting Scientific Claims
Total Page:16
File Type:pdf, Size:1020Kb
Research Skills Seminar Series 2019 CAHS Research Education Program Statistical Tips for Interpreting Scientific Claims Mark Jones Statistician, Telethon Kids Institute 18 October 2019 Research Skills Seminar Series | CAHS Research Education Program Department of Child Health Research | Child and Adolescent Health Service ResearchEducationProgram.org © CAHS Research Education Program, Department of Child Health Research, Child and Adolescent Health Service, WA 2019 Copyright to this material produced by the CAHS Research Education Program, Department of Child Health Research, Child and Adolescent Health Service, Western Australia, under the provisions of the Copyright Act 1968 (C’wth Australia). Apart from any fair dealing for personal, academic, research or non-commercial use, no part may be reproduced without written permission. The Department of Child Health Research is under no obligation to grant this permission. Please acknowledge the CAHS Research Education Program, Department of Child Health Research, Child and Adolescent Health Service when reproducing or quoting material from this source. Statistical Tips for Interpreting Scientific Claims CONTENTS: 1 PRESENTATION ............................................................................................................................... 1 2 ARTICLE: TWENTY TIPS FOR INTERPRETING SCIENTIFIC CLAIMS, SUTHERLAND, SPIEGELHALTER & BURGMAN, 2013 .................................................................................................................................. 15 3 ADDITIONAL RESOURCES – STATISTICAL TIPS FOR INTERPRETING SCIENTIFIC CLAIMS ..................... 18 3.1 STATISTICAL BIAS ............................................................................................................................. 18 3.2 CORRELATIONS ................................................................................................................................ 18 3.3 COGNITIVE BIAS ............................................................................................................................... 19 3.4 REFERENCES ..................................................................................................................................... 19 RESEARCH SKILLS SEMINAR SERIES 2019 CAHS Research Education Program Statistical Tips for Interpreting Scientific Claims Statistical Tips for Interpreting Scientific Claims Mark Jones|Biostatistician, Telethon Kids Institute [email protected] On behalf of Dr Julie Marsh Snr Research Fellow, Telethon Kids Institute Research Skills Seminar Series | CAHS Research Education Program Department of Child Health Research | Child and Adolescent Health Service Sutherland, Spiegelhalter and Burgman (2013). Twenty tips for interpreting scientific claims. Nature 503 (335) ResearchEducationProgram.org 2 The 20 Tips The 20 Tips 1. Differences and Chance Cause Variation 13. Significance is Significant 1. Differences and Chance Cause Variation 13. Significance is Significant 2. No Measurement is Exact 14. Separate No Effect from Non‐Significance 2. No Measurement is Exact 14. Separate No Effect from Non‐Significance 3. Bias is Rife 15. Effect Size Matters 3. Bias is Rife 15. Effect Size Matters 4. Bigger is usually Better for Sample Size 16. Study Relevance Limits Generalisation 4. Bigger is usually Better for Sample Size 16. Study Relevance Limits Generalisation 5. Correlation Does Not Imply Causation 17. Feelings Influence Risk Perception 5. Correlation Does Not Imply Causation 17. Feelings Influence Risk Perception 6. Regression to the Mean Can Mislead 18. Dependencies change the risks 6. Regression to the Mean Can Mislead 18. Dependencies change the risks 7. Extrapolating Beyond the Data is Risky 19. Data can be dredged, or cherry picked. 7. Extrapolating Beyond the Data is Risky 19. Data can be dredged, or cherry picked. 8. Beware the Base‐Rate Fallacy 20. Extreme measurements may mislead. 8. Beware the Base‐Rate Fallacy 20. Extreme measurements may mislead. 9. Controls are Important 9. Controls are Important 10. Randomisation minimises bias 10. Randomisation minimises bias Study Design 11. Seek Replication 11. Seek Replication Statistical Insights 12. Scientists are Human 12. Scientists are Human Human Factors 3 4 1 Randomisation (minimises bias) Randomization avoids bias EXPERIMENTAL UNIT ‐ Population smallest ‘thing’ to which Part 1: Study Design Treatment Group we allocate treatments. Treatment Randomised Allocation Placebo Simple Random Control Group Sample Source: Final Fantasy VII 5 Replication Controls are Important Seek Replication, not pseudoreplication “Results consistent across many studies, replicated on independent populations, are more likely to be solid.” “The results of several such experiments may be combined in a meta‐analysis to provide an overarching view of the topic with potentially much greater statistical power than any of the individual studies.” 7 8 2 Generalisability (aka external validity) Generalisability (aka external validity) Study relevance limits generalizations Study relevance limits generalizations “At its best a trial shows what can be accomplished with a medicine under careful observation and certain restricted conditions. The same results will not invariably or necessarily be observed when the medicine passes into general use.” Austin Bradford Hill, 1984 9 10 Sample Size (adopt the goldilocks principle) Bigger is usually better for sample size Part 2: Statistical Insights Average efficacy can be more reliably and accurately estimated from a study with hundreds of participants than from a study with only a few participants. Basic analyses often assume that participants are homogenous. 12 11 3 Sample Size (adopt the goldilocks principle) Differences and chance cause variation Bigger is usually better for sample size “The main challenge of research is teasing apart the importance of the process of interest from the innumerable other sources of variation.” too small too big just right 13 14 Differences and chance cause variation Differences and chance cause variation “The main challenge of research is teasing apart the importance of the process of “The main challenge of research is teasing apart the importance of the process of interest from the innumerable other sources of variation.” interest from the innumerable other sources of variation.” 15 16 4 Hypotheses, Effects and Significance Hypotheses, Effects and Significance Significance is significant, Separate no effect from non‐significance, Effect size matters Significance is significant, Separate no effect from non‐significance, Effect size matters Hypothesis Test is a statistical technique to assist with answering questions or Hypothesis Test is a statistical technique to assist with answering questions or claims about population parameters by sampling from the population of interest. claims about population parameters by sampling from the population of interest. P‐values give the probability that an observed association could have arisen solely P‐values give the probability that an observed association could have arisen solely as a result of chance bias in sampling, given the null hypothesis –it is dependent as a result of chance bias in sampling, given the null hypothesis –it is dependent on sample size and variability. on sample size and variability. p‐value=0.01 means there is a 1‐in‐100 probability that what looks like an effect of p‐value=0.01 means there is a 1‐in‐100 probability that what looks like an effect of the treatment could have occurred randomly, and in truth there was no effect at the treatment could have occurred randomly, and in truth there was no effect at all. all. 17 18 Hypotheses, Effects and Significance Hypotheses, Effects and Significance Significance is significant, Separate no effect from non‐significance, Effect size matters Significance is significant, Separate no effect from non‐significance, Effect size matters Hypothesis Test is a statistical technique to assist with answering questions or Hypothesis Test is a statistical technique to assist with answering questions or claims about population parameters by sampling from the population of interest. claims about population parameters by sampling from the population of interest. P‐values give the probability that an observed association could have arisen solely P‐values give the probability that an observed association could have arisen solely as a result of chance bias in sampling, given the null hypothesis –it is dependent as a result of chance bias in sampling, given the null hypothesis –it is dependent on sample size and variability. on sample size and variability. p‐value=0.01 means there is a 1‐in‐100 probability that what looks like an effect of p‐value=0.01 means there is a 1‐in‐100 probability that what looks like an effect of the treatment could have occurred randomly, and in truth there was no effect at the treatment could have occurred randomly, and in truth there was no effect at all. all. 19 20 5 Hypotheses, Effects and Significance Hypotheses, Effects and Significance Significance is significant, Separate no effect from non‐significance, Effect size matters Significance is significant, Separate no effect from non‐significance, Effect size matters 1. P‐values can indicate how incompatible the data are with a specified statistical model. 2. P‐values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone. 3. Scientific conclusions and business