How many participants do we have to include in properly powered experiments? A tutorial of power analysis with reference tables

Hannah · July 22, 2019, 12:08pm

This might be useful for some of you. It’s a really good hands-on power-analysis tutorial for the most common designs in psychology and cognitive science, from simple t-tests up to more complex designs such as a 2x2 repeated-measures ANOVA, each also covered as a Bayesian analysis. It provides benchmarks for how large samples need to be for typical effect sizes, depending on the number of factors, the pattern of results you predict (for example, different interaction patterns), and the strength of the correlations between several within-subject measures. It definitely goes beyond the tutorials I have seen so far.

Abstract: Given that an effect size of d = .4 is a good first estimate of the smallest effect size of interest in psychological research, we already need over 50 participants for a simple comparison of two within-participants conditions if we want to run a study with 80% power. This is more than current practice. In addition, as soon as a between-groups variable or an interaction is involved, numbers of 100, 200, and even more participants are needed. As long as we do not accept these facts, we will keep on running underpowered studies with unclear results. Addressing the issue requires a change in the way research is evaluated by supervisors, examiners, reviewers, and editors. The present paper describes reference numbers needed for the designs most often used by psychologists, including single-variable between-groups and repeated-measures designs with two and three levels, two-factor designs involving two repeated-measures variables or one between-groups variable and one repeated-measures variable (split-plot design). The numbers are given for the traditional, frequentist analysis with p < .05 and Bayesian analysis with BF > 10. These numbers provide researchers with a standard to determine (and justify) the sample size of an upcoming study. The article also describes how researchers can improve the power of their study by including multiple observations per condition per participant.
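To get a feel for where numbers like these come from, here is a rough back-of-the-envelope sketch in Python. It is not the paper's method: it uses the textbook normal approximation for a two-sided test, and it assumes that d = .4 refers to the standardized mean of the difference scores in the within-participants case. The function names `n_paired` and `n_per_group` are made up for this illustration.

```python
from math import ceil
from statistics import NormalDist


def _z_total(alpha: float, power: float) -> float:
    """Sum of the critical z value (two-sided) and the z value for the target power."""
    z = NormalDist()
    return z.inv_cdf(1 - alpha / 2) + z.inv_cdf(power)


def n_paired(d: float, alpha: float = 0.05, power: float = 0.80) -> int:
    """Approximate number of participants for a paired (within-participants) comparison.

    Assumption: d is the standardized mean of the difference scores.
    """
    return ceil((_z_total(alpha, power) / d) ** 2)


def n_per_group(d: float, alpha: float = 0.05, power: float = 0.80) -> int:
    """Approximate number of participants PER GROUP for two independent groups."""
    return ceil(2 * (_z_total(alpha, power) / d) ** 2)


if __name__ == "__main__":
    print("within-participants:", n_paired(0.4))        # 50
    print("between-groups, per group:", n_per_group(0.4))  # 99
```

With d = .4, alpha = .05 and 80% power, this approximation gives about 50 participants for the within-participants comparison and about 99 per group (roughly 200 in total) for a between-groups comparison. The normal approximation slightly underestimates the sample size; a calculation based on the t distribution comes out a little higher, which fits the "over 50" in the abstract.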
