1Cademy - The File Drawer Problem in Null Hypothesis Testing

Learn Before

Arbitrary Nature of the 0.05 Significance Threshold

Concept

The File Drawer Problem in Null Hypothesis Testing

The rigid convention of statistical significance contributes to the 'file drawer problem,' where high-quality research that does not reach the $p < 0.05$ threshold remains unpublished. Because journals prioritize significant results, many studies with similar effect sizes are excluded from the scientific literature simply because they fell on the wrong side of an arbitrary threshold.

Updated 2026-05-16

Contributors are:

Who are from:

References

KPU Research Methods in Psychology - 4th American Edition

Learn After

In the context of null hypothesis testing, which of the following best describes the 'file drawer problem'?
According to the 'file drawer problem,' if two high-quality studies demonstrate similar effect sizes, the study that fails to reach the p < 0.05 significance threshold is less likely to be published.
A research department is evaluating the outcome of several studies. Match each project's statistical result to its most likely fate or consequence according to the file drawer problem.
Analyze the progression of a research topic in the scientific community. Arrange the following events in the sequence that demonstrates how the 'file drawer problem' creates a biased representation of a psychological effect in published literature.
The 'file drawer problem' in psychological research is a consequence of how journals select studies for publication. Arrange the following steps to describe the process that leads to this bias, starting with the initial research results.
In the context of null hypothesis testing, which of the following best describes the 'file drawer problem'?
A psychology researcher is investigating the impact of the 'file drawer problem' on the perceived effectiveness of a new behavioral therapy. Match each specific research scenario to the role it plays in this phenomenon.
In a research field heavily influenced by the file drawer problem, the cumulative evidence available in published journals will likely suggest that a psychological effect is more consistent and powerful than it actually is across all conducted studies.
A researcher evaluating the reliability of a psychological phenomenon discovers that journals in the field only publish results meeting the $p < 0.05$ threshold. To provide a rigorous critique, the researcher must recognize that the 'file drawer problem' likely leads to a(n) _____ of the phenomenon's true effect size in the published literature.
When evaluating the integrity of the scientific literature, a researcher must recognize that the 'file drawer problem' creates a/an _____ record because high-quality research that does not reach the $p < 0.05$ threshold is systematically excluded from publication.

Learn Before

Related

Learn After