Appendix D: The Replication Crisis Timeline
Key events in the discovery, investigation, and reform of psychology's methodological problems.
| Year | Event | Significance |
|---|---|---|
| 1979 | Rosenthal coins "file drawer problem" | Names the problem of unpublished null results and proposes the fail-safe N for gauging its severity |
| 1998 | Baumeister publishes ego depletion study | Launches a 200+ study program that later fails to replicate |
| 2005 | Ioannidis: "Why Most Published Research Findings Are False" | Meta-science paper (not psychology-specific) raising alarm about false positive rates |
| 2008 | Pashler et al. review learning styles | Concludes no adequate evidence for the meshing hypothesis |
| 2010 | Bem's precognition paper accepted at JPSP | A flagship journal accepts "evidence" for psychic powers obtained with standard methods |
| 2011 | Bem publishes "Feeling the Future" | The paper that catalyzed the crisis — standard methods producing impossible results |
| 2011 | Simmons, Nelson, & Simonsohn: "False-Positive Psychology" | Demonstrates how flexible analysis can produce evidence for anything, e.g., that listening to a Beatles song makes you younger |
| 2011 | Diederik Stapel fraud revealed | Prominent social psychologist fabricated data across dozens of papers |
| 2012 | Doyen et al. fail to replicate elderly priming | First high-profile replication failure of a landmark social psychology finding |
| 2013 | Nosek and Spies found the Center for Open Science | Institutional home for pre-registration and open science infrastructure |
| 2013 | Open Science Framework (OSF) launches publicly | Platform for pre-registration and open data |
| 2013 | Shanks et al.: 9 experiments fail to replicate professor priming | Systematic failure to reproduce a flagship social priming effect |
| 2014 | Carter & McCullough: ego depletion publication bias | Meta-analysis showing the ego depletion literature was inflated by bias |
| 2014 | Many Labs 1 published | Multi-lab replication of 13 classic findings; 10 replicate, but the priming effects tested do not |
| 2015 | Ranehill et al. fail to replicate power posing | 200-participant study finds no hormonal or behavioral effects |
| 2015 | Open Science Collaboration: "Estimating the Reproducibility of Psychological Science" | Only 36% of 100 replication attempts yield significant results; the landmark paper |
| 2016 | Hagger et al.: ego depletion RRR | 23 labs, 2,141 participants, d = 0.04 — the flagship finding collapses |
| 2016 | Carney publicly retracts support for power posing | First author of original paper says she no longer believes the effect is real |
| 2016 | FTC fines Lumosity $2 million | Federal action against brain training claims |
| 2018 | Watts, Duncan, & Quan: marshmallow test replication | Effect substantially reduced after controlling for family background and early cognitive ability |
| 2018 | Sisk et al.: growth mindset meta-analyses | Small effects (correlation r = 0.10; intervention d = 0.08) |
| 2018 | Many Labs 2 published | 28 findings tested in large multi-lab samples; roughly half replicate |
| 2019 | Yeager et al.: national growth mindset study | d = 0.03 overall — published in Nature |
| 2020 | Many Labs 5 published | Revised, expert-vetted protocols fail to rescue findings that did not replicate in the 2015 project |
| 2020s | Pre-registration becomes increasingly standard | Hypotheses and analysis plans are fixed before data collection, curbing flexible analysis |
| 2022 | Moncrieff et al.: serotonin hypothesis umbrella review | No consistent evidence for the serotonin theory of depression |
| 2023+ | Registered Reports adopted by 300+ journals | The reform movement reaches critical mass |
What the Timeline Shows
Phase 1 (1979–2010): The Problem Accumulates. Publication bias, small samples, and flexible analysis produce a literature that overrepresents positive findings.
Phase 2 (2011–2015): The Crisis Erupts. Bem's paper, the Stapel fraud, and the OSC replication project reveal the scale of the problem.
Phase 3 (2016–present): Reform and Recovery. Pre-registration, Registered Reports, open data, and large-scale replications become standard. The field is producing more reliable new knowledge while reckoning with the unreliability of the legacy literature.
The crisis is not over. The legacy literature remains biased. But the direction is clearly toward better methods and more reliable science.