1Cademy - Dev Set Overfitting from Repeated Evaluation

Learn Before

Changing Dev/Test Sets or the Metric When They No Longer Guide the Team

Concept

Dev Set Overfitting from Repeated Evaluation

Repeatedly evaluating ideas on the dev set can cause an algorithm to gradually overfit to the dev set. If dev set performance is much better than test set performance when development is finished, that is a sign of dev-set overfitting. In that case, get a fresh dev set.

Updated 2026-06-17

Contributors are:

Who are from:

References