1Cademy - Detecting Eyeball Dev Set Overfitting by Comparing Performance Against the Blackbox Dev Set

Learn Before

Eyeball Dev Set

Concept

Detecting Eyeball Dev Set Overfitting by Comparing Performance Against the Blackbox Dev Set

Because one gains intuition about the examples in the Eyeball dev set while looking at them, one will start to overfit the Eyeball dev set faster. If performance on the Eyeball dev set improves much more rapidly than performance on the Blackbox dev set, the Eyeball dev set has been overfit. Explicitly splitting the dev set into Eyeball and Blackbox subsets allows one to tell when the manual error analysis process is causing overfitting of the Eyeball portion.

Updated 2026-06-17

Contributors are: