Learn Before
Concept

Sizing the Dev Set to Detect Meaningful Accuracy Changes

A dev set should be large enough to detect differences between the algorithms being tried. For example, a 100-example dev set would not be able to detect a 0.1 percentage-point accuracy difference such as 90.0% versus 90.1%. Dev sets of 1,000 to 10,000 examples are common, and a dev set could be much larger than 10,000 when teams want to detect even smaller improvements.

0

1

Updated 2026-05-25

Contributors are:

Who are from:

Tags

Machine Learning

Deep Learning

Supervised Learning

Dive into Deep Learning @ D2L

Data Science

Machine Learning Strategy

Related