1Cademy - Analyze the warning signs and causes of a development set incorrectly ranking classifiers

Learn Before

Changing Dev/Test Sets or the Metric When They No Longer Guide the Team

Essay

Analyze the warning signs and causes of a development set incorrectly ranking classifiers

Question: Explain how a development set and evaluation metric can incorrectly rank a worse classifier higher than a superior one. In your response, describe the main warning sign of this issue, and detail the three possible causes outlined by Andrew Ng.

Sample answer: The main warning sign that the dev set and metric are not working correctly is when the dev set plus metric ranks one classifier higher, but the team thinks another classifier is superior for the product. The three main possible causes of this incorrect ranking are: 1) the actual distribution the team needs to do well on is different from the dev/test sets; 2) the team has overfit to the dev set; or 3) the evaluation metric is measuring something other than what the project needs to optimize.

Key points:

The warning sign is when the dev set and metric rank one classifier higher than one the team knows is superior.
Cause 1: The actual distribution differs from the dev/test set distribution.
Cause 2: The model has overfit to the dev set.
Cause 3: The metric measures something other than the project's optimization needs.

Rubric: The response must identify the key warning sign (the dev set plus metric ranks a classifier higher but the team believes a different one is superior for the product) and list the three causes: distribution mismatch, overfitting to the dev set, and a metric that does not optimize what the project needs.

Updated 2026-06-12

Contributors are:

Who are from:

References

Learn Before

Related