1Cademy - State the primary risk of fixing only misclassified dev set labels.

Learn Before

Bias from Fixing Labels of Only Misclassified Dev Examples

Short Answer

State the primary risk of fixing only misclassified dev set labels.

Question: What is the primary risk to your evaluation metrics if you only correct the labels of dev set examples that your system misclassified?

Sample answer: The primary risk is introducing bias into the dev set evaluation. Specifically, it will artificially inflate the measured performance of your system because you are only correcting errors in one direction (changing misclassified to correct) while ignoring errors that favor the model.

Key points:

It introduces bias into the dev set evaluation.
It artificially inflates the measured performance metrics of the system.

Rubric: The answer should state that it introduces evaluation bias, specifically an optimistic bias that artificially inflates the system's measured performance.

0

1

Updated 2026-07-02

Contributors are:

Who are from:

References

Machine Learning Yearning (Deeplearning.ai)

Learn Before

Related