1Cademy - Adding Training Data Does Not Help Much When Training Error Is High

Learn Before

Bias (Informal Definition)

Concept

Adding Training Data Does Not Help Much When Training Error Is High

When high avoidable bias is the problem, adding more training data is not a helpful technique because it helps variance problems but usually has no significant effect on bias. One should first improve training-set performance before expecting dev/test performance to improve.

Updated 2026-05-26

Contributors are:

Who are from:

References

Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)

Learn After

Your cat recognizer has 15% training error and 16% dev error, but your target is 5% error. What should you do first?
True or False: Adding more training data is an effective technique for reducing high training error caused by high avoidable bias.
Adding more training data is a technique that helps with _____ problems, but it usually has no significant effect on bias.
Your training error is 15% and your target is 5%. What should you focus on first?
True or False: Adding more training data is an effective technique for reducing high avoidable bias.
Adding more training data helps with _____ problems but usually has no significant effect on bias.
Match each scenario or technique to its correct description regarding bias and variance.
Order the steps for correctly diagnosing and responding to a model with high training error.
Training error is 15%, dev error is 16%, and target is 5%. What does this pattern primarily indicate?
True or False: You should expect significant dev/test improvement even if training error remains high after adding more data.
When training error is high, first improve performance on the _____ set before expecting dev/test performance to improve.
Match each error pattern to the correct diagnosis and recommended action.
Order the decision-making steps a practitioner should follow when evaluating whether to add more training data.
Analyze why expanding the training dataset fails to resolve high training-set error.
Diagnosing a cat recognizer with high training and dev error rates.
Identify the primary system metric to improve before expecting dev/test set gains.

Learn Before

Related

Learn After