Learn Before
Adding Training Data Does Not Help Much When Training Error Is High
When high avoidable bias is the problem, adding more training data is not a helpful technique because it helps variance problems but usually has no significant effect on bias. One should first improve training-set performance before expecting dev/test performance to improve.
0
1
References
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Machine Learning Yearning @ DeepLearning.AI
Related
Adding Training Data Does Not Help Much When Training Error Is High
Classifier with Low Bias and Low Variance Is Doing Well
Negative Avoidable Bias Indicates Training Set Overfitting
Informal Bias and Variance Definitions Differ from Statisticians' Definitions
Training Set Performance Comes Before Dev/Test Performance
What does an algorithm's bias informally measure according to Machine Learning Yearning?
True or False: In Machine Learning Yearning, an algorithm's informal bias is defined as its error rate on the dev/test set.
Informally, an algorithm's _____ is its error rate on the training set when the training set is very large.
What does 'bias' informally refer to according to Machine Learning Yearning?
Bias is informally defined as the algorithm's error rate on the training set.
Informally, an algorithm's _____ is its error rate on the training set.
Match each term to its description in Machine Learning Yearning's bias/variance framework.
Order the steps to correctly estimate an algorithm's bias according to Machine Learning Yearning.
Why does Machine Learning Yearning qualify bias as the training error rate on a 'very large' training set?
In Machine Learning Yearning, bias is defined as the algorithm's error rate on the dev or test set.
Roughly, the bias is the error rate of your algorithm on your _____ set when you have a very large training set.
Match each aspect of the bias definition to what it represents in Machine Learning Yearning.
Order the reasoning steps to determine whether bias is the primary error source in an underperforming ML algorithm.
Explain the Relationship Between Informal Bias and Training Set Size
Evaluating Training Error as a Measure of Bias
Distinguishing the Informal Definition of Bias
Learn After
Your cat recognizer has 15% training error and 16% dev error, but your target is 5% error. What should you do first?
True or False: Adding more training data is an effective technique for reducing high training error caused by high avoidable bias.
Adding more training data is a technique that helps with _____ problems, but it usually has no significant effect on bias.
Your training error is 15% and your target is 5%. What should you focus on first?
True or False: Adding more training data is an effective technique for reducing high avoidable bias.
Adding more training data helps with _____ problems but usually has no significant effect on bias.
Match each scenario or technique to its correct description regarding bias and variance.
Order the steps for correctly diagnosing and responding to a model with high training error.
Training error is 15%, dev error is 16%, and target is 5%. What does this pattern primarily indicate?
True or False: You should expect significant dev/test improvement even if training error remains high after adding more data.
When training error is high, first improve performance on the _____ set before expecting dev/test performance to improve.
Match each error pattern to the correct diagnosis and recommended action.
Order the decision-making steps a practitioner should follow when evaluating whether to add more training data.
Analyze why expanding the training dataset fails to resolve high training-set error.
Diagnosing a cat recognizer with high training and dev error rates.
Identify the primary system metric to improve before expecting dev/test set gains.