When dev and test sets share the same distribution and test performance is worse than dev performance, what does this clearly indicate?
0
1
References
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Related
Dev Set Should Reflect the Task to Improve Most
Same-Distribution Dev/Test Failure Indicates Dev Set Overfitting
Different Dev/Test Distributions Make Failure Diagnosis Ambiguous
Third-Party Benchmark Distribution Mismatch Increases Luck
When dev and test sets share the same distribution and test performance is worse than dev performance, what does this clearly indicate?
True or False: When dev and test sets come from different distributions, a performance gap between them has a single, unambiguous diagnosis.
When a system has overfit the dev set and both sets share the same distribution, the obvious cure is to get more _____ data.
Why should the dev set reflect the task a team wants to improve on the most?
If both sets share the same distribution and a model performs well on dev but poorly on test, the clear diagnosis is dev set overfitting.
When a model overfits the dev set and both sets share the same distribution, the obvious cure is to get more _____ data.
Match each dev/test set scenario to its consequence for model diagnosis.
Order the diagnostic steps when a model works well on the dev set but fails on the test set.
Which is a possible explanation for poor test performance when dev and test sets come from different distributions?
When dev and test sets come from different distributions, a system's failure on the test set provides an unambiguous diagnosis.
Once the dev and test sets are defined, a team will be focused on improving _____ set performance.
Match each concept related to dev/test distribution to its correct description.
Order the steps for selecting dev and test sets that support clear model evaluation.
Compare the diagnostics of poor test performance under same vs. different dev/test distributions.
Diagnosing a drop in test set performance with mismatched distributions.
Identify the diagnosis and cure for poor test performance when distributions match.