Dev Set Overfitting from Repeated Evaluation
Repeatedly evaluating ideas on the dev set can cause an algorithm to gradually overfit to the dev set. If dev set performance is much better than test set performance when development is finished, that is a sign of dev-set overfitting. In that case, get a fresh dev set.
0
1
Tags
Machine Learning
Deep Learning
Machine Learning Strategy
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Related
Dev/Test Set Distribution Not Representative of Actual Distribution
Dev Set Overfitting from Repeated Evaluation
Metric Optimizes the Wrong Project Objective
Which scenario is a warning sign that your dev/test set or evaluation metric needs to change?
True or False: Discovering that your initial dev/test set or metric missed the mark is a serious setback that cannot be easily corrected.
If your _____ is no longer measuring what is most important to you, Ng recommends changing it rather than continuing to optimize for it.
Which of the following is the key warning sign that your dev/test set or evaluation metric may need to be changed?
Ng considers discovering that a dev/test set or metric missed the mark to be a serious setback that requires restarting the evaluation process from scratch.
If your metric is no longer measuring what is most important to your project, you should change the _____.
Match each cause of a dev set/metric incorrectly ranking classifiers to the fix Ng recommends.
Order the steps a team should take upon discovering their dev/test set or metric is no longer guiding them correctly.
Your dev set contains formal customer emails but users primarily submit short social media posts. Which cause does this best illustrate?
After changing your dev/test sets or evaluation metric, updating the project files is sufficient — there is no need to explicitly inform the team of the new direction.
If you have overfit to the dev set, Ng recommends getting more _____ data.
Match each problem scenario to the cause category it represents in Ng's framework for incorrect classifier ranking.
Order the reasoning steps for deciding whether and how to change an evaluation metric that may no longer reflect project goals.
Analyze the warning signs and causes of a development set incorrectly ranking classifiers
Diagnosing Classifier Ranking Mismatch in Spam Detection
Response to Overfitting the Development Set
Learn After
Avoiding Test Set Decisions During Regular Progress Tracking
What causes an algorithm to gradually overfit to the dev set during the development process?
If dev set performance is much better than test set performance after development, this is a sign of dev-set overfitting.
When dev set performance is much better than test set performance, Machine Learning Yearning recommends you get a _____ dev set.
Match each concept related to dev-set overfitting to its correct description.
Arrange the steps of the full dev-set overfitting lifecycle in the correct order.
After development, your dev set performance is far better than your test set performance. What does Machine Learning Yearning recommend?
According to Machine Learning Yearning, you should regularly evaluate your algorithm on the test set throughout development to track progress.
The process of repeatedly evaluating ideas on the dev set causes your algorithm to gradually _____ to the dev set.
Match each verbatim phrase from Machine Learning Yearning to what it refers to or signifies.
Arrange the reasoning steps for diagnosing and responding to a dev-test performance gap in the correct order.
Analyzing the Cause and Solution for Dev Set Overfitting
Diagnosing a Performance Gap After Development
Identifying the Indicator of Dev Set Overfitting