1Cademy - When a model performs well on training data but poorly on the dev set due to distribution differences, what is one recommended strategy?

Learn Before

Finding Training Data That Better Matches Difficult Dev Examples

Multiple Choice

When a model performs well on training data but poorly on the dev set due to distribution differences, what is one recommended strategy?

Updated 2026-07-04

Contributors are:

Who are from:

References

Tags

Machine Learning

Deep Learning

Supervised Learning

Dive into Deep Learning @ D2L

Data Science

Machine Learning Strategy

Machine Learning Yearning @ DeepLearning.AI

No Guaranteed Path for Addressing Data Mismatch
Artificial Data Synthesis for Dev-Set Matching
Representative Synthesized Training Examples
When a model performs well on training data but poorly on the dev set due to distribution differences, what is one recommended strategy?
A data mismatch problem exists when a model performs well on its training set but poorly on a dev set drawn from a different distribution.
When most dev set audio clips were recorded in a _____, one solution is to acquire more training data from that same setting.
Match each concept related to data mismatch to its correct description from ML Yearning.
Order the steps for diagnosing and addressing a data mismatch problem in a speech recognition system.
In ML Yearning's speech recognition example, what is the primary cause of the model's poor performance on the dev set?
According to ML Yearning, acquiring training data that matches the dev set distribution is guaranteed to resolve a data mismatch problem.
To address a data mismatch problem, one recommended option is to find more training data that better _____ the dev-set examples the algorithm has trouble with.
Match each component of the speech recognition scenario to its role in ML Yearning's data mismatch framework.
Order the reasoning steps a practitioner should follow when deciding to seek targeted training data to address a data mismatch.
How does acquiring targeted training data address a data mismatch problem?
Diagnose and resolve the speech recognition mismatch
What training data strategy helps resolve data mismatch?

Learn Before

Related