Case Study

Diagnose generalization issues in a speech recognition system.

Case context: Suppose you have developed a speech recognition system that does very well on the training set and on the training dev set. However, it does poorly on your dev set.

Question: Based on this scenario, diagnose the specific problem your speech recognition system is experiencing and explain the underlying reason for this issue.

Sample answer: The speech recognition system is experiencing a data mismatch problem. This occurs because the training set data is a poor match for the dev set data, causing the system to perform well on the training and training dev sets (which share the same distribution) but poorly on the dev set.

Key points:

  • Diagnose the issue as a data mismatch problem.
  • Acknowledge the system performs well on the training set and training dev set but poorly on the dev set.
  • Explain that the training set data is a poor match for the dev set data.

Rubric: The student must diagnose the problem as data mismatch and explain that it is caused by the training set data being a poor match for the dev set data.

0

1

Updated 2026-05-27

Contributors are:

Who are from:

Tags

Machine Learning

Deep Learning

Supervised Learning

Dive into Deep Learning @ D2L

Data Science

Machine Learning Strategy

Machine Learning Yearning @ DeepLearning.AI