Learn Before
Case Study

Investigating performance gaps through dataset comparison

Case context: You are developing a machine learning model where you observe poor performance on the dev set compared to the training set, indicating a potential data mismatch. You decide to initiate an error analysis.

Question: Based on the principles of error analysis for data mismatch, what specific elements must your analysis compare to successfully diagnose the root cause?

Sample answer: The analysis must compare the training set and the dev set to understand the significant differences between them, as these differences are what lead to the data mismatch.

Key points:

  • Compares the training set against the dev set.
  • Seeks to understand significant differences.
  • Links the dataset differences to the root cause of the mismatch.

Rubric: The answer must state that the error analysis should focus on identifying and understanding the differences between the training set and the dev set.

0

1

Updated 2026-05-27

Contributors are:

Who are from:

Tags

Machine Learning

Deep Learning

Supervised Learning

Dive into Deep Learning @ D2L

Data Science

Machine Learning Strategy

Machine Learning Yearning @ DeepLearning.AI