Case Study

Evaluating Team Strategy for Improving an Image Classifier Using Error Analysis

Case context: An engineering team is building an image classification model. The project leader suggests that the team spend the next month collecting and labeling a massive amount of new training data to improve accuracy. However, the lead researcher suggests first calculating and analyzing the bias and variance of the current model.

Question: Based on the concepts of major error sources, explain why the lead researcher's recommendation to analyze bias and variance is the correct first step before approving the project leader's plan to collect more data.

Sample answer: The lead researcher is correct because bias and variance are the two major sources of error in machine learning. Analyzing them allows the team to determine whether the model is limited by high bias or high variance. This understanding is essential because it reveals whether tactics like adding more training data will actually improve performance or if it would be a waste of team time.

Key points:

  • Identify bias and variance as the primary sources of error.
  • Explain that variance/bias analysis indicates whether adding more data will improve performance.
  • Conclude that analyzing these errors prevents wasting time on ineffective tactics like premature data collection.

Rubric: Student must explain that bias and variance are the two major error sources, and that analyzing them is necessary to decide if collecting more data is a productive use of time.

0

1

Updated 2026-05-26

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L

Machine Learning

Deep Learning

Supervised Learning

Data Science

Machine Learning Strategy

Related