Case Study

Evaluating Non-Exclusive Column Sums in a Cat Detector Spreadsheet

Case context: During a team review of a cat classifier's dev-set error analysis spreadsheet, an engineer notices that the percentages at the bottom of the columns (representing categories like 'Great Cat', 'Blurry', and others) sum to 118%. The engineer argues that there must be a formula error in the spreadsheet because the total exceeds 100%.

Question: How should you respond to the engineer? Explain the concept of multi-category association in error analysis, reference the specific example of Image #3 from the course material, and explain why the sum of 118% is mathematically valid.

Sample answer: You should explain to the engineer that the spreadsheet formula is correct and the sum of 118% is expected. In an error-analysis spreadsheet, categories are not mutually exclusive; a single misclassified image can belong to multiple error categories. For example, Image #3 has both the 'Great Cat' and 'Blurry' columns checked. Because this single image contributes to the count of multiple categories, the column percentages at the bottom represent the frequency of each error type independently, not parts of a single 100% total. Therefore, the sum of these non-exclusive percentages will naturally exceed 100% when there is overlap.

Key points:

  • Explain that error categories in the spreadsheet are not mutually exclusive.
  • State that a single example can belong to multiple categories simultaneously.
  • Cite Image #3 from the text, which is associated with both 'Great Cat' and 'Blurry'.
  • Clarify that percentages represent the incidence of each error category independently, justifying a sum greater than 100%.

Rubric: The response must explain that error categories are not mutually exclusive, mention that one image can belong to multiple categories simultaneously, reference the Image #3 example (belonging to both 'Great Cat' and 'Blurry'), and conclude that the sum exceeding 100% is correct and expected.

0

1

Updated 2026-05-27

Contributors are:

Who are from:

Tags

Machine Learning

Deep Learning

Supervised Learning

Dive into Deep Learning @ D2L

Data Science

Machine Learning Strategy

Machine Learning Yearning @ DeepLearning.AI