Short Answer

What is the risk of using test set performance to guide algorithm rollback decisions?

Question: If a team uses regular test set evaluation to decide whether to roll back to a previous system version, what is the primary risk to the test set's utility?

Sample answer: The primary risk is that the team will overfit to the test set. When this happens, the test set can no longer provide a completely unbiased estimate of the system's performance.

Key points:

  • It leads to overfitting to the test set.
  • The test set can no longer give a completely unbiased estimate of system performance.

Rubric: The answer should state that the primary risk is overfitting to the test set, which prevents it from being a completely unbiased estimate of performance.

0

1

Updated 2026-05-26

Contributors are:

Who are from:

Tags

Machine Learning

Deep Learning

Machine Learning Strategy

Supervised Learning

Dive into Deep Learning @ D2L

Data Science

Related