Essay

Explain the consequences of using test set evaluation scores to make algorithmic rollback decisions.

Question: A machine learning team evaluates their system on the test set once per week to track progress. Explain why using the weekly test set evaluation to decide whether to roll back to a previous version of the algorithm is problematic, and describe the resulting consequences on performance estimation.

Sample answer: Using the test set to make decisions regarding the algorithm, such as whether to roll back to a previous system, causes the algorithm to overfit to the test set. Because of this overfitting, the test set can no longer be counted on to provide a completely unbiased estimate of the system's performance.

Key points:

  • Making rollback decisions is an algorithmic decision guided by the test set.
  • Using the test set for algorithm decisions leads to overfitting to the test set.
  • Overfitting to the test set destroys its ability to provide an unbiased estimate of system performance.

Rubric: The response should clearly identify that making rollback decisions based on test set performance leads to overfitting to the test set. It must also explain that this overfitting prevents the test set from giving a completely unbiased estimate of the system's performance.

0

1

Updated 2026-05-26

Contributors are:

Who are from:

Tags

Machine Learning

Deep Learning

Machine Learning Strategy

Supervised Learning

Dive into Deep Learning @ D2L

Data Science

Related