Essay

Analyzing Computational and Representational Costs of Unhelpful Training Data

Question: Explain how incorporating training data that has no benefit impacts a machine learning system's training process, specifically focusing on computational resources and neural-network representation capacity. Ground your answer in the concept of excluding data that deviates entirely from the dev/test distribution.

Sample answer: According to the source, adding training data that has no benefit should be left out for computational reasons. Including irrelevant data wastes valuable computational resources during training. Furthermore, including such data wastes the neural-network's representation capacity, as the network dedicates its parameters to learning and representing features that are irrelevant to the dev/test distribution.

Key points:

  • Data with no benefit should be left out of training for computational reasons.
  • Including irrelevant data wastes neural-network representation capacity.
  • Irrelevant data forces the network to learn features that do not align with the dev/test distribution.

Rubric: A satisfactory response must mention that unhelpful training data consumes/wastes computational resources during training and also wastes the neural-network's representation capacity by forcing it to represent patterns that do not benefit the dev/test distribution.

0

1

Updated 2026-05-26

Contributors are:

Who are from:

Tags

Machine Learning

Deep Learning

Supervised Learning

Dive into Deep Learning @ D2L

Data Science

Machine Learning Strategy

Related