How to decide how much data to gather

  • It is helpful to plot curves showing the relationship between training set size and generalization error. By extrapolating such curves, one can predict how much additional training data would be needed to achieve a certain level of performance.
  • It is recommended to experiment with training set sizes on a logarithmic scale, for example, doubling the number of examples between consecutive experiments.
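
The two points above can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed method: it assumes the generalization error follows a power law, error ≈ a · n^(-b), which is a common simplification that real curves may not obey (for instance, they often flatten at an irreducible error). The function names and the error values are hypothetical; in practice the errors would come from evaluating a model trained at each size.

```python
import numpy as np


def doubling_sizes(start, n_steps):
    """Training set sizes on a logarithmic scale: each size doubles the previous one."""
    return [start * 2 ** i for i in range(n_steps)]


def fit_power_law(sizes, errors):
    """Fit error ~= a * size**(-b) with a least-squares line in log-log space."""
    slope, intercept = np.polyfit(np.log(sizes), np.log(errors), 1)
    return float(np.exp(intercept)), float(-slope)


def examples_needed(a, b, target_error):
    """Invert error = a * n**(-b) to estimate the n that reaches target_error."""
    return (a / target_error) ** (1.0 / b)


sizes = doubling_sizes(500, 6)  # 500, 1000, 2000, 4000, 8000, 16000

# Hypothetical errors generated from a known curve, purely for illustration.
# Replace with the validation error measured at each training set size.
errors = [2.0 * n ** -0.35 for n in sizes]

a, b = fit_power_law(sizes, errors)
needed = examples_needed(a, b, target_error=0.05)
print(f"fitted a={a:.3f}, b={b:.3f}, estimated examples needed: {needed:,.0f}")
```

Plotting `sizes` against `errors` on log-log axes gives the curve described in the first point. The extrapolated count is only as trustworthy as the assumed curve shape, so it is best treated as a rough estimate and rechecked as more data arrives.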


Updated 2021-07-12

Tags

Data Science