Short Answer

Name the two primary goals of artificial data synthesis when training and dev distributions differ

Question: When using artificial data synthesis to match a development set, what two main characteristics must the synthesized training dataset possess according to Machine Learning Yearning?

Sample answer: The synthesized training dataset must be a huge dataset and it must reasonably match the dev set.

Key points:

  • It must be a huge dataset
  • It must reasonably match the dev set

Rubric: The answer is correct if it states that the dataset must be huge/large and that it must reasonably match/align with the dev set.

0

1

Updated 2026-05-26

Contributors are:

Who are from:

Tags

Machine Learning

Deep Learning

Supervised Learning

Dive into Deep Learning @ D2L

Data Science

Machine Learning Strategy

Related