1Cademy - A development team has a small, high-quality dataset for training a sentiment analysis model. To improve the models performance without collecting more user data, they use a powerful, general-purpose language model to paraphrase each existing example, generating five new variations for every original sentence while preserving the sentiment label. This process of creating synthetic training examples is most directly analogous to which traditional machine learning practice, and why?

Learn Before

Analogy to NLP Data Augmentation in Synthetic Data Generation

Multiple Choice

A development team has a small, high-quality dataset for training a sentiment analysis model. To improve the model's performance without collecting more user data, they use a powerful, general-purpose language model to paraphrase each existing example, generating five new variations for every original sentence while preserving the sentiment label. This process of creating synthetic training examples is most directly analogous to which traditional machine learning practice, and why?

Updated 2025-10-05

Contributors are:

Who are from:

Learn Before

Related