Concept

Biased Predictions in LLM-based Synthetic Data Generation

When using Large Language Models to synthetically generate data for specific tasks, such as text classification, a potential issue is the emergence of biased predictions. This can manifest as an imbalance in the generated samples, where the majority of instances fall into a single category, leading to a skewed dataset.

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences