Learn Before
Insufficiency of Data Fitting for Value Alignment
The alignment of LLMs cannot be achieved simply by fitting the model to a limited set of human-annotated examples. Such samples are often insufficient to capture the full spectrum of desired behaviors tied to complex human values. The goal is therefore not mere data fitting, but teaching the model to judge which outputs are more consistent with human preferences.
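To make the contrast concrete, one widely used way to teach a model which outputs are preferred, rather than to fit fixed labels, is to train a reward model on pairwise human comparisons, as in RLHF. The sketch below is a minimal illustration in PyTorch, not a method stated on this card; the `preference_loss` function and the toy scores are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

# Minimal sketch of a pairwise preference loss (Bradley-Terry style),
# as used when training a reward model from human comparisons.
# Instead of fitting fixed "aligned"/"not aligned" labels, the model
# learns to score the human-preferred response above the rejected one.

def preference_loss(score_chosen: torch.Tensor,
                    score_rejected: torch.Tensor) -> torch.Tensor:
    """score_chosen / score_rejected: scalar reward-model scores for
    the preferred and dis-preferred responses in each comparison pair."""
    # -log sigmoid(r_chosen - r_rejected): minimized when the model
    # consistently ranks the preferred response higher.
    return -F.logsigmoid(score_chosen - score_rejected).mean()

# Toy usage: scores for a batch of three comparison pairs.
chosen = torch.tensor([1.2, 0.3, 2.0])
rejected = torch.tensor([0.4, 0.9, 1.5])
print(preference_loss(chosen, rejected))  # lower is better
```

Minimizing this loss pushes the model to rank preferred outputs above dis-preferred ones across the whole comparison set, which is what lets it generalize beyond any fixed set of labeled examples.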
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Shift in LLM Alignment from Predefined Tasks to Real-World Interaction
Impracticality of Achieving Alignment Solely Through Pre-training
Need for Diverse Alignment Methods
Difficulty of Encoding Human Values in Datasets
Inarticulacy of Human Preferences as an Alignment Challenge
Goodhart's Law
Real-World Complexity as an Alignment Challenge
Specification Gaming in AI Alignment
Alignment Challenges as a Motivator for AI Research
Diversity and Fluidity of Human Values as an Alignment Challenge
Analysis of an LLM Alignment Failure
A development team building a chatbot aims for it to be 'helpful' to all users. The team discovers that behaviors praised as helpful by users in one country are criticized as intrusive by users in another, and the issue persists even after training the model on vast, culturally diverse datasets. Which fundamental challenge in guiding a model's behavior does this scenario best illustrate?
Evaluating Core Difficulties in Model Behavior Guidance
Challenge of Defining Human Values for AI Objectives
Learn After
Critique of an AI Alignment Strategy
An AI development team aims to build a helpful and harmless chatbot. The team's strategy is to create a large dataset in which human experts label thousands of potential chatbot responses to various prompts as either "aligned" or "not aligned." The model is then trained to generate responses that match the "aligned" labels. Which statement best analyzes the fundamental weakness of relying solely on this data-fitting method for alignment?
True or False: If an AI development team could create a massive, perfectly labeled dataset covering a vast range of human interactions, training a large language model to perfectly replicate the 'good' labels in this dataset would be sufficient to ensure the model is fully aligned with human values.