Learn Before
Concept

Insufficiency of Data Fitting for Value Alignment

The alignment of LLMs cannot be achieved simply by fitting the model to a limited set of human-annotated data. Such samples are often insufficient to describe the full spectrum of desired behaviors related to complex human values. The goal is therefore not just data fitting, but teaching the model to determine which outputs are more consistent with human preferences.

0

1

Updated 2025-10-07

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences