Concept

Insufficiency of Data Fitting for Complex Value Alignment

Aligning LLMs with complex human values is not merely a data-fitting task. A limited set of human-annotated samples often cannot describe the full range of desired behaviors. The core objective is therefore to teach the model a general ability to judge which outputs better align with human preferences, rather than to have it replicate a fixed set of examples.
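One way to make "judging which output is better aligned" concrete is a pairwise ranking objective in the Bradley-Terry style. The concept above does not name a specific objective, so the following is only a minimal sketch under that assumption. The function name `pairwise_preference_loss` and the scalar scores are hypothetical, standing in for whatever a learned scoring model would assign to two candidate outputs.

```python
import math


def pairwise_preference_loss(score_preferred: float, score_rejected: float) -> float:
    """Negative log-likelihood that the preferred output outranks the rejected one.

    Assumes a Bradley-Terry style model (an illustrative choice):
    P(preferred beats rejected) = sigmoid(score_preferred - score_rejected).
    """
    margin = score_preferred - score_rejected
    # -log(sigmoid(margin)) equals softplus(-margin); this form avoids overflow
    # for large negative margins.
    return math.log1p(math.exp(-abs(margin))) + max(-margin, 0.0)


# Hypothetical scores for two candidate responses to the same prompt.
loss_when_ranking_is_right = pairwise_preference_loss(2.0, 0.0)
loss_when_ranking_is_wrong = pairwise_preference_loss(0.0, 2.0)
```

The loss depends only on the difference between the two scores, not on matching any particular reference output. In this sketch, that is what separates learning a comparison from fitting a fixed set of examples: the same scoring function can be applied to outputs that never appeared in the annotated data.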

Updated 2025-10-06

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences