1Cademy - Characteristics of the Starting Model for Alignment

Learn Before

Establishing the Initial Policy in RLHF

Short Answer

Characteristics of the Starting Model for Alignment

When a language model is aligned with human preferences through human feedback, which two training stages should the model have completed before the feedback process begins? Briefly explain how each stage prepares the model to serve as a functional baseline.

Updated 2025-10-06

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences