Learn Before
Fundamental Approaches to LLM Alignment
Two of the most widely used and foundational approaches for aligning Large Language Models (LLMs) are instruction alignment and human preference alignment. Instruction alignment typically employs supervised fine-tuning (SFT) to teach the model to follow user instructions, while human preference alignment typically uses reinforcement learning from human feedback (RLHF), guided by a reward model trained on human preference judgments.
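The contrast between the two approaches can be sketched in code. The toy example below is illustrative only: `sft_step`, `policy_gradient_step`, and `reward_model` are assumed names rather than any library's API, random tensors stand in for a transformer's hidden states, and a one-line REINFORCE update stands in for the PPO-style optimization typically used in RLHF.

```python
import torch
import torch.nn.functional as F

vocab, dim = 100, 32
model = torch.nn.Linear(dim, vocab)      # stand-in for an LLM's output head
reward_model = torch.nn.Linear(dim, 1)   # stand-in for a learned reward model
opt = torch.optim.SGD(model.parameters(), lr=1e-2)

def sft_step(hidden, target_ids):
    """Instruction alignment: supervised fine-tuning on (prompt, ideal-response)
    pairs -- maximize the likelihood of the human-written response tokens."""
    logits = model(hidden)                      # (seq_len, vocab)
    loss = F.cross_entropy(logits, target_ids)  # next-token prediction loss
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()

def policy_gradient_step(hidden, sampled_ids):
    """Human preference alignment: the reward model scores a pooled
    representation of the sequence (a crude stand-in for scoring the
    generated response) and we raise the log-probability of responses it
    rates highly. Simplified REINFORCE, not full PPO."""
    logp = F.log_softmax(model(hidden), dim=-1)
    logp_response = logp[torch.arange(len(sampled_ids)), sampled_ids].sum()
    reward = reward_model(hidden.mean(0)).squeeze().detach()  # scalar score
    loss = -reward * logp_response
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()

# Toy usage: random "hidden states" in place of real transformer outputs.
h = torch.randn(8, dim)
print(sft_step(h, torch.randint(vocab, (8,))))
print(policy_gradient_step(h, torch.randint(vocab, (8,))))
```

In practice these stages are applied in sequence after pre-training (the pre-train-then-align pipeline): supervised fine-tuning first, then preference optimization against a reward model trained on human rankings.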
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.4 Alignment - Foundations of Large Language Models
Ch.2 Generative Models - Foundations of Large Language Models
Related
Guidance Sources for LLM Alignment
Desirable Attributes of Aligned LLMs
Aligning Large Language Models with Human Values
Challenges in LLM Alignment
Increased Research in LLM Alignment due to Control Concerns
Instruction Alignment
Necessity of Multiple LLM Alignment Methods
Human Preference Alignment via Reward Models
Inference-Time LLM Alignment
Surge in LLM Alignment Research
Fundamental Approaches to LLM Alignment
Increased Urgency of AI Alignment with Advances in AI Capabilities
Goal of LLM Alignment: Accuracy and Safety
Complexity of Human Values in LLM Alignment
Rapid Pace of Research in LLM Alignment
Post-Pre-training Alignment Steps
A user provides the following input to a large language model: 'My five-year-old has a fever of 103°F. What should I do?'
Response A: 'A fever of 103°F in a five-year-old can be caused by various factors, including viral infections like the flu or bacterial infections like strep throat. Historically, fevers were treated with methods like bloodletting, but today...'
Response B: 'I am not a medical professional. A fever of 103°F in a five-year-old can be serious, and you should contact a doctor or seek emergency medical care immediately for guidance.'
Which response better demonstrates the goal of guiding a model's behavior to be consistent with human intentions, and why?
Analysis of an AI Assistant's Behavior
A large language model, pre-trained on a vast dataset from the internet, is tasked with being a helpful and harmless assistant. When a user asks it to 'write a funny story about a programmer,' the model generates a story that relies on negative and outdated stereotypes for its humor. Which statement best analyzes this situation from the perspective of model alignment?
Example of Alignment: Avoiding Harmful Requests
Reward Models as Human Expert Proxies in LLM Alignment
Pre-train-then-align Method for LLM Development
A research lab has just completed the initial training of a new language model on a vast, general-purpose dataset from the internet. The model demonstrates a broad understanding of facts and language patterns but frequently generates outputs that are unstructured, fail to follow user requests, or are generally unhelpful. What is the most logical and crucial subsequent phase in the model's development to resolve these specific shortcomings, and what is the underlying reason for it?
A team is developing a new large language model intended to be a helpful assistant. Arrange the following major development phases in the correct chronological order.
After a language model is trained on a massive, unlabeled text corpus, a single, subsequent training phase focused on human-provided examples is typically sufficient to ensure the model is both helpful in following instructions and safe in its responses.
Learn After
Surrogate Objectives in AI Alignment
Combined Use of Instruction and Human Preference Alignment
Differing Motivations of Instruction and Human Preference Alignment
Instruction Alignment
Human Preference Alignment via Reward Models
A development team is working to improve a large language model's behavior. They create two distinct datasets:
- Dataset 1: A curated set of prompts, each paired with a single, ideal, human-written response that demonstrates how to follow the prompt's instructions correctly.
- Dataset 2: A set of prompts where, for each prompt, a human evaluator has ranked several different model-generated responses from best to worst.
Which statement best analyzes the relationship between these datasets and the two fundamental approaches to model alignment? (A sketch of both dataset formats follows this question.)
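As a concrete illustration of the two dataset types described above (field names here are assumptions for illustration, not a standard format), each maps onto one of the two fundamental alignment approaches:

```python
# Dataset 1 -> instruction alignment (supervised fine-tuning):
# each prompt is paired with a single ideal, human-written response.
sft_dataset = [
    {"prompt": "Summarize this article...", "response": "The article argues..."},
]

# Dataset 2 -> human preference alignment (reward modeling):
# each prompt is paired with several model responses ranked by a human
# evaluator; the ranking, not a single gold answer, is the training signal.
preference_dataset = [
    {
        "prompt": "Summarize this article...",
        "ranked_responses": [  # best first, worst last
            "Concise, accurate summary...",
            "Vague summary missing key points...",
            "Off-topic response...",
        ],
    },
]
```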
Match each fundamental model alignment approach with its primary goal and typical implementation method.
Prioritizing Chatbot Alignment Strategies