Concept

Generalization from Supervised Fine-Tuning

A key outcome of Supervised Fine-Tuning (SFT) is the model's ability to generalize its learning. After being trained on a set of specific examples, such as question-answer pairs, the LLM can correctly respond to new questions that were not included in the SFT dataset.

Image 0

0

1

Updated 2026-04-30

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related