Concept

Generalization as an Outcome of SFT

A key result of Supervised Fine-Tuning (SFT) is that the model gains the ability to generalize its task execution capabilities. For instance, after being fine-tuned on a set of question-answer pairs, an LLM can correctly respond to new questions that were not included in the specialized SFT dataset.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related