1Cademy - Definition of SFT Datasets

Definition of SFT Datasets

A Supervised Fine-Tuning (SFT) dataset, commonly denoted by $S$, is a curated collection of annotated input-output pairs used to adapt a pre-trained language model. In each pair, the input is a user instruction and the output is the corresponding correct, or "gold-standard", response that the model is expected to generate for that instruction.
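
As a rough illustration of this definition, the sketch below represents $S$ as a list of instruction-response pairs. The names `SFTExample`, `sft_dataset`, and `is_well_formed` are hypothetical and chosen only for this example, and the two sample pairs are made up; real SFT datasets are typically far larger and stored in formats such as JSON Lines.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SFTExample:
    """One annotated pair: a user instruction and its gold-standard response."""
    instruction: str  # the input shown to the model
    response: str     # the output the model is expected to generate


# A toy dataset S with two hand-written (made-up) pairs.
sft_dataset = [
    SFTExample(
        instruction="Translate 'good morning' into French.",
        response="Bonjour.",
    ),
    SFTExample(
        instruction="What is 2 + 3?",
        response="5",
    ),
]


def is_well_formed(dataset):
    """Return True if every pair has a non-empty instruction and response."""
    return all(ex.instruction.strip() and ex.response.strip() for ex in dataset)


print(len(sft_dataset), is_well_formed(sft_dataset))
```

The check in `is_well_formed` is only a minimal stand-in for curation; it says nothing about whether a response is actually correct, which in practice depends on the annotation process.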