Concept

SFT's Reliance on Labeled Data

Supervised Fine-Tuning (SFT) is fundamentally different from pre-training due to its requirement for labeled data, whereas pre-training utilizes raw text that is readily and widely available. This dependency introduces significant challenges, as the tasks of data annotation and selection are complex, similar to other supervised machine learning domains.

Image 0

0

1

Updated 2026-05-01

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences