1Cademy - Objective of Supervised Fine-Tuning

Learn Before

Fine-tuning LLMs with Labeled Data

Concept

Objective of Supervised Fine-Tuning

The primary goal of Supervised Fine-Tuning (SFT) is to adapt a model that already has pre-trained parameters, denoted as $\hat{\theta}$ . The adaptation process involves adjusting these parameters to maximize the conditional probability of generating the desired output sequence, $y$ , given a specific input sequence, $x$ . This objective, expressed as maximizing $\mathrm{Pr}(y|x)$ , aligns the model's behavior with the provided instruction-response examples.

Updated 2026-06-17

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course

Learn After

Maximum Likelihood Estimation (MLE) as the Objective for Supervised Fine-Tuning
A development team is fine-tuning a pre-trained language model using a curated dataset of customer support inquiries (inputs) and their corresponding ideal, human-written responses (outputs). The aim is to create a specialized chatbot that reliably provides answers in the same helpful and accurate style as the examples. From a probabilistic perspective, which statement best describes the fundamental objective of this training process?
Correcting a Flawed Fine-Tuning Objective
Objective for a Specialized Math Tutor
Mathematical Formulation of the Supervised Fine-Tuning Objective
Conditional vs. Joint Probability Objectives in Language Modeling

Learn Before

Related

Learn After