1Cademy - Probabilistic Objective of Supervised Fine-Tuning

The objective of supervised fine-tuning is to find the optimal model parameters, $\tilde{\theta}$, by maximizing an objective function, $L$, summed over all samples in the fine-tuning dataset, $D_{tune}$. Optimization starts from the parameters of the pre-trained model; the parameters being optimized, initialized in this way, are denoted $\hat{\theta}^{+}$. Formally:

$$\tilde{\theta} = \arg\max_{\hat{\theta}^{+}} \sum_{\text{sample} \in D_{tune}} L_{\hat{\theta}^{+}}(\text{sample})$$

This equation frames fine-tuning as a maximization problem. Typically, $L_{\hat{\theta}^{+}}(\text{sample})$ is the log-likelihood the model assigns to the sample, so the objective amounts to maximizing the likelihood of the fine-tuning data.
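The objective can be illustrated with a minimal sketch under stated assumptions. The model here is a toy tabular softmax, not a language model: `theta[x]` holds the logits of $p_\theta(y \mid x)$, each sample is an `(x, y)` pair, and $L$ is taken to be the log-likelihood. The names `theta_hat`, `d_tune`, `objective`, and `fine_tune` are hypothetical, and plain gradient ascent stands in for whatever optimizer a real fine-tuning run would use.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(v - m) for v in logits))
    return [v - log_z for v in logits]


def objective(theta, d_tune):
    """Sum over samples of L_theta(sample) = log p_theta(y | x)."""
    return sum(log_softmax(theta[x])[y] for x, y in d_tune)


def fine_tune(theta_pretrained, d_tune, lr=0.5, steps=200):
    """Gradient ascent on the summed log-likelihood (toy assumption).

    The search starts from a copy of the pre-trained parameters, so the
    pre-trained values themselves are left untouched.
    """
    theta = [row[:] for row in theta_pretrained]
    for _ in range(steps):
        grad = [[0.0] * len(row) for row in theta]
        for x, y in d_tune:
            probs = [math.exp(v) for v in log_softmax(theta[x])]
            for k, p in enumerate(probs):
                # d/d theta[x][k] of log p(y | x) = 1[k == y] - p(k | x)
                grad[x][k] += (1.0 if k == y else 0.0) - p
        for i, row in enumerate(theta):
            for k in range(len(row)):
                row[k] += lr * grad[i][k]  # ascent step: maximize, not minimize
    return theta


# Hypothetical pre-trained parameters and fine-tuning data.
theta_hat = [[0.0, 0.0, 0.0], [1.0, 0.0, -1.0]]
d_tune = [(0, 2), (0, 2), (0, 1), (1, 0)]

theta_tilde = fine_tune(theta_hat, d_tune)
print("objective before:", objective(theta_hat, d_tune))
print("objective after: ", objective(theta_tilde, d_tune))
```

In this sketch the printed objective after fine-tuning should be higher (closer to zero) than before, since every step moves the parameters uphill on the summed log-likelihood. In practice the same objective is usually implemented as minimizing the negative log-likelihood (cross-entropy) with a stochastic optimizer, which is equivalent up to sign.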

Updated 2026-06-25
