Formula

Objective of Instruction Fine-Tuning

The objective of instruction fine-tuning is to optimize the pre-trained model parameters, denoted as $\hat{\theta}$, using a smaller fine-tuning dataset $\mathcal{D}_{\mathrm{tune}}$. The goal is to maximize the likelihood of generating the desired responses for the samples in the fine-tuning dataset. The objective function is formulated as:

$$
\tilde{\theta} = \arg\max_{\hat{\theta}^+} \sum_{\mathrm{sample} \in \mathcal{D}_{\mathrm{tune}}} \mathcal{L}_{\hat{\theta}^+}(\mathrm{sample})
$$

where $\tilde{\theta}$ represents the optimized parameters after fine-tuning, and $\hat{\theta}^+$ indicates that the optimization starts from the pre-trained parameters $\hat{\theta}$.
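The objective above can be sketched in code. This is a minimal illustration, not the book's implementation: the tabular "model" (a dictionary mapping a context string to a next-token distribution) and all function names are hypothetical stand-ins for a real language model, and $\mathcal{L}$ is taken to be the log-likelihood of the desired response.

```python
import math

def log_likelihood(model, sample):
    """L_theta(sample): sum of log-probabilities of the desired response
    tokens, each conditioned on the prompt plus the tokens emitted so far.

    `model` is a hypothetical toy stand-in: a dict mapping a context string
    to a dict of next-token probabilities."""
    prompt, response = sample
    context = prompt
    total = 0.0
    for token in response:
        total += math.log(model[context][token])
        context = context + " " + token  # grow the context autoregressively
    return total

def tuning_objective(model, d_tune):
    """The quantity maximized during fine-tuning: the summed log-likelihood
    of all (prompt, desired-response) samples in D_tune."""
    return sum(log_likelihood(model, sample) for sample in d_tune)

# Toy tuning set with one sample: prompt "hi", desired response "there !".
toy_model = {
    "hi": {"there": 0.5, "you": 0.5},
    "hi there": {"!": 1.0},
}
d_tune = [("hi", ["there", "!"])]
score = tuning_objective(toy_model, d_tune)  # log(0.5) + log(1.0)
```

In practice the maximization over $\hat{\theta}^+$ is carried out by gradient ascent on this sum (equivalently, gradient descent on the negative log-likelihood), starting from the pre-trained parameters rather than a random initialization.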


Updated 2026-04-19
