1Cademy - Optimal Parameters Formula in Fine-Tuning

Learn Before

Notation for Parameters in the Fine-Tuning Process

Formula

Optimal Parameters Formula in Fine-Tuning

The optimal parameters, denoted as $\tilde{\theta}$ , obtained through fine-tuning are found by maximizing an objective function over the tuning dataset $\mathcal{D}_{\mathrm{tune}}$ . This relationship is formally expressed as:

$\tilde{\theta} = \argmax_{\hat{\theta}^+} \sum_{\mathrm{sample} \in \mathcal{D}_{\mathrm{tune}}} \mathcal{L}_{\hat{\theta}^+}(\mathrm{sample})$

In this equation, $\hat{\theta}^+$ represents the parameters being actively optimized, which are initialized from the pre-trained parameters $\hat{\theta}$ , and $\mathcal{L}_{\hat{\theta}^+}(\mathrm{sample})$ calculates the objective value for a given sample.

0

1

Updated 2026-04-19

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course

Learn Before

Related