Formula

Token-Level Conditional Log-Probability in Supervised Fine-Tuning

The conditional log-probability $\log \mathrm{Pr}_{\theta}(\mathbf{y} \mid \mathbf{x})$ of an entire output sequence $\mathbf{y}$ given an input $\mathbf{x}$ is computed at the token level using the chain rule. For an output sequence of length $n$, the objective sums the log-probabilities of each token $y_i$, conditioned on both the input $\mathbf{x}$ and all preceding output tokens $\mathbf{y}_{<i}$:

$$\log \mathrm{Pr}_{\theta}(\mathbf{y} \mid \mathbf{x}) = \sum_{i=1}^{n} \log \mathrm{Pr}_{\theta}(y_i \mid \mathbf{x}, \mathbf{y}_{<i})$$

Maximizing this conditional log-probability is mathematically equivalent to minimizing the cross-entropy loss over the output tokens.
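To make the chain-rule decomposition concrete, here is a minimal sketch assuming PyTorch (the framework, the function name `sequence_log_prob`, and all tensor shapes are illustrative assumptions, not from the source). It sums the per-token log-probabilities and then checks numerically that maximizing this sum is the same as minimizing the summed cross-entropy loss.

```python
# Minimal sketch, assuming PyTorch; names and shapes are illustrative.
import torch
import torch.nn.functional as F

def sequence_log_prob(logits: torch.Tensor, target_ids: torch.Tensor) -> torch.Tensor:
    """Compute log Pr_theta(y | x) = sum_i log Pr_theta(y_i | x, y_<i).

    logits:     (n, vocab_size) scores, where row i is the model's output
                after conditioning on x and the preceding tokens y_<i.
    target_ids: (n,) gold token ids y_1 .. y_n.
    """
    log_probs = F.log_softmax(logits, dim=-1)                   # (n, vocab_size)
    # Pick out log Pr(y_i | x, y_<i) for each position i.
    token_log_probs = log_probs.gather(-1, target_ids.unsqueeze(-1)).squeeze(-1)
    return token_log_probs.sum()                                # log Pr(y | x)

# Check the equivalence: the summed token log-probability equals the
# negative of the (summed) cross-entropy loss over the same tokens.
logits = torch.randn(4, 8)            # n = 4 output tokens, vocabulary of 8
y = torch.randint(0, 8, (4,))
assert torch.allclose(sequence_log_prob(logits, y),
                      -F.cross_entropy(logits, y, reduction="sum"))
```

In practice, SFT implementations compute exactly this cross-entropy over the output positions, typically masking out the prompt tokens $\mathbf{x}$ so that only the response tokens $y_1, \dots, y_n$ contribute to the loss.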
