
Formula for Optimal Output Sequence in LLMs

In language model inference, the optimal output sequence, denoted $\hat{\mathbf{y}}$, is found by maximizing the conditional log probability of the output sequence given the input sequence $\mathbf{x}$. This objective is formally expressed by decomposing the joint probability over the $n$ output tokens:

$$\hat{\mathbf{y}} = \argmax_{\mathbf{y}} \log \Pr(\mathbf{y} \mid \mathbf{x}) = \argmax_{\mathbf{y}} \sum_{i=1}^{n} \log \Pr(y_i \mid x_0, \ldots, x_m, y_1, \ldots, y_{i-1})$$

In this formulation, the input sequence is represented as $x_0, \ldots, x_m$, and the equation models the log probability of predicting subsequent tokens starting from position $m+1$, rather than position $0$.
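The decomposition above can be sketched in code. The snippet below is a minimal toy illustration, not a real LLM: `next_token_probs` is a hypothetical stand-in for the model's conditional distribution $\Pr(y_i \mid x_0,\ldots,x_m,y_1,\ldots,y_{i-1})$, and greedy decoding is used as a cheap local approximation of the intractable $\argmax$ over all sequences.

```python
import math

# Toy vocabulary; a real LLM has tens of thousands of tokens.
VOCAB = ["<eos>", "a", "b"]

def next_token_probs(context):
    # Hypothetical stand-in for Pr(y_i | x_0..x_m, y_1..y_{i-1}).
    # Here the "model" favors repeating "a", otherwise prefers to stop.
    if context and context[-1] == "a":
        return {"<eos>": 0.2, "a": 0.5, "b": 0.3}
    return {"<eos>": 0.6, "a": 0.3, "b": 0.1}

def sequence_log_prob(x, y):
    """Sum of log Pr(y_i | x, y_1..y_{i-1}) over the n output tokens,
    i.e. the objective being maximized in the formula above."""
    total = 0.0
    context = list(x)  # start from the input tokens x_0..x_m
    for tok in y:
        total += math.log(next_token_probs(context)[tok])
        context.append(tok)  # condition on previously generated tokens
    return total

def greedy_decode(x, max_len=5):
    """Approximate argmax_y by picking the locally best token each step.
    Exact maximization over all sequences is intractable in practice."""
    y = []
    context = list(x)
    for _ in range(max_len):
        probs = next_token_probs(context)
        tok = max(probs, key=probs.get)
        y.append(tok)
        context.append(tok)
        if tok == "<eos>":
            break
    return y
```

Greedy decoding maximizes each term of the sum independently, so it is not guaranteed to find the globally optimal $\hat{\mathbf{y}}$; beam search or sampling strategies trade off between search quality and cost.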


Updated 2026-04-19

Tags

Foundations of Large Language Models

Ch.2 Generative Models - Foundations of Large Language Models
