1Cademy - Search for Optimal Output Sequence in LLMs

Learn Before

Mathematical Formulation of the Search Problem in LLM Inference

Activity (Process)

Search for Optimal Output Sequence in LLMs

The search process in language model inference aims to identify an output sequence y that maximizes the conditional log-probability, log Pr(y|x), given an input x. Because the space of candidate sequences is usually too large to search exhaustively, the sequence actually returned may be sub-optimal, that is, a high-scoring sequence rather than the true maximizer.
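
The objective above can be written compactly in symbols. This is a sketch under stated assumptions: the hat notation for the selected sequence, the output length n, and the token-level factorization (which holds for autoregressive models via the chain rule) are notation introduced here, not taken from this page.

```latex
\hat{y} = \arg\max_{y} \log \Pr(y \mid x),
\qquad
\log \Pr(y \mid x) = \sum_{i=1}^{n} \log \Pr(y_i \mid x, y_{<i})
```

Since the logarithm is monotonic, maximizing log Pr(y|x) selects the same sequence as maximizing Pr(y|x).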

Updated 2026-04-19


References

Reference of Foundations of Large Language Models Course

Learn After

A language model's inference process aims to find an output sequence y that maximizes the conditional probability Pr(y|x) given an input x. Suppose the model has the input 'The sun is shining and the sky is' and calculates the probabilities for the next word as follows:
- Pr('blue' | 'The sun is shining and the sky is') = 0.65
- Pr('clear' | 'The sun is shining and the sky is') = 0.25
- Pr('vast' | 'The sun is shining and the sky is') = 0.09
- Pr('falling' | 'The sun is shining and …

A language model's objective is to find the output sequence with the highest overall conditional probability. Given the input 'The weather is', the model needs to generate a two-word sequence. It has calculated the following probabilities:

Probabilities for the first word:
- Pr('nice' | 'The weather is') = 0.6
- Pr('cold' | 'The weather is') = 0.4

Probabilities for the second word, depending on the first:
- Pr('today' | 'The weather is nice') = 0.5
- Pr('and' | 'The weather is cold') = 0.9
Ba
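
The comparison in the 'The weather is' example can be sketched in a few lines of Python. This is a minimal illustration, not a real decoding implementation: the dictionary names and the helper `sequence_log_prob` are hypothetical, and only the continuations listed above are treated as candidates.

```python
import math

# Conditional probabilities copied from the 'The weather is' example.
# Only the continuations listed in the text are included (an assumption).
first_word = {"nice": 0.6, "cold": 0.4}
second_word = {"nice": {"today": 0.5}, "cold": {"and": 0.9}}


def sequence_log_prob(words):
    """Log-probability of a two-word sequence via the chain rule."""
    first, second = words
    return math.log(first_word[first]) + math.log(second_word[first][second])


# Enumerate every listed two-word candidate and score it.
candidates = [(w1, w2) for w1 in first_word for w2 in second_word[w1]]
scores = {seq: sequence_log_prob(seq) for seq in candidates}

# Sequence with the highest overall log-probability.
best = max(scores, key=scores.get)

# Word a greedy, one-step-at-a-time choice would pick first.
greedy_first = max(first_word, key=first_word.get)

for seq, score in scores.items():
    print(" ".join(seq), round(math.exp(score), 2))
print("best sequence:", " ".join(best))
print("greedy first word:", greedy_first)
```

With these numbers, 'nice today' scores 0.6 × 0.5 = 0.30 and 'cold and' scores 0.4 × 0.9 = 0.36, so the highest-probability sequence starts with the less likely first word. Picking the most probable word at each step does not always maximize the probability of the whole sequence.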
Comparing Output Sequence Probabilities
Formula for Optimal Output Sequence in LLMs
