1Cademy - Construction of Top-K Candidate Sequences in Beam Search

Learn Before

Top-K Token Selection in Beam Search
Mathematical Definition of Top-K Token Selection

Formula

Construction of Top-K Candidate Sequences in Beam Search

After identifying the $K$ most probable next tokens at step $i$ , the set of top- $K$ candidate sequences is formed by appending each of these tokens to the parent sequence, $y_1...y_{i-1}$ . For each token $y_i^{\text{top k}}$ in the set of top $K$ tokens (where $k$ ranges from 1 to $K$ ), a new candidate sequence $\mathbf{y}^{\text{top k}}$ is constructed as follows: $\mathbf{y}^{\text{top k}} = y_1...y_{i-1}y_i^{\text{top k}}$ This process expands a single parent hypothesis into $K$ new, longer hypotheses to be considered in the next step of the beam search.

Updated 2026-06-27

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course

Word	Probability
mat	0.45
rug	0.25
chair	0.15
floor	0.10
table	0.03
window	0.02

Learn After

Formula for the Candidate Set in Beam Search
In a text generation process, a single partial sequence is being expanded. The current sequence is 'The sun is shining', and the three most probable next words have been identified as 'brightly', 'today', and 'and'. Based on this information, what will be the new set of candidate sequences to consider for the next step?
Error Analysis in Sequence Expansion
In a text generation process, several partial sequences (parent hypotheses) are being considered. For each parent hypothesis, the three most probable next tokens have been identified. Match each parent hypothesis to its correctly constructed set of new, longer candidate sequences.

Learn Before

Related

Learn After