Formula for Constructing Top-K Candidate Sequences
In the beam search algorithm, the top candidate sequences for step i are generated by extending the previous step's prefixes with the newly selected tokens. Each new sequence is formed by appending a top token y to a prefix y_{<i}, written y_{<i} ∘ y. The final candidate set, denoted C, is then formally defined as C = { y_{<i} ∘ y | y ∈ argTopK_{y ∈ V} Pr(y | y_{<i}) }, where K represents the beam width.
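The construction above can be sketched in code. This is a minimal illustration, not a reference implementation: the function and variable names (top_k_candidates, toy_dist) are hypothetical, and the scoring model is a toy distribution matching the example vocabulary used later on this page.

```python
import math

def top_k_candidates(prefixes, next_token_log_probs, k):
    """One step of candidate construction: extend each scored prefix with
    every vocabulary token, then keep the k highest-scoring sequences
    (argTopK over the candidates, with k playing the role of beam width)."""
    candidates = []
    for prefix, prefix_score in prefixes:
        for token, log_p in next_token_log_probs(prefix).items():
            # Appending a token multiplies probabilities, i.e. adds log-probs.
            candidates.append((prefix + [token], prefix_score + log_p))
    # argTopK: sort candidates by score and keep the best k.
    candidates.sort(key=lambda c: c[1], reverse=True)
    return candidates[:k]

def toy_dist(prefix):
    # Hypothetical fixed next-token distribution over V = {'A', ..., 'E'}.
    probs = {'A': 0.1, 'B': 0.4, 'C': 0.05, 'D': 0.3, 'E': 0.15}
    return {t: math.log(p) for t, p in probs.items()}

# Starting from an empty prefix with score 0, the three highest-probability
# tokens are kept: B (0.4), D (0.3), E (0.15).
beam = top_k_candidates([([], 0.0)], toy_dist, k=3)
```

With K = 3 the surviving candidates are ['B'], ['D'], and ['E'], matching an argTopK selection over this distribution.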

Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
At a certain step in a sequence generation process, the probabilities for the next token over a vocabulary V = {'A', 'B', 'C', 'D', 'E'} are as follows: Pr('A') = 0.1, Pr('B') = 0.4, Pr('C') = 0.05, Pr('D') = 0.3, Pr('E') = 0.15. If the selection process is defined by the function argTopK with K = 3, which set of tokens will be selected?
Analyzing a Formalism for Token Selection
Construction of Top-K Candidate Sequences in Beam Search
Formula for Constructing Top-K Candidate Sequences
Evaluating a Token Selection Implementation
Construction of the Optimal Sequence in Greedy Search
An autoregressive model generates the token sequence y_1, y_2, …, y_n, where y_2 is conditioned on y_1, y_3 on y_1 y_2, and so on. What does the notation y_{<i} represent in this specific sequence?
True or False: For an autoregressive model generating the output sequence y = y_1 y_2 … y_n, the notation y_{<i} represents the complete subsequence y_1 y_2 … y_{i-1}.
An autoregressive language model is generating a sequence of tokens, one at a time. To predict the fifth token in the sequence, denoted as y_5, the model uses all the previously generated tokens as context. The standard notation for this preceding subsequence of tokens is ____.
Learn After
A language model is generating a sequence of tokens. The sequence generated so far is [501, 243, 988]. At the current step, the model has identified the 3 most probable next tokens as [104, 675, 312]. Based on this information, what is the resulting set of new candidate sequences?
Deconstructing Candidate Sequences
Formula for the Candidate Set in Top-K Decoding
A language model is generating text using a top-K decoding strategy. Arrange the following steps in the correct order to describe how a single new candidate sequence is constructed from a given preceding sequence and a set of top-K next tokens.
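The candidate-set construction asked about in the related questions above can be worked through directly: each new candidate is the preceding sequence with one of the top-K tokens appended. This is a sketch using the token IDs from the example question; no real model or library is assumed.

```python
# Preceding sequence and top-3 next tokens from the example question.
prefix = [501, 243, 988]
top_k_tokens = [104, 675, 312]

# Each new candidate sequence = prefix with one top-K token appended,
# mirroring the set-builder form C = { y_{<i} o y | y in argTopK }.
candidates = [prefix + [t] for t in top_k_tokens]
```

This yields three candidate sequences, one per selected token, each of length len(prefix) + 1.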