Learn Before
Formal Derivation of the Top-k Selection Pool
The selection pool in top-k sampling, V_i, is formally derived by identifying the K tokens with the highest conditional probabilities from the entire vocabulary V at each generation step i. This selection process is formalized using the argTopK operator, which ranks the prediction probabilities of all possible next tokens and returns the K highest-ranked ones. The resulting selection pool is thus defined as:

V_i = argTopK_{y ∈ V} Pr(y | x, y_{<i})

where the probability is conditioned on the input x and the preceding token sequence y_{<i}.
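The definition above can be sketched in a few lines of Python. This is a minimal illustration, assuming the model's next-token probabilities Pr(y | x, y_{<i}) are given as a token-to-probability dict; the function name `arg_top_k` is illustrative, not from the text.

```python
def arg_top_k(probs, k):
    """Return the selection pool V_i: the set of the k tokens
    with the highest conditional probabilities."""
    ranked = sorted(probs, key=probs.get, reverse=True)
    return set(ranked[:k])

# Example probabilities for the next token at one generation step.
probs = {'the': 0.45, 'a': 0.20, 'cat': 0.12, 'dog': 0.08,
         'ran': 0.07, 'jumped': 0.05}

pool = arg_top_k(probs, 4)  # selection pool V_i for K=4
```

Note that the pool is a set of tokens, not of probability values; sampling then draws the next token from this restricted set.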

Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Formal Derivation of the Top-k Selection Pool
A language model is generating text and has calculated the following probabilities for the next potential token:
{'the': 0.45, 'a': 0.20, 'cat': 0.12, 'dog': 0.08, 'ran': 0.07, 'jumped': 0.05}. If the model is configured to sample its next choice from only the 4 most likely candidates, which set of tokens constitutes the selection pool?
Impact of Selection Pool Size on Text Generation
When generating text by sampling from a pool of the most probable candidate tokens, setting the pool size to 1 will produce the exact same output sequence as a method that always deterministically chooses the single token with the highest probability at every step.
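The claim above can be checked directly: with a pool of size 1, the only candidate is the argmax token, so sampling degenerates to greedy decoding. A minimal sketch, assuming the same dict-based probabilities as before (the helper `top_k_sample` is illustrative, not from the text):

```python
import random

def top_k_sample(probs, k, rng):
    """Sample one token from the K most probable candidates,
    weighted by their probabilities."""
    pool = sorted(probs, key=probs.get, reverse=True)[:k]
    weights = [probs[t] for t in pool]
    return rng.choices(pool, weights=weights, k=1)[0]

probs = {'over': 0.12, 'the': 0.35, 'a': 0.28, 'under': 0.05, 'quick': 0.20}
greedy = max(probs, key=probs.get)  # deterministic argmax choice

rng = random.Random(0)
# With k=1 the pool contains only the argmax token, so every draw matches it.
assert all(top_k_sample(probs, 1, rng) == greedy for _ in range(10))
```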
Mathematical Definition of Top-K Token Selection
Formal Derivation of the Top-k Selection Pool
A language model is generating text and needs to decide on the next token. It has calculated the following probabilities for a small set of possible tokens:
{'over': 0.12, 'the': 0.35, 'a': 0.28, 'under': 0.05, 'quick': 0.20}. If an operator is applied to this set to identify the K=3 tokens with the highest probability values, which set of tokens will be returned?
Analyzing the Impact of the 'K' Parameter on Token Selection
When generating the next token in a sequence, applying an operator that identifies the K items with the highest values, with the parameter K set to 1, will produce a different set of candidate tokens than simply selecting the single token with the highest probability.
Learn After
A language model is generating a sequence. At a specific step i, it computes the following probabilities for the next token over its vocabulary V = {'run', 'walk', 'jump', 'sit', 'sleep'}. Given a setting of K=3, which of the following sets correctly represents the selection pool V_i according to the formal definition?
Probabilities:
- Pr('run' | ...) = 0.15
- Pr('walk' | ...) = 0.40
- Pr('jump' | ...) = 0.05
- Pr('sit' | ...) = 0.35
- Pr('sleep' | ...) = 0.05
A developer is implementing the selection mechanism for a text generation model based on the formal definition V_i = argTopK_{y ∈ V} Pr(y | x, y_{<i}). For a vocabulary V = {'cat', 'dog', 'ran', 'sat'} and K=2, the model computes the next-token probabilities as: Pr('cat'|...) = 0.1, Pr('dog'|...) = 0.5, Pr('ran'|...) = 0.3, Pr('sat'|...) = 0.1. The developer's code returns the set {0.5, 0.3} as the selection pool V_i. What is the fundamental error in this output when compared to the formal definition?
Interpreting the Formal Definition of Top-k Selection
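The error probed by the question above can be made concrete in code: argTopK returns the top-K *tokens*, whereas the buggy code returns their probability *values*. A minimal sketch, assuming the probabilities from the question (variable names are illustrative):

```python
probs = {'cat': 0.1, 'dog': 0.5, 'ran': 0.3, 'sat': 0.1}
K = 2
ranked = sorted(probs, key=probs.get, reverse=True)

wrong_pool = {probs[t] for t in ranked[:K]}   # {0.5, 0.3}: probability values
correct_pool = set(ranked[:K])                # {'dog', 'ran'}: tokens, per argTopK
```

The formal definition requires a subset of the vocabulary V, so only the second result is a valid selection pool.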