1Cademy - Interpreting the Formal Definition of Top-k Selection

Learn Before

Formula for the Top-k Selection Pool

Short Answer

Interpreting the Formal Definition of Top-k Selection

A junior developer is implementing a text generation algorithm. At a specific step i, the model's vocabulary is V = {'the', 'a', 'cat', 'dog'} and the next-token probabilities are Pr('the'|...) = 0.4, Pr('a'|...) = 0.1, Pr('cat'|...) = 0.3, Pr('dog'|...) = 0.2. For K=2, the developer's code outputs the selection pool $V_i$ as {0.4, 0.3}. Based on the formal definition $\large V_i = \underset{y_i \in V}{\text{argTopK}} , \text{Pr}(y_i|\mathbf{x}, \mathbf{y}_{<i})$ explain the fundamental mistake in the developer's output.

Updated 2025-10-08

Contributors are:

Who are from:

Learn Before

Related