1Cademy - argTopK Function

Learn Before

Top-k Sampling

Definition

argTopK Function

The argTopK function is an operator that identifies the K items with the highest values from a given set. In the context of language models, it is applied to the probability distribution over the entire vocabulary to rank all possible next tokens and return the set of the K most probable candidates.

Updated 2025-10-07

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course

Learn After

Mathematical Definition of Top-K Token Selection
A language model is generating text and needs to decide on the next token. It has calculated the following probabilities for a small set of possible tokens: {'over': 0.12, 'the': 0.35, 'a': 0.28, 'under': 0.05, 'quick': 0.20}. If an operator is applied to this set to identify the K=3 tokens with the highest probability values, which set of tokens will be returned?
Analyzing the Impact of the 'K' Parameter on Token Selection
When generating the next token in a sequence, applying an operator that identifies the K items with the highest values with the parameter K set to 1 will produce a different set of candidate tokens than simply selecting the single token with the highest probability.
Formula for the Top-k Selection Pool

Learn Before

Related

Learn After