1Cademy - Renormalized probabilities in top-k sampling (k=3)

Learn Before

Output Stage in Top-k Sampling

Multiple Choice

A language model has calculated the probabilities for the next possible tokens in a sequence. The five most likely tokens are: 'the' (0.4), 'a' (0.2), 'on' (0.1), 'in' (0.05), and 'at' (0.05). If the model uses top-k sampling with k=3, so that only the three most likely candidates are kept, what is the renormalized probability distribution over the tokens that can still be sampled?
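The renormalization step can be checked with a few lines of Python. This is only a minimal sketch: the function name `top_k_renormalize` and the dictionary layout are assumptions made for illustration, not part of any particular library. It keeps the k most probable tokens and divides each kept probability by their sum (here 0.4 + 0.2 + 0.1 = 0.7).

```python
def top_k_renormalize(probs, k):
    """Keep the k most probable tokens and rescale them to sum to 1.

    Hypothetical helper for illustration; `probs` maps token -> probability.
    """
    # Sort tokens by probability, highest first, and keep the first k.
    top = sorted(probs.items(), key=lambda item: item[1], reverse=True)[:k]
    # The kept probabilities no longer sum to 1, so divide by their total.
    total = sum(p for _, p in top)
    return {token: p / total for token, p in top}


# Probabilities taken from the question above.
probs = {"the": 0.4, "a": 0.2, "on": 0.1, "in": 0.05, "at": 0.05}
renormalized = top_k_renormalize(probs, k=3)

for token, p in renormalized.items():
    print(f"{token}: {p:.3f}")
# the: 0.571
# a: 0.286
# on: 0.143
```

The tokens 'in' and 'at' fall outside the top 3, so they receive probability 0 and can never be sampled; the remaining three keep their relative proportions (4 : 2 : 1).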

Updated 2025-10-05
