Learn Before
A language model is generating the next word and has calculated the following probabilities for the most likely tokens: Token A (0.40), Token B (0.30), Token C (0.15), Token D (0.10), and Token E (0.05). If the model uses a sampling strategy where it forms a candidate pool by including the most probable tokens until their cumulative probability just exceeds a threshold of 0.75, what will be the size of this candidate pool?
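The cumulative-threshold rule in the question is top-p (nucleus) sampling: sort tokens by probability and take the smallest prefix whose cumulative probability exceeds the threshold. A minimal sketch of that rule, using exact `Fraction` arithmetic so the comparison at the threshold is unambiguous (the function name and the explicit probability list are illustrative, not from the original):

```python
from fractions import Fraction

def top_p_pool_size(probs, threshold):
    """Size of the smallest prefix of the descending-sorted
    probabilities whose cumulative sum strictly exceeds threshold."""
    cumulative = Fraction(0)
    for i, p in enumerate(sorted(probs, reverse=True), start=1):
        cumulative += p
        if cumulative > threshold:
            return i
    return len(probs)  # threshold never exceeded: keep everything

# Token A..E probabilities from the question, threshold p = 0.75
probs = [Fraction(n, 100) for n in (40, 30, 15, 10, 5)]
print(top_p_pool_size(probs, Fraction(75, 100)))  # → 3
```

Tracing the loop: 0.40 + 0.30 = 0.70 does not exceed 0.75, so Token C is added, giving 0.85 > 0.75 and a candidate pool of size 3.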
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Mathematical Representation of the Top-p Candidate Pool
Relationship Between Threshold and Candidate Pool Size
A language model is generating the next token in two different contexts. In both contexts, the model uses a sampling method where it forms a candidate pool by selecting the smallest set of the most probable tokens whose cumulative probability exceeds a threshold of 0.9.
- Context A: The single most probable token has a probability of 0.95.
- Context B: The ten most probable tokens each have a probability of 0.09.
How will the size of the candidate token pool compare between these two contexts?
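The two contexts above can be checked with the same cumulative-threshold rule. One subtlety: ten tokens of 0.09 sum to exactly 0.90, which does not *strictly* exceed 0.9, so under a strict reading an eleventh token is pulled in; the tail distributions below (a 0.05 remainder in Context A, ten 0.01 tokens in Context B) are illustrative assumptions, since the question does not specify them. Either way, Context B's pool is roughly ten times larger than Context A's:

```python
from fractions import Fraction

def top_p_pool_size(probs, threshold):
    """Size of the smallest prefix of the descending-sorted
    probabilities whose cumulative sum strictly exceeds threshold."""
    cumulative = Fraction(0)
    for i, p in enumerate(sorted(probs, reverse=True), start=1):
        cumulative += p
        if cumulative > threshold:
            return i
    return len(probs)

p = Fraction(9, 10)  # threshold 0.9

# Context A: top token has probability 0.95 (tail assumed as one 0.05 token)
context_a = [Fraction(95, 100), Fraction(5, 100)]
print(top_p_pool_size(context_a, p))  # → 1

# Context B: ten tokens at 0.09 each (tail assumed as ten 0.01 tokens)
context_b = [Fraction(9, 100)] * 10 + [Fraction(1, 100)] * 10
print(top_p_pool_size(context_b, p))  # → 11
```

This illustrates the relationship the card targets: a peaked distribution (Context A) yields a tiny candidate pool, while a flat distribution (Context B) yields a much larger one, because many tokens are needed before the cumulative probability clears the same threshold.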