1Cademy - A language model predicts the probabilities for the next word in a sequence. The top four candidates are: happy (0.4), sad (0.2), angry (0.1), and joyful (0.05). A decoding method is applied that restricts the possible choices to only the top three candidates (happy, sad, angry). After the probabilities for this smaller set are rescaled to form a new, valid probability distribution, what is the new probability for the word sad?

Learn Before

Probability Renormalization Formula for Restricted Vocabulary Sampling

Multiple Choice

A language model predicts the probabilities for the next word in a sequence. The top four candidates are: 'happy' (0.4), 'sad' (0.2), 'angry' (0.1), and 'joyful' (0.05). A decoding method is applied that restricts the possible choices to only the top three candidates ('happy', 'sad', 'angry'). After the probabilities for this smaller set are rescaled to form a new, valid probability distribution, what is the new probability for the word 'sad'?

Updated 2025-09-28

Contributors are:

Who are from:

Learn Before

Related