Multiple Choice

A language model has calculated the probabilities for the next possible tokens in a sequence. The five most likely tokens are: 'the' (0.4), 'a' (0.2), 'on' (0.1), 'in' (0.05), and 'at' (0.05). If the model uses a selection process in which only the top 3 candidates are considered (k=3), what is the renormalized probability distribution over the tokens the model samples from?
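The selection process described in the question is commonly called top-k sampling: keep the k most likely tokens, discard the rest, and divide each kept probability by the sum of the kept probabilities so they again sum to 1. The following is a minimal sketch of that arithmetic, assuming the probabilities are held in a plain dictionary; the function name `top_k_renormalize` is hypothetical and is not taken from any particular library.

```python
def top_k_renormalize(probs, k):
    """Keep the k highest-probability tokens and rescale them to sum to 1."""
    # Sort tokens by probability, highest first, and keep the first k.
    top = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:k]
    # The kept mass is the normalizing constant (0.4 + 0.2 + 0.1 = 0.7 here).
    total = sum(p for _, p in top)
    return {tok: p / total for tok, p in top}


# Probabilities from the question; the remaining mass belongs to other tokens.
probs = {"the": 0.4, "a": 0.2, "on": 0.1, "in": 0.05, "at": 0.05}
renormalized = top_k_renormalize(probs, k=3)

for tok, p in renormalized.items():
    print(f"{tok}: {p:.3f}")
```

With k=3 the kept mass is 0.7, so the renormalized values are 0.4/0.7, 0.2/0.7, and 0.1/0.7, that is 4/7, 2/7, and 1/7 (about 0.571, 0.286, and 0.143). The tokens 'in' and 'at' are excluded and can no longer be sampled.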

Updated 2025-10-05

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science