Learn Before
Inferring Decoding Parameters
A language model is generating the next word after the phrase 'The cat sat on the'. The model's internal calculations produce the initial probabilities for the top five potential words as shown below. The model uses a decoding strategy where only a fixed number of the most likely candidates are considered, and all others are discarded. A final word is then randomly sampled from this smaller group. Analyze the scenario and determine the minimum possible value for the parameter that sets this fixed number of candidates. Explain your reasoning.
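The strategy described above is top-k sampling: keep only the k most probable candidates, renormalize their probabilities, and sample from that smaller group. A minimal sketch, using hypothetical probabilities for illustration (the original item's probability table is not reproduced here):

```python
import random

def top_k_sample(probs, k):
    """Keep the k most probable words, renormalize, and sample one."""
    # Rank candidates by probability, highest first
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    kept = ranked[:k]  # discard everything outside the top k
    total = sum(p for _, p in kept)
    # Renormalize the surviving probabilities so they sum to 1
    adjusted = {w: p / total for w, p in kept}
    words, weights = zip(*adjusted.items())
    return random.choices(words, weights=weights)[0]

# Hypothetical distribution for "The cat sat on the ..."
probs = {"mat": 0.35, "sofa": 0.25, "floor": 0.20, "bed": 0.12, "roof": 0.08}
print(top_k_sample(probs, k=3))  # one of 'mat', 'sofa', 'floor'
```

With k=1 this reduces to greedy decoding (always the single most likely word); larger k admits more variety into the sample.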
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is generating the next word in a sequence and has calculated the initial probabilities for six potential words: 'the' (0.40), 'a' (0.25), 'an' (0.15), 'some' (0.10), 'any' (0.05), and 'every' (0.05). The system uses a decoding strategy where it only considers the top 4 most likely candidates for the final selection. After discarding the other candidates, the probabilities of the remaining words are adjusted to sum to 1. What is the adjusted probability for the word 'a'?
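The renormalization asked for here can be checked directly: keeping the top 4 candidates leaves a surviving mass of 0.40 + 0.25 + 0.15 + 0.10 = 0.90, so 'a' is adjusted to 0.25 / 0.90 ≈ 0.278. A short sketch of the computation:

```python
probs = {"the": 0.40, "a": 0.25, "an": 0.15,
         "some": 0.10, "any": 0.05, "every": 0.05}
k = 4

# Keep the top-k candidates by probability
kept = dict(sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:k])
total = sum(kept.values())        # 0.90, the surviving probability mass
adjusted_a = kept["a"] / total    # 0.25 / 0.90
print(round(adjusted_a, 3))       # 0.278
```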
A text generation model uses a method to select the next word where it only considers a small, fixed number of the most probable options. Arrange the following steps to accurately describe the sequence of this method.