1Cademy - Effect of Candidate Pool Size on Text Generation

Learn Before

Top-k Sampling

Case Study

Effect of Candidate Pool Size on Text Generation

A language model is tasked with completing the sentence 'The sun began to set over the...'. It uses a decoding strategy where, at each step, it considers only a fixed number ('k') of the most likely next words to choose from. Below are two outputs generated by the model using two different settings for 'k'.

Output A: '...ocean. The waves crashed on the shore. The sky turned orange.'

Output B: '...crystal spires. The air hummed with forgotten magic. The sky bled purple.'

Analyze the two outputs. Which output was likely generated using a very small value for 'k' (e.g., k=3), and which was likely generated using a much larger value (e.g., k=50)? Justify your reasoning by explaining the relationship between the size of the candidate word pool and the characteristics of the generated text.

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Learn Before

Related