1Cademy - Analyzing Suboptimal Text Generation

Learn Before

Iterative Application of Argmax for Next Token Prediction

Short Answer

Analyzing Suboptimal Text Generation

An autoregressive language model generates text by selecting the single most probable token at each step, based on the sequence generated so far. The model is given the prompt 'The best restaurant in town is known for its delicious food and' and generates the next token 'a', resulting in the sequence '...food and a'. A human might have preferred a completion like '...food and amazing atmosphere'.

Explain how the model's step-by-step selection process could lead to the choice of 'a', even though it results in a less coherent overall sentence.

0

1

Updated 2025-10-10

Contributors are:

Who are from:

Current Context	Next Token	Probability
'The dog'	'barked'	0.7
'The

Learn Before

Related