Correcting a Step in a Generation Process
In a text generation process, a fast draft model proposes the token sequence ['over', 'the', 'lazy']. A more accurate evaluation model accepts the first two tokens, 'over' and 'the', but rejects the third token, 'lazy'. To continue generating the text, which model should be used to determine the very next token, and based on what preceding text sequence?
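The verify-and-correct step described in the question can be sketched in code. The snippet below is a minimal toy illustration, not a real implementation: `target_next_token` is a hypothetical stand-in for the accurate model's greedy prediction, and the token tables are invented. It shows that after a rejection, the accurate model itself supplies the next token, conditioned on the original prefix plus only the accepted draft tokens.

```python
# Toy sketch of one speculative-decoding verification step.
# target_next_token is a hypothetical stand-in for the large,
# accurate model; the lookup table is fabricated for illustration.

def target_next_token(prefix):
    lookup = {
        ("The", "quick", "brown", "fox", "jumps"): "over",
        ("The", "quick", "brown", "fox", "jumps", "over"): "the",
        # The target disagrees with the draft's third token 'lazy':
        ("The", "quick", "brown", "fox", "jumps", "over", "the"): "quick",
    }
    return lookup.get(tuple(prefix), "<eos>")

def verify_step(prefix, draft_tokens):
    accepted = []
    for tok in draft_tokens:
        if target_next_token(prefix + accepted) == tok:
            accepted.append(tok)  # target agrees: keep the draft token
        else:
            # Rejection: the target model determines the next token,
            # conditioned on the prefix plus the accepted draft tokens.
            return accepted + [target_next_token(prefix + accepted)]
    # All draft tokens accepted: the target emits one more token for free.
    return accepted + [target_next_token(prefix + accepted)]

prefix = ["The", "quick", "brown", "fox", "jumps"]
print(verify_step(prefix, ["over", "the", "lazy"]))
# → ['over', 'the', 'quick']
```

Note that the corrected token comes from the accurate model's distribution over the sequence ending in the accepted tokens 'over' and 'the', never from the draft model's rejected continuation.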
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
In a single step of a text generation process, a small, fast model proposes the candidate token sequence ['on', 'the', 'mat'] to extend the existing text ['The', 'cat', 'sat']. A larger, more accurate model then evaluates these candidates. The larger model accepts 'on' and 'the', but rejects 'mat'. After this rejection, the larger model's own prediction for the next token is 'rug'. What is the complete sequence of new tokens added to the text in this step?
In a speculative decoding process, a draft model proposes a 3-token sequence, and the main evaluation model accepts all three tokens. Arrange the following actions in the correct chronological order to describe how the very next token is generated.