1Cademy - Set of Tokens Generated in a Single Speculative Decoding Step

Definition

In a single step of speculative decoding, the newly generated tokens that extend the existing sequence consist of the consecutively accepted draft tokens followed by one final token from the verification model. Formally, the set is $\lbrace\hat{y}_{i+1}, \ldots, \hat{y}_{i+n_a}, \bar{y}_{i+n_a+1}\rbrace$, where $\lbrace\hat{y}_{i+1}, \ldots, \hat{y}_{i+n_a}\rbrace$ are the $n_a$ accepted draft tokens and $\bar{y}_{i+n_a+1}$ is the token generated by the verification model, so a step appends $n_a + 1$ tokens in total. A simplified notation for the same set is $\lbrace\hat{y}, \ldots, \hat{y}, \bar{y}\rbrace$, which highlights that it is made up of accepted draft tokens plus a single verification-model token.
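The composition of this set can be illustrated with a minimal sketch. It assumes a greedy acceptance rule (a draft token is accepted only if it equals the verification model's own next-token choice), which the definition above does not specify. The names `speculative_step` and `toy_verifier` are hypothetical, and the sequential loop is for readability only; practical systems typically verify all draft tokens in one parallel pass.

```python
from typing import Callable, List, Sequence


def speculative_step(
    prefix: Sequence[int],
    draft_tokens: Sequence[int],
    verifier_next: Callable[[Sequence[int]], int],
) -> List[int]:
    """Return the tokens appended in one step: n_a accepted drafts + 1 verifier token.

    Assumption: greedy acceptance, i.e. a draft token is accepted only when
    it matches the verifier's own prediction for that position.
    """
    context = list(prefix)
    new_tokens: List[int] = []
    for drafted in draft_tokens:
        verified = verifier_next(context)
        if verified != drafted:
            # First mismatch: the verifier's token takes this position
            # and the remaining draft tokens are discarded.
            new_tokens.append(verified)
            return new_tokens
        # Match: keep the draft token (a y-hat) and extend the context.
        new_tokens.append(drafted)
        context.append(drafted)
    # Every draft token was accepted: the verifier still contributes
    # one final token (the y-bar at position i + n_a + 1).
    new_tokens.append(verifier_next(context))
    return new_tokens


def toy_verifier(context: Sequence[int]) -> int:
    """Hypothetical stand-in for a verification model: predicts last token + 1."""
    return context[-1] + 1


if __name__ == "__main__":
    # Drafts 4 and 5 are accepted (n_a = 2); 9 is rejected and replaced by 6.
    print(speculative_step([1, 2, 3], [4, 5, 9, 7], toy_verifier))  # [4, 5, 6]
```

In every case the returned list ends with exactly one verifier token, so its length is $n_a + 1$, including the case $n_a = 0$ where the first draft token is rejected.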

Updated 2026-06-28

References

Reference of Foundations of Large Language Models Course
