1Cademy - Visual Representation of a Speculative Decoding Steps Output

Learn Before

Structure of the Full Sequence After a Speculative Decoding Step

Example

Visual Representation of a Speculative Decoding Step's Output

This diagram illustrates the composition of the output sequence after a single step of speculative decoding. The sequence is formed by three distinct parts: 1. The initial Context, represented as $[\mathbf{x}, \mathbf{y}_{\le i}]$ , which includes the prompt and all previously confirmed tokens. 2. A sequence of $n_a$ accepted draft tokens, $\hat{y}_{i+1} \dots \hat{y}_{i+n_a}$ , which were predicted by the draft model. 3. One final token, $\bar{y}_{i+n_a+1}$ , which is predicted by the verification model to extend the sequence.

Updated 2026-06-29

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course

Learn Before

Related