1Cademy - Mathematical Formulation of Verification Model Evaluation in Speculative Decoding

Learn Before

Formula

Mathematical Formulation of Verification Model Evaluation in Speculative Decoding

In speculative decoding, the verification model evaluates the entire sequence of $\tau$ draft tokens, $\{\hat{y}_{i+1}, \ldots, \hat{y}_{i+\tau}\}$ , in a single, parallel step. This is achieved by computing the conditional probability for each draft token using the verification model’s distribution, $\Pr_p$ . The probability for each token $\hat{y}_{i+t}$ is conditioned on the original prefix $[\mathbf{x}, \mathbf{y}_{\le i}]$ and all preceding draft tokens $\hat{y}_{i+1}, \ldots, \hat{y}_{i+t-1}$ . The set of probabilities computed is: $\Big\{ \Pr_p(\hat{y}_{i+1} \mid \mathbf{x}, \mathbf{y}_{\le i}), ; \ldots, ; \Pr_p(\hat{y}_{i+\tau} \mid \mathbf{x}, \mathbf{y}_{\le i}, \hat{y}_{i+1}, \ldots, \hat{y}_{i+\tau-1}) \Big\}$ .

Updated 2026-06-30

Contributors are:

Who are from:

References

Learn Before

Related

Learn After