Definition

Set of Accepted Draft Tokens

The notation $\{\hat{y}_{i+1}, \ldots, \hat{y}_{i+n_a}\}$ represents the set of draft tokens that have been accepted during a single step of speculative decoding. In this notation, $\hat{y}$ denotes a predicted token, the subscript $i$ refers to the index of the last token in the existing confirmed sequence, and $n_a$ is the total number of draft tokens that were consecutively accepted.
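As a rough illustration of the indexing, the sketch below computes the accepted set for one step. It assumes a simple greedy acceptance rule (a draft token is accepted only if it equals the token chosen by the verifying model at the same position), which the definition above does not specify. The function name `accepted_draft_tokens` and the example tokens are hypothetical.

```python
def accepted_draft_tokens(draft, verified):
    """Return the longest prefix of `draft` that matches `verified`.

    Assumption: greedy matching stands in for the acceptance test;
    other acceptance rules would change only the comparison below.
    """
    n_a = 0
    for d, v in zip(draft, verified):
        if d != v:
            break  # acceptance is consecutive, so stop at the first mismatch
        n_a += 1
    return draft[:n_a]


confirmed = ["The", "cat"]              # y_1 .. y_i, so i = 2
draft = ["sat", "on", "a", "mat"]       # tokens proposed by the draft model
verified = ["sat", "on", "the", "mat"]  # tokens chosen by the verifying model

accepted = accepted_draft_tokens(draft, verified)
i = len(confirmed)
n_a = len(accepted)
positions = list(range(i + 1, i + n_a + 1))  # indices i+1 .. i+n_a

print(accepted, n_a, positions)
```

In this example the third draft token is rejected, so the later match on "mat" does not count: only the first two draft tokens form the accepted set, and they occupy positions $i+1$ and $i+2$.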

Updated 2025-10-07

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences