Set of Accepted Draft Tokens
The notation represents the set of draft tokens that have been accepted during a single step of speculative decoding. In this notation, denotes a predicted token, the subscript refers to the index of the last token in the existing confirmed sequence, and is the total number of draft tokens that were consecutively accepted.

0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Post-Acceptance Token Generation in Speculative Decoding
Set of Accepted Draft Tokens
Set of Tokens Generated in a Single Speculative Decoding Step
In a text generation process designed for speed, an initial sequence
['The', 'cat', 'sat']is extended. A fast proposal mechanism suggests the candidate tokens['on', 'the', 'mat']. A more accurate, final-check mechanism then processes these candidates and produces the final, complete sequence:['The', 'cat', 'sat', 'on', 'the', 'rug']. Based on this outcome, how many of the candidate tokens were accepted before the final-check mechanism generated its own token?In a text generation process that uses a fast model to propose candidate tokens and a more accurate main model to check them, a single generation step has just completed. Arrange the following components to correctly represent the structure of the full, updated text sequence.
Visual Representation of a Speculative Decoding Step's Output
Analyzing a Speculative Generation Step
Learn After
In a speculative decoding process, the sequence of confirmed tokens before a specific step is
['The', 'quick', 'brown', 'fox']. After this single step completes, the new confirmed sequence is['The', 'quick', 'brown', 'fox', 'jumps', 'over', 'the']. Given this outcome, which of the following represents the set of accepted draft tokens, denoted as ?Sequence Construction in Speculative Decoding
A text generation process begins with the confirmed sequence
['The', 'quick', 'brown']. After a single generation step, the new confirmed sequence is['The', 'quick', 'brown', 'fox', 'jumps', 'over']. Based on this, evaluate the following statement: 'The set of accepted draft tokens, represented as , for this step is['fox', 'jumps', 'over'].'