Evaluating Proposed Tokens in a Generation Process
A text generation system uses a small 'draft' model and a large 'target' model to speed up output. The draft model proposes a sequence of tokens, and an acceptance-rejection mechanism decides whether to keep them. For each proposed token, analyze the provided probabilities and determine whether the token is (A) accepted outright or (B) subject to a probabilistic check. If it is subject to a probabilistic check, calculate the specific probability of rejection.
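The acceptance-rejection mechanism referred to here is the standard speculative-decoding check: accept outright when the target probability is at least the draft probability, otherwise accept with probability equal to their ratio. A minimal sketch, assuming per-token probabilities are given as floats (the function name and signature are illustrative, not from the original):

```python
import random

def accept_or_reject(p_target: float, q_draft: float) -> bool:
    """Acceptance check for one proposed token in speculative decoding.

    p_target: probability the target model assigns to the proposed token.
    q_draft:  probability the draft model assigns to the same token.
    """
    if p_target >= q_draft:
        # Case (A): the target model rates the token at least as highly
        # as the draft model did, so it is accepted outright.
        return True
    # Case (B): probabilistic check. Accept with probability
    # p_target / q_draft; reject with probability 1 - p_target / q_draft.
    return random.random() < p_target / q_draft
```

Note that rejection is never automatic when the draft probability exceeds the target probability; the token still survives the check with probability p_target / q_draft.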
Tags
Ch.5 Inference - Foundations of Large Language Models
Computing Sciences
Application in Bloom's Taxonomy
Related
Determining the Maximum Number of Consecutively Accepted Tokens in Speculative Decoding
Role of the Uniformly Distributed Random Variable in Speculative Decoding
In a text generation process, a small, fast model proposes the next token as 'learning' with a probability of 0.8. A larger, more accurate model then evaluates this same token and assigns it a probability of 0.6. Based on the standard acceptance-rejection procedure used in this context, what is the outcome for the token 'learning'?
Evaluating Proposed Tokens in a Generation Process
In a text generation process that uses a draft model and a target model, if the draft model assigns a higher probability to a proposed token than the target model does, that token is automatically rejected.
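For concreteness, the numbers in the related question above (draft probability 0.8, target probability 0.6 for the token 'learning') work out as follows. A sketch of the standard calculation, with variable names chosen for illustration:

```python
# Numbers from the related question: the draft model proposes 'learning'
# with probability 0.8; the target model assigns it probability 0.6.
q_draft = 0.8
p_target = 0.6

# Since p_target < q_draft, the token is neither accepted outright nor
# automatically rejected; it is subject to a probabilistic check.
accept_prob = p_target / q_draft   # 0.6 / 0.8 = 0.75
reject_prob = 1.0 - accept_prob    # 0.25
```

So 'learning' is kept with probability 0.75 and rejected with probability 0.25, which also shows why the true/false statement above is false: a higher draft probability does not trigger automatic rejection.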