Activity (Process)

Rejection Criterion in Speculative Sampling

In speculative sampling, a generated token y^i+t\hat{y}_{i+t} is considered for rejection if its probability under the draft model, q(y^i+t)q(\hat{y}_{i+t}), is greater than its probability under the target model, p(y^i+t)p(\hat{y}_{i+t}). When this condition is met, the token is not rejected outright, but rather with a specific probability calculated as $1 - \frac{p(\hat{y}{i+t})}{q(\hat{y}{i+t})}$.

Image 0

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related