Rejection Criterion in Speculative Sampling
In speculative sampling, a generated token is considered for rejection if its probability under the draft model, , is greater than its probability under the target model, . When this condition is met, the token is not rejected outright, but rather with a specific probability calculated as $1 - \frac{p(\hat{y}{i+t})}{q(\hat{y}{i+t})}$.

0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Acceptance-Rejection Mechanism for Speculative Decoding
Inequality Constraint for Predicted Future Value Functions ()
Condition for Rejecting Speculation
Consider a text generation system that uses a fast, approximate model to propose a potential future word. For each proposed word, a more accurate but slower model also calculates a probability. Suppose at a certain step
i, the fast model predicts the next word will be 'universe' (represented as ). The fast model's confidence in this specific prediction is calculated and denoted as . Based on this information, what is the most accurate interpretation of the value 0.8?In the context of a system that generates sequences of values (like words in a sentence), the expression is often used. Match each component of this expression to its correct description.
Rejection Criterion in Speculative Sampling
Interpreting Model Predictions
Acceptance-Rejection Mechanism for Speculative Decoding
Evaluation Model for Predicted Sequences
Inequality Constraint for Predicted Future Value Functions ()
A text-generation model has produced the sequence 'The quick brown fox'. The model is now at the 4th position (after 'fox') and is predicting the next word in the sequence. It predicts the word 'jumps' will appear at the 5th position. Which of the following expressions correctly represents the probability assigned by the model to this specific prediction?
In the context of a model generating a sequence of values, match each component of the notation
p(ŷ_{i+t})to its correct description.Rejection Criterion in Speculative Sampling
Interpreting a Weather Forecasting Model's Output
Learn After
In a text generation process, a small 'draft' model proposes the next token, 'evolution'. The probability assigned to this token by the draft model is 0.8. A larger, more accurate 'target' model then evaluates the same token and assigns it a probability of 0.6. Based on this information, what is the probability that the token 'evolution' will be rejected?
In a speculative sampling process, a draft model proposes four potential next tokens. A more powerful target model then evaluates the probability of each token. Given the probabilities below, which token has the highest probability of being rejected?
Analyzing Speculative Sampling Acceptance