Definition

Verification Model in Speculative Decoding

The verification model is the full-sized, accurate language model whose inference process is being accelerated. Its role is to efficiently check the correctness of the token sequence proposed by the draft model. It can evaluate these tokens in parallel. If the draft sequence is incorrect, the verification model discards the invalid tokens and is then used to generate the correct tokens itself before the process continues.

0

1

Updated 2026-05-05

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Learn After