Activity (Process)

Token Sampling from a Conditional Probability Distribution

In autoregressive text generation, after computing the conditional probability distribution for the next token, $\overline{\text{Pr}}(y_i|\mathbf{x}, \mathbf{y}_{<i})$, the next step is to draw a sample from it. This sampling process, which selects a specific token $\bar{y}_i$, is formally expressed as drawing from the distribution: $\bar{y}_i \sim \overline{\text{Pr}}(y_i|\mathbf{x}, \mathbf{y}_{<i})$.
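The sampling step can be illustrated with a minimal sketch. It assumes the next-token distribution is already available as a mapping from tokens to probabilities; the function name `sample_next_token` and the toy vocabulary are hypothetical and chosen only for illustration, and a real system would typically sample over the model's full vocabulary.

```python
import random


def sample_next_token(probs, rng=None):
    """Draw one token from a {token: probability} mapping.

    `probs` stands in for the conditional next-token distribution;
    `rng` is an optional random.Random instance for reproducibility.
    """
    rng = rng or random
    tokens = list(probs)
    weights = [probs[t] for t in tokens]
    # random.choices draws with replacement, proportionally to the weights.
    return rng.choices(tokens, weights=weights, k=1)[0]


# Hypothetical distribution over a three-token vocabulary (assumed values).
next_token_probs = {"cat": 0.6, "dog": 0.3, "fish": 0.1}

rng = random.Random(0)  # fixed seed so repeated runs give the same draw
sampled = sample_next_token(next_token_probs, rng)
print(sampled)
```

Because the draw is random, repeated calls can return different tokens; over many draws each token should appear roughly in proportion to its probability.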

Updated 2026-05-02

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences
