Formula for Token Sampling in Autoregressive Models
In autoregressive models, the selection of the next token, $y_{i+1}$, is formally represented as drawing a sample from the model's conditional probability distribution. This is expressed by the formula $y_{i+1} \sim \Pr(y \mid x, y_1, \ldots, y_i)$. This notation signifies that the token is sampled from the probability distribution over all possible tokens $y$, conditioned on the input context $x$ and the sequence of previously generated tokens $y_1, \ldots, y_i$. The context of preceding tokens, $y_1, \ldots, y_i$, is sometimes written more compactly as $y_{<i+1}$.
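As a concrete illustration, sampling a next token from such a conditional distribution can be sketched in a few lines of Python. The vocabulary and probabilities below are invented for illustration; in a real model they would be computed from the input context and the previously generated tokens.

```python
import random

# Hypothetical next-token distribution produced by a model for some context.
# These tokens and probabilities are made up for illustration only.
distribution = {"the": 0.5, "a": 0.3, "an": 0.15, "this": 0.05}

tokens = list(distribution)
weights = list(distribution.values())

# Sampling the next token: each token is drawn in proportion to its
# conditional probability, so repeated runs can yield different tokens.
next_token = random.choices(tokens, weights=weights, k=1)[0]
print(next_token)
```

Because the draw is random rather than a fixed argmax, running this repeatedly produces "the" most often but occasionally yields the lower-probability tokens as well.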

Tags
Ch.5 Inference - Foundations of Large Language Models
Computing Sciences
Related
Applying Token Sampling in Text Generation
An autoregressive language model has processed the input sequence 'The cat sat on the' and has calculated the following conditional probability distribution for the next token: P('mat'|context) = 0.6, P('rug'|context) = 0.3, P('floor'|context) = 0.08, P('sky'|context) = 0.02. If the model then samples a token from this distribution, which of the following statements is most accurate?
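A minimal sketch of the scenario in this question, using Python's standard library. The token strings and probabilities are taken directly from the question; drawing many samples shows that 'mat' is the most frequent outcome but not the guaranteed one.

```python
import random
from collections import Counter

random.seed(42)  # fixed seed so this sketch is reproducible

# Conditional distribution from the question: P(token | 'The cat sat on the').
probs = {"mat": 0.6, "rug": 0.3, "floor": 0.08, "sky": 0.02}
tokens, weights = zip(*probs.items())

# Draw many samples: 'mat' comes up most often (about 60% of draws),
# but any token with nonzero probability can be selected on a given draw.
counts = Counter(random.choices(tokens, weights=weights, k=10_000))
print(counts.most_common())
```

The empirical frequencies track the stated probabilities, which is the key point: sampling favors high-probability tokens without excluding the others.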
In autoregressive text generation, after the model computes the conditional probability distribution for the next token, the sampling process does not always select the token with the highest probability score; it draws a token at random according to the distribution, so lower-probability tokens can also be chosen.
Learn After
Sequence Extension with a Sampled Token
An autoregressive language model has generated the sequence of tokens: 'The quick brown fox'. It is now about to generate the next token. Which expression accurately describes how the model will select this next token?
An autoregressive model selects the next token by sampling from a conditional probability distribution, represented by the formula $y_{i+1} \sim \Pr(y \mid x, y_1, \ldots, y_i)$. Match each component of this formula to its correct description.
Explaining Model Output Variability