Sequence Extension with a Sampled Token
In autoregressive text generation, a new sequence is formed at each step by appending the newly generated token to the existing sequence. If a token ȳ_t is sampled at step t, the new sequence, denoted y_{<t+1} (or sometimes ȳ), is constructed by concatenating the preceding sequence y_{<t} with this sampled token. This process is formally represented by the equation:

y_{<t+1} = [y_{<t}, ȳ_t]
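The step above can be sketched in a few lines of Python. The vocabulary and the probability values below are purely illustrative stand-ins for a model's conditional distribution Pr(· | y_{<t}); the function names are hypothetical, not from any particular library.

```python
import random

def extend_sequence(prev_tokens, sampled_token):
    # y_{<t+1} = [y_{<t}, ȳ_t]: concatenate the prefix with the new token
    return prev_tokens + [sampled_token]

def sample_next_token(vocab, probs, rng):
    # Draw one token from a (toy) conditional distribution Pr(· | y_{<t})
    return rng.choices(vocab, weights=probs, k=1)[0]

rng = random.Random(0)
vocab = ["The", "dog", "chased", "the", "ball", "."]
seq = ["The", "dog", "chased", "the"]       # y_{<t}
probs = [0.01, 0.02, 0.02, 0.05, 0.85, 0.05]  # illustrative Pr(· | y_{<t})
tok = sample_next_token(vocab, probs, rng)    # ȳ_t
seq = extend_sequence(seq, tok)               # y_{<t+1}
print(seq)
```

The extended sequence then becomes the conditioning context for the next generation step.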
Tags: Ch.5 Inference - Foundations of Large Language Models; Foundations of Large Language Models Course; Computing Sciences
Related
- An autoregressive language model has generated the sequence of tokens 'The quick brown fox' and is about to generate the next token. Which expression accurately describes how the model will select this next token?
- An autoregressive model selects the next token by sampling from a conditional probability distribution, ȳ_t ~ Pr(· | y_{<t}). Match each component of this formula to its correct description.
- Explaining Model Output Variability
- Candidate Set in Sampling-Based Decoding
- In an autoregressive text generation process, the sequence generated so far is 'The dog chased the'. At the current step, the model generates and selects the token 'ball'. What is the new, extended sequence that will be used as the basis for generating the subsequent token?
- An autoregressive model is generating a sequence. It begins with the single token y_1 = 'The'. In the next step, it samples the token ȳ_2 = 'cat'. Following that, it samples the token ȳ_3 = 'sat'. What is the resulting sequence formed after these two sampling steps?
- Formal Representation of Sequence Extension