Learn Before
Probability Renormalization Formula for Top-k Sampling
In top-k sampling, after identifying the pool of the k most probable tokens V_k, their probabilities are renormalized to form a new distribution that sums to 1. The renormalized probability of a token w from this pool is calculated by dividing its original probability by the sum of the original probabilities of all tokens in the pool:

P'(w) = P(w) / Σ_{w' ∈ V_k} P(w')

This ensures that the new probabilities for the tokens in V_k sum to 1.
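The renormalization step can be sketched in a few lines of Python. This is a minimal illustration, not a production implementation; the function name `top_k_renormalize` and the example probabilities (taken from the related question below) are assumptions for demonstration.

```python
def top_k_renormalize(probs, k):
    """Keep the k most probable tokens and renormalize their probabilities.

    probs: dict mapping token -> original probability P(w)
    k: number of tokens to keep in the pool V_k
    Returns a dict mapping each kept token to P(w) / sum of P over the pool.
    """
    # Select the k tokens with the highest original probability.
    pool = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:k]
    # Divide each kept probability by the sum over the kept pool.
    total = sum(p for _, p in pool)
    return {token: p / total for token, p in pool}

# Example: five candidates, k = 3. The pool is {the, a, one}, whose
# original probabilities sum to 0.7, so each is divided by 0.7.
probs = {"the": 0.4, "a": 0.2, "one": 0.1, "his": 0.05, "her": 0.05}
renormalized = top_k_renormalize(probs, 3)
# 'the' -> 0.4/0.7 ≈ 0.571, 'a' -> 0.2/0.7 ≈ 0.286, 'one' -> 0.1/0.7 ≈ 0.143
```

Note that each kept token's probability can only grow (or stay the same), since it is divided by a pool total that is at most 1.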

Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Example of Top-k Sampling with k=3
Top-k Selection Pool
Probability Renormalization Formula for Restricted Vocabulary Sampling
Probability Renormalization Formula for Top-k Sampling
A language model is generating the next word in a sequence and has calculated the initial probabilities for the five most likely candidates:
'the' (0.4), 'a' (0.2), 'one' (0.1), 'his' (0.05), and 'her' (0.05). If the model uses a sampling strategy where it only considers the top 3 most likely candidates (k=3), what will be the new, rescaled probability distribution for this reduced set of candidates from which the final word will be sampled?
Arrange the following actions into the correct sequence that describes the process of selecting the next token in a text generation model using the top-k sampling method.
Analyzing Text Generation Outputs
Learn After
A language model predicts the next token and assigns the following probabilities to the most likely candidates: 'the' (0.4), 'a' (0.2), 'one' (0.1), and 'some' (0.05). If the model is configured to only consider the top 3 most probable tokens for the next step, what is the adjusted probability for the token 'a' after the probabilities are recalculated to sum to 1?
Calculating Renormalized Probability
True or False: When a model identifies a small group of the most likely next words and then recalculates their probabilities so that they sum to 1, the new, recalculated probability for any given word in that group will always be greater than or equal to its original probability.