Token Sampling from a Conditional Probability Distribution
In autoregressive text generation, after computing the conditional probability distribution for the next token, $\Pr(y_{i+1} \mid y_0, \ldots, y_i)$, the next step is to draw a sample from it. This sampling process, which selects a specific token $\hat{y}_{i+1}$, is formally expressed as drawing from the distribution:

$$\hat{y}_{i+1} \sim \Pr(\cdot \mid y_0, \ldots, y_i)$$
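As a minimal sketch of this step (assuming Python with NumPy; the vocabulary and probabilities below are invented for illustration):

```python
import numpy as np

# Hypothetical conditional distribution Pr(. | y_0, ..., y_i) over a
# toy four-token vocabulary (values invented for illustration).
vocab = ["the", "a", "cat", "dog"]
probs = np.array([0.5, 0.3, 0.15, 0.05])

# Drawing one sample selects a specific next token; repeated draws
# return each token at a rate proportional to its probability.
rng = np.random.default_rng(seed=0)
next_token = rng.choice(vocab, p=probs)
print(next_token)
```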

Related
Calculating Next-Token Probability
An autoregressive model is generating a sequence and has computed the following unnormalized scores (logits) for three candidate next tokens: Token A (3.0), Token B (1.0), and Token C (0.0). If a constant value of 10.0 is added to each of these three logits before the final probability normalization step, how will the resulting conditional probabilities for the tokens be affected?
An autoregressive language model calculates unnormalized scores (logits) for a set of candidate next tokens. These scores are then transformed into a probability distribution. What is the primary reason for applying an exponential function to each logit before the final normalization step?
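Both questions above turn on properties of the softmax function. A minimal sketch (my own illustration, using the logits 3.0, 1.0, 0.0 from the first question) shows that exponentiation maps every score, including zero or negative ones, to a positive value while preserving their order, and that adding a constant to all logits leaves the normalized probabilities unchanged:

```python
import numpy as np

def softmax(logits):
    # Exponentiate, then normalize so outputs are positive and sum to 1.
    exp_scores = np.exp(logits)
    return exp_scores / exp_scores.sum()

logits = np.array([3.0, 1.0, 0.0])  # Tokens A, B, C
shifted = logits + 10.0             # Same logits plus a constant

print(softmax(logits))   # approx. [0.8438 0.1142 0.0420]
print(softmax(shifted))  # identical: the constant cancels in the ratio
```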
Next Token Prediction Task
Using Temperature with Softmax to Control Randomness in Token Selection
A language model is generating text and has produced the sequence 'The sky is'. It then calculates the following probability distribution for the next potential token:
{'blue': 0.75, 'green': 0.15, 'bright': 0.08, 'falling': 0.02}. If the model is configured to always select the single token with the highest probability, which token will it choose next?
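This selection rule (greedy decoding) is a one-liner; a sketch using the distribution from the question:

```python
dist = {'blue': 0.75, 'green': 0.15, 'bright': 0.08, 'falling': 0.02}

# Greedy decoding: deterministically pick the highest-probability token.
next_token = max(dist, key=dist.get)
print(next_token)  # 'blue'
```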
Analyzing Token Selection Strategies
A language model is generating text and encounters the same input sequence on two separate occasions, producing two different probability distributions for the next token, shown below.
- Distribution A: {'meal': 0.90, 'dish': 0.05, 'surprise': 0.03, 'error': 0.02}
- Distribution B: {'soup': 0.30, 'stew': 0.25, 'salad': 0.22, 'dessert': 0.23}
Which of the following statements provides the most accurate analysis of these two distributions regarding the token selection process?
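One way to quantify how the two distributions differ is their entropy; a minimal sketch (my own illustration, assuming NumPy):

```python
import numpy as np

def entropy(dist):
    # Shannon entropy in bits: low for peaked, high for flat distributions.
    p = np.array(list(dist.values()))
    return float(-(p * np.log2(p)).sum())

dist_a = {'meal': 0.90, 'dish': 0.05, 'surprise': 0.03, 'error': 0.02}
dist_b = {'soup': 0.30, 'stew': 0.25, 'salad': 0.22, 'dessert': 0.23}

# A is sharply peaked, so sampling almost always returns 'meal';
# B is nearly uniform, so sampled outcomes vary widely between runs.
print(entropy(dist_a))  # ~0.62 bits
print(entropy(dist_b))  # ~1.99 bits
```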
To ensure the generated text is as coherent and factually accurate as possible, a language model must always select the single token with the highest probability from the distribution at each step of the generation process.
Temperature-Scaled Softmax for Renormalized Probability
A language model has calculated the following raw scores (logits) for the next potential token:
{'mat': 3.0, 'rug': 2.5, 'chair': 2.0, 'moon': -1.0}. To control the randomness of the output, a temperature parameter is applied to these scores before they are converted into a final probability distribution for sampling. Which of the following probability distributions most likely resulted from applying a low temperature (e.g., a value less than 1.0)?
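For intuition, here is a minimal temperature-scaled softmax (my own sketch; it assumes the common convention of dividing each logit by the temperature before exponentiating), applied to the logits above:

```python
import numpy as np

def softmax_with_temperature(logits, temperature):
    # Divide logits by the temperature, then apply softmax.
    # Low temperature (< 1.0) sharpens the distribution toward the
    # top-scoring token; high temperature (> 1.0) flattens it.
    scaled = np.array(logits) / temperature
    exp_scores = np.exp(scaled - scaled.max())  # subtract max for stability
    return exp_scores / exp_scores.sum()

logits = [3.0, 2.5, 2.0, -1.0]  # 'mat', 'rug', 'chair', 'moon'

print(softmax_with_temperature(logits, 1.0))  # moderate spread
print(softmax_with_temperature(logits, 0.5))  # sharply peaked on 'mat'
```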
Troubleshooting a Factual Chatbot's Output
You are configuring a text generation model for different tasks. Match each task with the description of the temperature setting that would be most appropriate to achieve the desired output.
A language model is calculating the next token's probability distribution over a set of four candidate tokens. The raw output scores (logits) for these tokens are: {Token A: 4.0, Token B: 3.8, Token C: 1.5, Token D: 1.2}. The current generation process uses a temperature parameter β = 1.0. A developer wants to modify the process to make the model's output less predictable and increase the likelihood of selecting Token B relative to Token A. Which of the following adjustments to the temperature parameter β would best achieve this goal?
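The relative likelihood of Token B versus Token A depends only on the logit gap and the temperature; a short sketch (my own illustration, again assuming the divide-by-temperature convention):

```python
import math

# Under temperature-scaled softmax, the ratio P(B) / P(A) equals
# exp((l_B - l_A) / beta): it rises toward 1 as beta increases,
# so a higher temperature narrows the gap between A and B.
l_a, l_b = 4.0, 3.8

for beta in (0.5, 1.0, 2.0, 4.0):
    print(beta, math.exp((l_b - l_a) / beta))
```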
Effect of Temperature on Probability Distributions
Parameter Tuning for Text Generation Tasks
You are tuning decoding for an internal "meeting-n...
You’re deploying an LLM to draft customer-facing i...
You’re building an internal “RFP response drafter”...
You’re implementing an LLM feature that generates ...
Post-incident analysis: fixing repetition and truncation by tuning decoding
Debugging Decoding: Balancing Determinism, Diversity, and Length in a Regulated Product
Selecting and Justifying a Decoding Policy for Two Production Use Cases
Choosing a Decoding Configuration Under Latency, Diversity, and Length Constraints
Release-readiness decision: decoding configuration for a customer-facing summarization feature
Decoding policy decision for a multilingual support assistant under safety, latency, and verbosity constraints
Learn After
Formula for Token Sampling in Autoregressive Models
Applying Token Sampling in Text Generation
An autoregressive language model has processed the input sequence 'The cat sat on the' and has calculated the following conditional probability distribution for the next token: P('mat'|context) = 0.6, P('rug'|context) = 0.3, P('floor'|context) = 0.08, P('sky'|context) = 0.02. If the model then samples a token from this distribution, which of the following statements is most accurate?
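The sampling behavior can be checked empirically; a minimal sketch (assuming NumPy) that draws many samples from the distribution in the question:

```python
import numpy as np

vocab = ['mat', 'rug', 'floor', 'sky']
probs = [0.6, 0.3, 0.08, 0.02]

# Any of the four tokens can be returned on a single draw; over many
# draws, each appears at a rate close to its probability, so 'mat' is
# the most likely outcome but not a guaranteed one.
rng = np.random.default_rng(seed=0)
samples = rng.choice(vocab, size=10_000, p=probs)
for token in vocab:
    print(token, (samples == token).mean())
```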
In autoregressive text generation, after the model computes the conditional probability distribution for the next token, the sampling process always selects the token with the highest probability score.