Temperature-Scaled Softmax for Renormalized Probability
To control the randomness in token selection, the probability distribution can be reshaped using a temperature parameter, β. The renormalized conditional probability of a token w, given the context c, is calculated by applying a temperature-scaled Softmax function to its logit, l_w, and normalizing over a restricted set of candidate tokens V'. The formula is:

$$\Pr(w \mid c) = \frac{\exp(l_w / \beta)}{\sum_{w' \in V'} \exp(l_{w'} / \beta)}$$

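The definition above can be sketched directly in code. This is a minimal illustration, not a production decoder: the function name and the dictionary-based logits are assumptions made for the example, and the max-logit subtraction is a standard numerical-stability trick that does not change the result.

```python
import math

def temperature_softmax(logits, beta=1.0, candidates=None):
    """Renormalize logits over a restricted candidate set V' at temperature beta.

    logits: dict mapping token -> raw score l_w
    beta: temperature; beta < 1 sharpens the distribution, beta > 1 flattens it
    candidates: restricted candidate set V' (defaults to all tokens)
    """
    if candidates is None:
        candidates = list(logits)
    # Subtract the max logit before exponentiating for numerical stability.
    m = max(logits[t] for t in candidates)
    exps = {t: math.exp((logits[t] - m) / beta) for t in candidates}
    z = sum(exps.values())
    return {t: e / z for t, e in exps.items()}
```

For example, `temperature_softmax({'mat': 3.0, 'rug': 2.5}, beta=0.5)` concentrates more mass on `'mat'` than the same call with `beta=1.0`, since each logit gap is effectively doubled before normalization.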
Tags
Ch.5 Inference - Foundations of Large Language Models
Computing Sciences
Related
Token Sampling from a Conditional Probability Distribution
A language model has calculated the following raw scores (logits) for the next potential token:
{'mat': 3.0, 'rug': 2.5, 'chair': 2.0, 'moon': -1.0}. To control the randomness of the output, a temperature parameter is applied to these scores before they are converted into a final probability distribution for sampling. Which of the following probability distributions most likely resulted from applying a low temperature (e.g., a value less than 1.0)?

Troubleshooting a Factual Chatbot's Output
You are configuring a text generation model for different tasks. Match each task with the description of the temperature setting that would be most appropriate to achieve the desired output.
Learn After
Token Sampling from a Conditional Probability Distribution
A language model is calculating the next token's probability distribution over a set of four candidate tokens. The raw output scores (logits) for these tokens are: {Token A: 4.0, Token B: 3.8, Token C: 1.5, Token D: 1.2}. The current generation process uses a temperature parameter β = 1.0. A developer wants to modify the process to make the model's output less predictable and increase the likelihood of selecting Token B relative to Token A. Which of the following adjustments to the temperature parameter β would best achieve this goal?

Effect of Temperature on Probability Distributions
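The effect asked about above can be checked numerically. Assuming the standard temperature-scaled Softmax in which each logit is divided by β, the ratio P(B)/P(A) equals exp((3.8 − 4.0)/β), which moves toward 1 as β grows; this sketch compares β = 1.0 and β = 2.0 for the logits in the question.

```python
import math

logits = {"A": 4.0, "B": 3.8, "C": 1.5, "D": 1.2}

def softmax(scores, beta):
    # Divide each logit by the temperature beta, then normalize.
    exps = {t: math.exp(v / beta) for t, v in scores.items()}
    z = sum(exps.values())
    return {t: e / z for t, e in exps.items()}

for beta in (1.0, 2.0):
    p = softmax(logits, beta)
    # Raising beta flattens the distribution, so P(B)/P(A) rises toward 1.
    print(beta, round(p["B"] / p["A"], 3))
```

With β = 1.0 the ratio is exp(−0.2) ≈ 0.819; with β = 2.0 it is exp(−0.1) ≈ 0.905, so a higher temperature both reduces predictability and raises Token B's likelihood relative to Token A.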
Parameter Tuning for Text Generation Tasks
You are tuning decoding for an internal "meeting-n...
You’re deploying an LLM to draft customer-facing i...
You’re building an internal “RFP response drafter”...
You’re implementing an LLM feature that generates ...
Post-incident analysis: fixing repetition and truncation by tuning decoding
Debugging Decoding: Balancing Determinism, Diversity, and Length in a Regulated Product
Selecting and Justifying a Decoding Policy for Two Production Use Cases
Choosing a Decoding Configuration Under Latency, Diversity, and Length Constraints
Release-readiness decision: decoding configuration for a customer-facing summarization feature
Decoding policy decision for a multilingual support assistant under safety, latency, and verbosity constraints