Effect of Temperature on Probability Distributions
A language model generates the following raw output scores (logits) for the next three possible tokens: {Token A: 3.0, Token B: 2.0, Token C: 1.0}. Explain how the final probability distribution for these tokens would differ if a temperature parameter of β = 0.5 is used compared to β = 2.0. In your explanation, describe the likely characteristics of the text that would be generated in each case (e.g., more predictable, more creative, etc.).
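The effect can be checked numerically. Below is a minimal sketch of temperature-scaled softmax sampling, assuming (as in this note's convention) that β is the temperature and the logits are divided by it before the softmax; the function name is illustrative, not from any particular library.

```python
import math

def softmax_with_temperature(logits, beta):
    """Divide logits by the temperature beta, then apply softmax."""
    scaled = [x / beta for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [3.0, 2.0, 1.0]  # Token A, Token B, Token C

# beta = 0.5 sharpens the distribution: Token A dominates,
# so sampled text is more predictable / deterministic.
sharp = softmax_with_temperature(logits, 0.5)

# beta = 2.0 flattens the distribution: probability mass spreads
# toward Tokens B and C, so sampled text is more varied / creative.
flat = softmax_with_temperature(logits, 2.0)
```

With these logits, Token A gets roughly 0.87 of the mass at β = 0.5 but only about 0.51 at β = 2.0, matching the "more predictable" vs. "more creative" characterization in the question.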
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Token Sampling from a Conditional Probability Distribution
A language model is calculating the next token's probability distribution over a set of four candidate tokens. The raw output scores (logits) for these tokens are: {Token A: 4.0, Token B: 3.8, Token C: 1.5, Token D: 1.2}. The current generation process uses a temperature parameter
β = 1.0. A developer wants to modify the process to make the model's output less predictable and increase the likelihood of selecting Token B relative to Token A. Which of the following adjustments to the temperature parameter β would best achieve this goal?
Effect of Temperature on Probability Distributions
Parameter Tuning for Text Generation Tasks
You are tuning decoding for an internal "meeting-n...
You’re deploying an LLM to draft customer-facing i...
You’re building an internal “RFP response drafter”...
You’re implementing an LLM feature that generates ...
Post-incident analysis: fixing repetition and truncation by tuning decoding
Debugging Decoding: Balancing Determinism, Diversity, and Length in a Regulated Product
Selecting and Justifying a Decoding Policy for Two Production Use Cases
Choosing a Decoding Configuration Under Latency, Diversity, and Length Constraints
Release-readiness decision: decoding configuration for a customer-facing summarization feature
Decoding policy decision for a multilingual support assistant under safety, latency, and verbosity constraints