Logits in Transformer Language Models
In Transformer-based language models, logits are the raw, unnormalized scores output by the model's final linear layer, before a Softmax function is applied. They are represented as a sequence of vectors, z_0, ..., z_{m-1}, where each vector corresponds to a token position in the sequence. Each vector z_i is generated by projecting the final hidden state h_i into the vocabulary space (for example, z_i = h_i W_o, where W_o is the output projection matrix), with each element of z_i representing the score for one token in the vocabulary.
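This projection can be sketched in pure Python (the dimensions, weights, and function names here are illustrative, not any particular model's implementation): a final hidden state of length d is multiplied by a d x V output matrix to produce V raw scores, which Softmax then normalizes into probabilities.

```python
import math
import random

def logits_from_hidden(h, W_o):
    """Project a final hidden state h (length d) into vocabulary space.

    W_o is a d x V output projection matrix; the result is a vector of
    V raw, unnormalized scores (logits), one per vocabulary token.
    """
    d, V = len(W_o), len(W_o[0])
    return [sum(h[i] * W_o[i][j] for i in range(d)) for j in range(V)]

def softmax(z):
    """Normalize a vector of logits into a probability distribution."""
    m = max(z)                        # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in z]
    s = sum(exps)
    return [e / s for e in exps]

# Toy example: d = 4 hidden dimensions, V = 5 vocabulary tokens.
random.seed(0)
d, V = 4, 5
h = [random.gauss(0, 1) for _ in range(d)]
W_o = [[random.gauss(0, 1) for _ in range(V)] for _ in range(d)]

z = logits_from_hidden(h, W_o)  # raw scores: any real numbers
p = softmax(z)                  # probabilities: non-negative, sum to 1
```

Note that the logits themselves can be any real numbers; only after Softmax do they become a valid probability distribution over the vocabulary.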

Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
A language model processes the following two sentences independently:
- 'The river bank was steep and muddy.'
- 'He withdrew cash from the bank.'
Considering the final layer of the model, how would the output vector (the final hidden state) for the word 'bank' in the first sentence compare to the output vector for 'bank' in the second sentence?
A language model with multiple layers processes an input sequence to predict the next token. For a single token within that sequence, arrange the following representations in the chronological order they are computed by the model.
A machine learning engineer is building a system to classify the sentiment of customer reviews (e.g., positive, negative). They decide to use the internal representations from a pre-trained, multi-layered language model as features for their classifier. Which of the following model outputs would provide the most contextually-rich and effective representation of an entire review for this classification task?
Logits in Transformer Language Models
Final Hidden States in a Transformer Language Model
Next-Token Probability Calculation in Autoregressive Decoders
Diagram of the Decoding Phase
Diagram of the Transformer Language Model Forward Pass
Diagram of the Autoregressive Generation Architectural Flow
A decoder-only language model generates text one token at a time in a step-by-step process. Arrange the following steps in the correct chronological order for generating a single new token, given an initial prompt and any previously generated tokens.
In the step-by-step generation process of a decoder-only language model, consider a hypothetical modification at generation step i: instead of using the initial prompt combined with all previously generated tokens as input, the model is only given the initial prompt. What is the most likely consequence of this change on the generated text?
Diagnosing a Generation Failure in a Decoder-Only Model
Learn After
Output Probability Calculation in Transformer Language Models
A language model is tasked with predicting the next word for the sequence 'The cat sat on the'. After processing this input, the model's final linear layer produces a vector with 50,257 raw numerical scores, one for each word in its vocabulary. Which statement best characterizes this vector of raw scores, just before any final normalization function (like Softmax) is applied?
A language model has produced a vector of raw, unnormalized scores for all possible next words in its vocabulary. If a data scientist adds a constant value of 10 to every single score in this vector, the final probability assigned to each word will be unchanged, because Softmax is invariant to adding the same constant to every logit.
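This shift invariance follows from exp(z_i + c) / sum_j exp(z_j + c) = exp(z_i) / sum_j exp(z_j), since the factor exp(c) cancels. A quick numerical check (a minimal pure-Python sketch; the toy logit values are illustrative):

```python
import math

def softmax(z):
    """Convert raw logits into a probability distribution."""
    m = max(z)                        # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in z]
    s = sum(exps)
    return [e / s for e in exps]

logits = [2.0, -1.0, 0.5, 3.0]       # toy raw scores
shifted = [x + 10 for x in logits]   # add the same constant to every score

p1 = softmax(logits)
p2 = softmax(shifted)

# The two distributions are identical: the exp(10) factor cancels
# between the numerator and the denominator of the Softmax.
assert all(abs(a - b) < 1e-12 for a, b in zip(p1, p2))
```

The same cancellation is why practical Softmax implementations subtract the maximum logit before exponentiating: it changes nothing mathematically but prevents overflow.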
Interpreting Model Output Scores