Definition

Logits in Transformer Language Models

In Transformer-based language models, logits are the raw, unnormalized scores that are output by the model's final linear layer before the application of a Softmax function. They are represented as a sequence of vectors, {z0,...,zm1}\{z_0, ..., z_{m-1}\}, where each vector corresponds to a token position in the sequence. These vectors are generated by projecting the final hidden states (hLh^L) into the vocabulary space, with each element in a vector representing the score for a potential token.

Image 0

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related