1Cademy - Classifier Head

Learn Before

Encoder-Classifier Architecture

Definition

Classifier Head

A classifier, represented as $Classify_{\omega}(\cdot)$ in the formula, is the final component of a classification model that takes a feature representation (the output of an encoder) as input and predicts a probability distribution over a set of predefined classes. It is often a simple neural network layer, like a linear layer followed by a softmax function. The subscript $\omega$ denotes the learned parameters of this classifier component.

Updated 2025-10-07

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course

Learn After

A text classification model is designed to categorize sentences into one of three classes: 'Positive', 'Negative', or 'Neutral'. The model works in two stages: first, it generates a unique numerical vector representation for each input sentence. Second, a final component takes this vector and outputs a probability distribution over the three classes. During testing, you observe that for a wide variety of different input sentences, the model consistently outputs probabilities that are very close
Role of the Final Classification Component
A component is added to a model to predict a probability distribution over a set of predefined classes based on an input feature representation. Match each element of this component to its specific function.

Learn Before

Related

Learn After