1Cademy - A machine learning engineer is building a model to classify sentences as either question or statement. They add a special classification token to the beginning of each input sentence before passing it to an encoder. The encoder then produces a final hidden state vector for every token in the input. For the final classification step, which hidden state vector should be used as the representative summary of the entire sentence?

Learn Before

Role of the [CLS] Token in Sequence Classification

Multiple Choice

A machine learning engineer is building a model to classify sentences as either 'question' or 'statement'. They add a special classification token to the beginning of each input sentence before passing it to an encoder. The encoder then produces a final hidden state vector for every token in the input. For the final classification step, which hidden state vector should be used as the representative summary of the entire sentence?

Updated 2025-10-02

Contributors are:

Who are from:

Learn Before

Related