In a pipeline designed for sentence-pair classification, an input like [CLS] sentence A [SEP] sentence B [SEP] is processed by an encoder to produce a sequence of contextualized encodings, one for each token. For the final classification, only the encoding corresponding to the [CLS] token is passed to a Softmax layer. What is the most accurate reason for selecting this specific encoding to represent the entire input?
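The pipeline described above can be sketched in a few lines. This is a toy illustration only, with made-up placeholder numbers standing in for a trained Transformer encoder and classification head (it is not real BERT weights or a real tokenizer); its point is the data flow: the encoder yields one vector per token, and only the vector at position 0, the `[CLS]` slot, is passed through a linear layer and softmax.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of logits
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def encode(tokens):
    # Stand-in for a Transformer encoder: returns one "contextual" vector
    # per token. In a real encoder, self-attention lets every position
    # attend to every other, so even position 0 ([CLS]) can aggregate
    # information from the whole sentence pair. Values here are arbitrary.
    dim = 4
    return [[((len(tok) * (i + 1) + d) % 7) / 7.0 for d in range(dim)]
            for i, tok in enumerate(tokens)]

tokens = ["[CLS]", "sentence", "A", "[SEP]", "sentence", "B", "[SEP]"]
encodings = encode(tokens)   # one encoding per input token
cls_vec = encodings[0]       # only the [CLS] encoding feeds the classifier

# Classification head: linear layer (placeholder weights) + softmax, 2 labels
W = [[0.5, -0.2, 0.1, 0.3],
     [-0.4, 0.6, 0.2, -0.1]]
logits = [sum(w * x for w, x in zip(row, cls_vec)) for row in W]
probs = softmax(logits)
print(probs)  # a probability distribution over the two labels
```

Note that the other six token encodings are computed but never consumed by the head; the `[CLS]` position is the single dedicated summary slot for the whole pair.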
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is being fine-tuned for a sentence-pair classification task (e.g., determining if one sentence is an entailment of another). Arrange the following steps into the correct sequence that describes the data processing pipeline, from the initial input to the final prediction.
Debugging a Sentence-Pair Classification Model