Concept

CLS Token as a Start Symbol in Encoder Pre-training

In encoder pre-training and related tasks, the special $[\mathrm{CLS}]$ token is conventionally used as the start symbol of an input sequence. It is typically the first token, denoted $x_0$, and serves as a generic start-of-sequence marker.
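As a minimal sketch of this convention, the snippet below prepends the start symbol to a toy token list so that it occupies position 0. The function name `add_start_symbol` and the example tokens are hypothetical and chosen only for illustration; real tokenizers also map tokens to integer IDs and may append other special tokens.

```python
# Illustrative only: a toy sequence, not the output of a real tokenizer.
CLS = "[CLS]"

def add_start_symbol(tokens):
    """Return a new sequence with [CLS] as x_0, followed by the original tokens."""
    return [CLS] + list(tokens)

tokens = ["the", "cat", "sat"]          # hypothetical input tokens x_1..x_3
sequence = add_start_symbol(tokens)

# Print each position with its index so the role of x_0 is visible.
for i, tok in enumerate(sequence):
    print(f"x_{i} = {tok}")
```

Here the original tokens shift to positions 1 through 3, and position 0 is always the start symbol regardless of the input.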


Updated 2026-05-02


Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.1 Pre-training - Foundations of Large Language Models