Learn Before
A researcher is preparing the sentence 'Language models are powerful.' for input into a specific type of model. This model architecture requires a special token, denoted as [CLS], to be placed at the very beginning of every input sequence to mark its start. Which of the following options correctly formats the sentence for this model?
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A researcher is preparing the sentence 'Language models are powerful.' for input into a specific type of model. This model architecture requires a special token, denoted as
[CLS], to be placed at the very beginning of every input sequence to mark its start. Which of the following options correctly formats the sentence for this model?The
[CLS]token, when used in a language model, is exclusively for aggregating sequence information for downstream classification tasks and does not serve as the initial token required to start the sequence's probability calculation.The Role of the Initial Token in Sequence Probability