Learn Before
A language model processes text by breaking it into an ordered sequence of tokens, where each token is a unit of text (like a word or punctuation mark) with an associated position. Consider the following two sentences:
Sentence A: 'The fast car races.' Sentence B: 'The fast cars race.'
Which of the following options most accurately represents the distinct token sequences for these two sentences as a typical tokenizer would produce them?
0
1
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Special Tokens in Language Models
A language model processes text by breaking it into an ordered sequence of tokens, where each token is a unit of text (like a word or punctuation mark) with an associated position. Consider the following two sentences:
Sentence A: 'The fast car races.' Sentence B: 'The fast cars race.'
Which of the following options most accurately represents the distinct token sequences for these two sentences as a typical tokenizer would produce them?
A language model processes text by breaking it down into an ordered sequence of tokens. Arrange the following tokens to reconstruct the original sentence: 'The model predicts the next word .'
Representing Text as a Token Sequence