Short Answer

Language Model Training Task

Consider the following input sequence for a language model: [CLS] The sky is blue . [SEP]. During a specific training step, the word 'is' is selected for prediction. However, due to a particular training rule, the input sequence given to the model remains exactly [CLS] The sky is blue . [SEP]. Describe what the model is tasked to do with the word 'is' in this specific scenario, despite it being visible in the input.

0

1

Updated 2025-10-09

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science