Learn Before
Multiple Choice

A language model is being trained on the task of filling in masked words. At an early stage of training, for the sentence 'The sun rises in the [MASK]', the model assigns a probability of 0.05 to the correct word 'east'. After many more rounds of successful training on a large dataset, the model is presented with the same masked sentence. Which of the following outcomes is the most plausible and directly reflects the objective of this training process?

0

1

Updated 2025-10-07

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science