1Cademy - A language model is being trained on the task of filling in masked words. At an early stage of training, for the sentence 'The sun rises in the [MASK]', the model assigns a probability of 0.05 to the correct word 'east'. After many more rounds of successful training on a large dataset, the model is presented with the same masked sentence. Which of the following outcomes is the most plausible and directly reflects the objective of this training process?

Learn Before

Probability of a True Token in MLM

Multiple Choice

A language model is being trained on the task of filling in masked words. At an early stage of training, for the sentence 'The sun rises in the [MASK]', the model assigns a probability of 0.05 to the correct word 'east'. After many more rounds of successful training on a large dataset, the model is presented with the same masked sentence. Which of the following outcomes is the most plausible and directly reflects the objective of this training process?
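
The objective behind this question can be sketched numerically. The snippet below is a minimal illustration, assuming the usual masked-language-modeling setup in which the loss at a masked position is the negative log of the probability assigned to the true token. The function name `masked_token_loss` and the later probability of 0.90 are hypothetical, chosen only for illustration; only the value 0.05 comes from the question.

```python
import math


def masked_token_loss(p_true: float) -> float:
    """Negative log-likelihood of the correct token at one masked position.

    Hypothetical helper for illustration; assumes the standard
    cross-entropy objective, which reduces to -log(p_true) for one token.
    """
    if not 0.0 < p_true <= 1.0:
        raise ValueError("p_true must be in the interval (0, 1]")
    return -math.log(p_true)


# Probability of 'east' early in training (value given in the question).
early_loss = masked_token_loss(0.05)

# Probability of 'east' after further training (assumed value, not from the source).
later_loss = masked_token_loss(0.90)

print(f"early loss: {early_loss:.3f}")
print(f"later loss: {later_loss:.3f}")
```

Under this assumption, lowering the loss and raising the probability of the true token are the same thing, so successful training should move the probability of 'east' upward from 0.05.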

Updated 2025-10-07
