Multiple Choice

A language model is being trained to predict the next word in a sequence. The training process aims to minimize a loss value, which measures the difference between the model's predicted probability distribution for the next word and the actual correct word. Consider two separate predictions for the next word after the phrase 'The sun is shining...':

  • Prediction A: The model assigns a probability of 0.75 to the correct word, 'brightly'.
  • Prediction B: The model assigns a probability of 0.15 to the correct word, 'brightly'.

Which of the following statements accurately analyzes the loss values for these two predictions?

0

1

Updated 2025-09-26

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science