Multiple Choice

A language model is being trained to predict the next word in a sentence. For the input context 'The sun is shining...', the ideal (target) probability distribution, denoted as PrtPr^t, gives a high probability to the word 'brightly'. The model's performance is measured by a loss function that compares the model's predicted probability distribution, PrθsPr_θ^s, to the target distribution.

Consider two different sets of model parameters, θ₁ and θ₂:

  • With parameters θ₁, the model's distribution Prθ1sPr_{θ₁}^s predicts 'brightly' with a high probability.
  • With parameters θ₂, the model's distribution Prθ2sPr_{θ₂}^s predicts 'darkly' with a high probability.

Which of the following statements correctly analyzes the relationship between the parameters and the loss function for this specific input?

0

1

Updated 2025-09-28

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science