Multiple Choice

A language model is being trained with a supervised objective to maximize the probability of the correct output. Given the input 'The largest city in the US is', the target output is the two-token sequence 'New York'. Two different models are evaluated on this single instance.

  • Model A predicts the first token 'New' with a probability of 0.6, and then predicts the second token 'York' with a probability of 0.8.
  • Model B predicts the first token 'New' with a probability of 0.9, and then predicts the second token 'York' with a probability of 0.4.

Based on the standard training objective for this task, which statement correctly analyzes the models' performance on this specific example?

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science