Multiple Choice

A language model is being trained on the sentence 'The cat sat'. The model calculates the following conditional log-probabilities at each step, where '<BOS>' is a fixed start-of-sequence token:

  • log P('The' | '<BOS>') = -1.5
  • log P('cat' | '<BOS>', 'The') = -0.9
  • log P('sat' | '<BOS>', 'The', 'cat') = -1.2

Based on the standard training objective for this single sequence, what is the total log-likelihood value that the model aims to maximize?
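
The arithmetic behind this question can be sketched in a few lines of Python. This assumes the standard objective is the sequence log-likelihood obtained from the chain rule, that is, the sum of the per-token conditional log-probabilities listed above. The variable names are illustrative and do not come from the course material.

```python
import math

# Conditional log-probabilities given in the question, in token order.
# '<BOS>' is only conditioning context, so it contributes no term of its own.
log_probs = {
    "The": -1.5,  # log P('The' | '<BOS>')
    "cat": -0.9,  # log P('cat' | '<BOS>', 'The')
    "sat": -1.2,  # log P('sat' | '<BOS>', 'The', 'cat')
}

# Chain rule in log space: the log of a product of conditionals
# is the sum of their logs.
total_log_likelihood = math.fsum(log_probs.values())

# Exponentiating recovers the probability the model assigns to the sequence.
sequence_probability = math.exp(total_log_likelihood)

print(f"total log-likelihood: {total_log_likelihood:.1f}")
print(f"sequence probability: {sequence_probability:.4f}")
```

Under that assumption the total is about -3.6. Training frameworks usually minimize the negative of this quantity (the negative log-likelihood), which amounts to the same objective.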

Updated 2025-10-03

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.5 Inference - Foundations of Large Language Models

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science