Short Answer

Calculating Sequence Log-Likelihood

A language model is being trained with the objective of maximizing the log-likelihood of sequences. For the specific sequence 'The cat sat', the model computes the following conditional log-probabilities for the actual next token at each position (assuming a fixed start-of-sequence token):

  • log Pr('The' | start_token) = -1.5
  • log Pr('cat' | start_token, 'The') = -0.8
  • log Pr('sat' | start_token, 'The', 'cat') = -1.2

Calculate the total log-likelihood for this entire sequence, which represents the value the model aims to maximize for this training example.

0

1

Updated 2025-09-26

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.5 Inference - Foundations of Large Language Models

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science