Short Answer

Verifying Log-Probability Equivalence

A language model is trained to predict sequences of words. Consider the three-word sequence (x₀, x₁, x₂) = ('the', 'cat', 'sat'). The model assigns the following probabilities:

  • The joint probability of the entire sequence:
    Pr('the', 'cat', 'sat') = 0.01

  • The individual conditional probabilities:

    • Pr('the') = 0.1
    • Pr('cat' | 'the') = 0.5
    • Pr('sat' | 'the', 'cat') = 0.2

Your task is to demonstrate the mathematical equivalence between the log-probability of the entire sequence and the sum of the conditional log-probabilities. Calculate both values using the natural logarithm (ln) and show that they are equal.

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science