Short Answer

Model Comparison using Conditional Log-Likelihood

A language model is being trained on a small dataset for a classification task. You are evaluating two different sets of model parameters, θA\theta_A and θB\theta_B. For a batch of two examples, the models produce the following probabilities for the correct target classes:

Example 1:

  • Model A: P(y(1)x(1);θA)=0.7P(y^{(1)}|x^{(1)};\theta_A) = 0.7
  • Model B: P(y(1)x(1);θB)=0.9P(y^{(1)}|x^{(1)};\theta_B) = 0.9

Example 2:

  • Model A: P(y(2)x(2);θA)=0.6P(y^{(2)}|x^{(2)};\theta_A) = 0.6
  • Model B: P(y(2)x(2);θB)=0.4P(y^{(2)}|x^{(2)};\theta_B) = 0.4

Calculate the total conditional log-likelihood for the batch for each set of parameters. Based on your calculation, which set of parameters (θA\theta_A or θB\theta_B) is considered a better fit for this data? Use the natural logarithm (ln) for your calculations.

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Data Science

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science