1Cademy - Equivalence of Training Objectives

Learn Before

Training Objective as Joint Log-Likelihood Maximization of Concatenated Sequences

Short Answer

Equivalence of Training Objectives

A language model can be trained by maximizing the joint log-likelihood of a concatenated input-output sequence, log Pr(x, y). This is often treated as equivalent to maximizing the conditional log-likelihood, log Pr(y|x). Explain the mathematical reasoning and the specific condition required for these two objectives to be equivalent in terms of finding the optimal model parameters.

Updated 2025-10-03

Contributors are:

Who are from:

Learn Before

Related