Formula

Maximum Likelihood Estimation (MLE) Objective in Supervised Language Model Training

In standard supervised training, the objective for a Large Language Model is to maximize the probability of generating a correct 'gold-standard' output sequence, yy, given an input, xx. This is achieved through Maximum Likelihood Estimation (MLE), where the model, which produces a series of token distributions, is trained to align these predictions with the one-hot distributions representing the target sequence. The formal objective is to maximize the conditional probability Pr(yx)Pr(y|x).

0

1

Updated 2025-10-08

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences