Learn Before
Maximum Likelihood Estimation (MLE) Objective in Supervised Language Model Training
In standard supervised training, the objective for a Large Language Model is to maximize the probability of generating a correct 'gold-standard' output sequence, y, given an input, x. This is achieved through Maximum Likelihood Estimation (MLE), where the model, which produces a series of token distributions, is trained to align these predictions with the one-hot distributions representing the target sequence. The formal objective is to maximize the conditional probability P(y | x), which factorizes into the product of the model's per-token probabilities along the target sequence.
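As a minimal sketch of this factorization (the per-token probabilities below are made-up illustrative values, not from the source), P(y | x) is the product of the model's probabilities for each gold token, and maximizing it is equivalent to minimizing the negative log-likelihood:

```python
import math

# Hypothetical probabilities the model assigns to each gold token,
# i.e. P(token_t | x, gold tokens before t). Illustrative values only.
gold_token_probs = [0.7, 0.5, 0.9]

# MLE maximizes P(y | x) = product of per-token probabilities ...
sequence_prob = math.prod(gold_token_probs)

# ... which is equivalent to minimizing the negative log-likelihood,
# i.e. the cross-entropy against the one-hot target distributions.
nll = -sum(math.log(p) for p in gold_token_probs)

print(f"P(y|x) = {sequence_prob:.3f}")  # 0.315
print(f"NLL    = {nll:.3f}")            # 1.155
```

Maximizing P(y | x) and minimizing the NLL are the same objective; the log form is the one used in practice because it turns the product into a numerically stable sum.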
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Analogy to NLP Data Augmentation in Synthetic Data Generation
Limitation of Relying on Human-Crafted Inputs for Synthetic Data Generation
Proven Utility of Synthetic Data in Well-Tuned LLMs
Generating Fine-Tuning Data with Crowdsourced Questions and LLM-Generated Answers
Using a Well-Tuned LLM to Generate Fine-Tuning Data for a New LLM
Data Generation Strategy for a Specialized AI Assistant
Generating Synthetic Data with a Weak LLM for Instruction Fine-Tuning
A small research lab with a limited budget aims to fine-tune a language model for a specialized task: summarizing complex legal documents. They need a large dataset of 'legal text' and 'corresponding summary' pairs. Considering their resource constraints, which of the following is the most efficient and scalable strategy for creating this dataset?
Evaluating Data Generation Strategies
Learn After
A language model is being trained with a supervised objective to maximize the probability of the correct output. Given the input 'The largest city in the US is', the target output is the two-token sequence 'New York'. Two different models are evaluated on this single instance.
- Model A predicts the first token 'New' with a probability of 0.6, and then predicts the second token 'York' with a probability of 0.8.
- Model B predicts the first token 'New' with a probability of 0.9, and then predicts the second token 'York' with a probability of 0.4.
Based on the standard training objective for this task, which statement correctly analyzes the models' performance on this specific example?
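A short worked sketch of how the standard objective scores the two models (this computation is added here for clarity and is not part of the original question):

```python
import math

# Per-token probabilities from the scenario above.
model_a = [0.6, 0.8]  # P('New'), then P('York' | 'New')
model_b = [0.9, 0.4]

for name, probs in [("Model A", model_a), ("Model B", model_b)]:
    seq_prob = math.prod(probs)             # P('New York' | input)
    nll = -sum(math.log(p) for p in probs)  # the loss MLE minimizes
    print(f"{name}: P(y|x) = {seq_prob:.2f}, NLL = {nll:.3f}")

# Model A: P(y|x) = 0.48, NLL = 0.734
# Model B: P(y|x) = 0.36, NLL = 1.022
```

Under MLE it is the whole-sequence probability that matters, so Model A attains the lower loss on this instance despite its weaker first-token prediction.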
Analyzing Model Training with Flawed Data
Limitations of Supervised Fine-Tuning for LLM Alignment
Parameter Updates in Supervised LLM Training