1Cademy - A language model is being trained on a large dataset of text sequences. After a single parameter update, the model's calculated log-probability for one specific sequence in the dataset increases by 2.5, while the log-probabilities for all other sequences in the dataset remain exactly the same. How does this change affect the overall maximum likelihood training objective for the entire dataset?

Learn Before

Maximum Likelihood Training Objective for a Dataset of Sequences

Multiple Choice

A language model is being trained on a large dataset of text sequences. After a single parameter update, the model's calculated log-probability for one specific sequence in the dataset increases by 2.5, while the log-probabilities for all other sequences in the dataset remain exactly the same. How does this change affect the overall maximum likelihood training objective for the entire dataset?
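The scenario can be checked numerically with a minimal sketch. It assumes the dataset objective is the plain sum of per-sequence log-probabilities, with no averaging over the dataset; if the objective were divided by the number of sequences N, the change would be scaled by 1/N. The log-probability values and the helper name `dataset_log_likelihood` are hypothetical and chosen only for illustration.

```python
import math

def dataset_log_likelihood(seq_log_probs):
    """Sum of per-sequence log-probabilities (assumed, unaveraged form of the objective)."""
    return math.fsum(seq_log_probs)

# Hypothetical log-probabilities for a four-sequence dataset before the update.
before = [-12.0, -8.5, -20.25, -15.0]

# After the update, only one sequence changes: its log-probability rises by 2.5.
after = list(before)
after[2] += 2.5

delta = dataset_log_likelihood(after) - dataset_log_likelihood(before)
print(delta)  # change in the summed objective
```

Under this assumption, every other term in the sum cancels in the difference, so the change in the objective equals the change in the one affected sequence's log-probability.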

Updated 2025-10-08
