1Cademy - Defining the Ground Truth Distribution

Learn Before

Loss Function for Language Modeling

Short Answer

Defining the Ground Truth Distribution

A language model with a vocabulary of ['mat', 'hat', 'sat', 'cat', 'the'] is predicting the next word for the context 'The cat sat on the'. The correct next word is 'mat'. To calculate the error for this prediction, a loss function compares the model's predicted probability distribution to a 'gold' or ground truth distribution. Describe the 'gold' distribution for this specific case, representing it as a vector.

Updated 2025-10-08

Contributors are:

Who are from:

Learn Before

Related