Short Answer

Defining the Ground Truth Distribution

A language model with a vocabulary of ['mat', 'hat', 'sat', 'cat', 'the'] is predicting the next word for the context 'The cat sat on the'. The correct next word is 'mat'. To calculate the error for this prediction, a loss function compares the model's predicted probability distribution to a 'gold' or ground truth distribution. Describe the 'gold' distribution for this specific case, representing it as a vector.

0

1

Updated 2025-10-08

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science