1Cademy - Interpreting a Models Output Distribution

Learn Before

Predicted Probability Distribution in MLM

Case Study

Interpreting a Model's Output Distribution

A language model is being trained on a sentence where one word has been masked. For the masked position, the model's task is to predict the original word. The original word was 'ocean'. The model outputs the following probability distribution over a simplified vocabulary for that position. Based on this distribution, analyze the model's current understanding and explain how this specific output is used to update the model.

Updated 2025-10-03

Contributors are:

Who are from:

Learn Before

Related