Case Study

Interpreting a Model's Output Distribution

A language model is being trained on a sentence where one word has been masked. For the masked position, the model's task is to predict the original word. The original word was 'ocean'. The model outputs the following probability distribution over a simplified vocabulary for that position. Based on this distribution, analyze the model's current understanding and explain how this specific output is used to update the model.

0

1

Updated 2025-10-03

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science