Short Answer

Calculating Prediction Loss

A language model is processing the input sequence 'The cat sat on the'. The correct next token is 'mat', and the model assigns it a probability of 0.25. Calculate the loss for this single-token prediction using the negative log-likelihood with the natural logarithm (ln). Show your calculation and briefly explain what a lower loss value would signify about the model's prediction.
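One way to check the arithmetic is the minimal Python sketch below. It assumes the single-token loss is simply -ln(p), where p is the probability the model gives the correct token. The function name `negative_log_likelihood` is a hypothetical helper chosen for illustration, not part of any library.

```python
import math


def negative_log_likelihood(prob_correct: float) -> float:
    """Return -ln(p) for the probability p assigned to the correct token.

    Hypothetical helper for illustration; assumes 0 < p <= 1.
    """
    if not 0.0 < prob_correct <= 1.0:
        raise ValueError("probability must be in the interval (0, 1]")
    return -math.log(prob_correct)


# Probability the model assigns to 'mat' in the question above.
loss = negative_log_likelihood(0.25)
print(f"{loss:.4f}")  # 1.3863, since -ln(0.25) = ln(4)
```

A lower loss corresponds to a higher probability on the correct token: under this definition the loss shrinks toward 0 as p approaches 1 and grows without bound as p approaches 0.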

Updated 2025-10-04

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.3 Prompting - Foundations of Large Language Models

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science