1Cademy - Calculating Prediction Loss

Learn Before

Sample-wise Negative Log-Likelihood Loss for a Sub-sequence

Short Answer

Calculating Prediction Loss

A language model is processing the input sequence 'The cat sat on the'. The correct next token is 'mat'. The model assigns a probability of 0.25 to the token 'mat' being the correct next token. Calculate the loss for this specific one-token prediction using the negative log-likelihood principle (using the natural logarithm, ln). Show your calculation and briefly explain what a lower value for this loss would signify about the model's prediction.

Updated 2025-10-04

Contributors are:

Who are from:

Learn Before

Related