Sample-wise Negative Log-Likelihood Loss for a Sub-sequence
When evaluating a model on a specific training instance, the loss function is calculated solely over the target sub-sequence y_sample, instead of the full sequence. For a model defined by parameters θ, this loss is expressed as the negative log-likelihood of the probability of generating the output sub-sequence y_sample, given the input sub-sequence x_sample. The formula is:
Loss_θ(sample) = −log Pr_θ(y_sample | x_sample)
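As a minimal Python sketch of this formula (assuming per-token probabilities from the model are available; `sample_nll`, `token_probs`, and `target_mask` are illustrative names, not from the source):

```python
import math

def sample_nll(token_probs, target_mask):
    """Sample-wise negative log-likelihood, summed only over the
    target sub-sequence y_sample (mask = 1); tokens belonging to the
    input sub-sequence x_sample (mask = 0) contribute no loss."""
    return -sum(math.log(p) for p, m in zip(token_probs, target_mask) if m)

# Probabilities for [x_sample tokens..., y_sample tokens...]:
loss = sample_nll([0.9, 0.8, 0.5, 0.25], [0, 0, 1, 1])
# loss = -(log 0.5 + log 0.25) ≈ 2.079
```

Note that the first two probabilities are ignored entirely: only the positions masked as targets affect the loss, which is exactly the "sub-sequence" restriction described above.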
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.3 Prompting - Foundations of Large Language Models
Related
Sample-wise Negative Log-Likelihood Loss for a Sub-sequence
Sequence-Level Loss
An engineer is training a model on a large dataset. They are monitoring two metrics:
- Metric A: A value calculated for each individual data sample. This value fluctuates significantly from one sample to the next.
- Metric B: A single, aggregate value calculated after the model has processed the entire training dataset. This value shows a steady, downward trend over multiple passes through the dataset.
Based on the standard terminology for measuring a model's performance, what is the most accurate way to classify these two metrics?
Interpreting Training Metrics
Match each term to its most accurate description regarding how a model's performance is measured during training.
Loss Function for RNN
Sample-wise Negative Log-Likelihood Loss for a Sub-sequence
Cross-Entropy Loss for Knowledge Distillation
A language model is being trained to generate the four-word sentence 'The quick brown fox'. The model generates one word at a time, and the error (loss) is calculated at each step:
- Loss for 'The' = 0.1
- Loss for 'quick' = 0.3
- Loss for 'brown' = 0.2
- Loss for 'fox' = 0.4
To update the model's parameters, the training process computes a single, overall loss value for the entire sentence. Which statement best analyzes this method of calculating the overall loss?
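The arithmetic above can be sketched in a few lines (a hedged illustration; it assumes, as is standard for autoregressive training, that the sentence-level loss is the sum of the per-token losses, optionally normalized by length):

```python
# Per-token losses from the example sentence 'The quick brown fox'.
token_losses = {"The": 0.1, "quick": 0.3, "brown": 0.2, "fox": 0.4}

total_loss = sum(token_losses.values())        # summed sequence-level loss
average_loss = total_loss / len(token_losses)  # length-normalized variant
```

Summing makes every token's error contribute to the single value used for the parameter update; dividing by the sequence length makes losses comparable across sentences of different lengths.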
Total Loss Calculation for a Token Sequence
Calculating Average Sequence-Level Loss
Evaluating Training Strategies for a Translation Model
Selective Gradient Propagation for Sub-sequence Loss
Sample-wise Negative Log-Likelihood Loss for a Sub-sequence
For a supervised fine-tuning task, a single training instance consists of an input segment (x_sample) and a corresponding output segment (y_sample). If x_sample is 'Instruction: Translate to Spanish. Input: Hello.' and y_sample is 'Response: Hola.', which of the following represents the correct structure for the final combined sample that the model will process?
Deconstructing a Fine-Tuning Sample
In preparing a data sample for supervised fine-tuning, a common practice is to structure the sample by concatenating the input segment (x_sample) and the output segment (y_sample) into a single sequence: sample = [x_sample, y_sample]. What is the primary reason for placing the input segment before the output segment in this structure?
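A small sketch of building the combined sample (hedged: it follows the standard input-then-output order used for autoregressive fine-tuning; the variable names and the space separator are illustrative, not from the source):

```python
x_sample = "Instruction: Translate to Spanish. Input: Hello."
y_sample = "Response: Hola."

# Concatenate the input first, then the output: the autoregressive model
# then predicts y_sample conditioned on x_sample, and the loss is
# computed only over the y_sample portion of the sequence.
sample = x_sample + " " + y_sample
```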
Learn After
Selective Gradient Propagation for Sub-sequence Loss
A language model's performance on a single training sample is measured by calculating the negative logarithm of the probability it assigns to the correct target output sub-sequence, given an input sequence. Consider two models, Model A and Model B, being evaluated on the same sample. For this sample, Model A assigns a probability of 0.8 to the correct target sub-sequence, while Model B assigns a probability of 0.2. Based on this information, which statement correctly analyzes the models' performance on this specific sample?
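The comparison can be checked numerically (a hedged sketch of the negative-log computation described above; the natural logarithm is assumed):

```python
import math

# Negative log-likelihood each model incurs on this sample.
loss_a = -math.log(0.8)  # ≈ 0.223: higher probability, lower loss
loss_b = -math.log(0.2)  # ≈ 1.609: lower probability, higher loss
```

Because −log is strictly decreasing on (0, 1], Model A's higher probability for the correct target sub-sequence necessarily yields the smaller loss on this sample.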
Calculating Prediction Loss
Evaluating Model Performance on Different Samples