Multiple Choice

A language model is being trained on a task to determine whether two sentences are consecutive. For a specific pair of sentences where the second sentence is the correct follow-up, the model's final classifier outputs a probability of 0.8 for the 'IsNext' label. Using the standard negative log-likelihood loss for this task, what is the loss value for this single training example? (Note: use the natural logarithm, ln.)

0

1
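The arithmetic behind the question can be sketched as follows, assuming the loss for a single example is the negative natural log of the probability the model assigns to the correct label. The function name `nll_loss` is hypothetical and chosen only for illustration.

```python
import math


def nll_loss(p_correct: float) -> float:
    """Negative log-likelihood for one example.

    p_correct is the probability the model assigns to the true label
    (here 'IsNext'). Hypothetical helper, not taken from the source.
    """
    if not 0.0 < p_correct <= 1.0:
        raise ValueError("p_correct must be in (0, 1]")
    return -math.log(p_correct)  # math.log is the natural logarithm


# The true label is 'IsNext' and the model gives it probability 0.8.
loss = nll_loss(0.8)
print(f"{loss:.4f}")  # prints 0.2231
```

A perfectly confident correct prediction (probability 1.0) would give a loss of 0, and the loss grows as the probability assigned to the correct label shrinks.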

Updated 2025-09-26

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science