Concept

Span Prediction Loss Function

A span prediction model is trained with a loss computed from the outputs of its start-of-span and end-of-span prediction networks. The total loss is the sum of the negative log-likelihoods from both networks, taken over every token in the context passage.
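As a rough illustration of this objective, the sketch below sums the negative log-likelihoods of both networks over every token position. It assumes, purely for illustration, that each network outputs one probability per token (the probability that the token starts, or ends, the answer span) and that the gold span is given as a pair of token indices. The function name `span_prediction_loss`, its parameters, and the clamping constant `eps` are hypothetical and not taken from the source.

```python
import math


def span_prediction_loss(start_probs, end_probs, start_index, end_index, eps=1e-12):
    """Sum the negative log-likelihoods of both networks over all tokens.

    Assumption: start_probs[j] and end_probs[j] are the probabilities that
    token j is the start or the end of the span. The likelihood of the gold
    label at position j is p when j is the gold position and 1 - p otherwise.
    """
    if len(start_probs) != len(end_probs):
        raise ValueError("both networks must score the same number of tokens")

    total = 0.0
    for j, (p_start, p_end) in enumerate(zip(start_probs, end_probs)):
        # Likelihood each network assigns to the gold label at position j.
        like_start = p_start if j == start_index else 1.0 - p_start
        like_end = p_end if j == end_index else 1.0 - p_end
        # Clamp to avoid log(0) when a probability is exactly 0 or 1.
        like_start = max(like_start, eps)
        like_end = max(like_end, eps)
        total += -math.log(like_start) - math.log(like_end)
    return total


# Usage: a 4-token context whose gold span covers tokens 1..2.
good = span_prediction_loss([0.1, 0.8, 0.05, 0.05], [0.05, 0.1, 0.8, 0.05], 1, 2)
bad = span_prediction_loss([0.8, 0.1, 0.05, 0.05], [0.05, 0.05, 0.1, 0.8], 1, 2)
print(good < bad)
```

Predictions that put high probability on the gold start and end positions give a smaller loss than predictions that favor other positions. Some implementations average the loss over the number of tokens rather than summing; the sketch uses a plain sum to match the description above.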

Updated 2026-04-18

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.1 Pre-training - Foundations of Large Language Models