1Cademy - Span Prediction Loss Formula

Learn Before

Span Prediction Loss Function

Formula

Span Prediction Loss Formula

The loss for a span prediction task is calculated as the average negative log-likelihood of the predicted probabilities for the start and end positions of the answer span. The formula is: $\mathrm{Loss} = -\frac{1}{n} \sum_{j=1}^{n} \big( \log p_j^{\mathrm{beg}} + \log p_j^{\mathrm{end}} \big)$ Where: - $n$ is the number of tokens in the context text. - $p_j^{\mathrm{beg}}$ is the model's predicted probability that token $j$ is the start of the answer span. - $p_j^{\mathrm{end}}$ is the model's predicted probability that token $j$ is the end of the answer span.