Learn Before
Next Sentence Prediction Loss Formula
For classification problems like Next Sentence Prediction (NSP), the loss function under maximum likelihood training is defined as the negative log-probability of the correct label given the sequence representation. The specific formula is:

Loss = −log Pr(c | h)

where c represents the correct (or 'gold') label for the current sample, and h is the aggregate sequence representation vector.
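As a minimal sketch of this formula in Python: given the model's predicted probabilities over the two NSP labels, the loss for one training example is just the negative natural log of the probability assigned to the gold label. The label ordering (index 0 = 'IsNext', index 1 = 'NotNext') and the function name are illustrative conventions, not fixed by the formula itself.

```python
import math

def nsp_loss(probs, gold_label):
    """Negative log-likelihood loss for one binary NSP example.

    probs: predicted probabilities over the two labels,
           e.g. [P('IsNext'), P('NotNext')] -- an assumed ordering.
    gold_label: index of the correct ('gold') label for this pair.
    """
    return -math.log(probs[gold_label])

# A confident, correct prediction gives a small loss...
low = nsp_loss([0.95, 0.05], gold_label=0)
# ...while the same confident prediction, when wrong, gives a large one.
high = nsp_loss([0.95, 0.05], gold_label=1)
```

Note that the loss shrinks toward 0 as the probability of the correct label approaches 1, which is why a steadily decreasing NSP loss indicates the classifier is assigning increasing probability to the gold labels.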

Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Next Sentence Prediction Loss Formula
A language model is being trained on a binary classification task to determine if two sentences are consecutive. The model's performance is optimized by minimizing a loss value derived from a special token that aggregates information about the sentence pair. If the loss value for this task consistently decreases during training, what is the most accurate interpretation of the model's learning progress?
Diagnosing Training Failure in a Sentence Relationship Task
During the training of a language model on the task of predicting sentence relationships, if the classifier component assigns a very high probability to the correct relationship label for a given sentence pair, the corresponding loss value calculated for that pair will also be very high.
Learn After
A language model is being trained on a task to determine if two sentences are consecutive. For a specific pair of sentences where the second sentence is the correct follow-up, the model's final classifier outputs a probability of 0.8 for the 'IsNext' label. Based on the standard negative log-likelihood loss function used for this task, what is the calculated loss value for this single training example? (Note: Use the natural logarithm, ln).
Analyzing Model Training Loss
For the task of predicting if two sentences are consecutive, a higher model-predicted probability for the correct label (e.g., 'IsNext' or 'NotNext') will result in a higher calculated loss value for that training example.