Concept

Next Sentence Prediction as an Auxiliary Training Objective

The Next Sentence Prediction (NSP) task is typically not the sole objective during model pre-training. Instead, it is usually employed as an auxiliary training loss: the model is optimized for the NSP objective jointly with other objectives, such as Masked Language Modeling (MLM), typically by adding the individual losses into a single training loss. The sentence-level signal from NSP complements the token-level signal from MLM, so the model learns relationships between sentences as well as between words.
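As a rough illustration of what "auxiliary" means in practice, the sketch below combines an MLM loss and an NSP loss into one scalar. It is a minimal toy in plain Python, not any particular library's implementation. The function names, the `nsp_weight` parameter, the toy logits, and the convention that label 0 means "sentence B follows sentence A" are all assumptions made for this example.

```python
import math

def cross_entropy(logits, target):
    """Negative log-probability of `target` under softmax(logits)."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]

def joint_loss(mlm_logits, mlm_targets, nsp_logits, nsp_label, nsp_weight=1.0):
    """Return (total, mlm, nsp), where total = mlm + nsp_weight * nsp.

    mlm_logits:  one list of vocabulary logits per masked position
    mlm_targets: the true token id for each masked position
    nsp_logits:  two logits for the sentence-pair classifier
    nsp_label:   0 or 1 (assumed convention: 0 = "B follows A")
    nsp_weight:  hypothetical knob for scaling the auxiliary term
    """
    # MLM loss: mean cross-entropy over the masked positions only.
    mlm = sum(
        cross_entropy(logits, target)
        for logits, target in zip(mlm_logits, mlm_targets)
    ) / len(mlm_targets)
    # NSP loss: a two-class cross-entropy on the sentence pair.
    nsp = cross_entropy(nsp_logits, nsp_label)
    return mlm + nsp_weight * nsp, mlm, nsp

# Toy inputs: two masked positions over a three-token vocabulary.
mlm_logits = [[2.0, 0.5, -1.0], [0.1, 0.2, 3.0]]
mlm_targets = [0, 2]
nsp_logits = [1.5, -0.5]
nsp_label = 0

total, mlm, nsp = joint_loss(mlm_logits, mlm_targets, nsp_logits, nsp_label)
print(f"MLM loss: {mlm:.4f}  NSP loss: {nsp:.4f}  total: {total:.4f}")
```

Because both terms feed one scalar, a single backward pass would update the shared encoder for both objectives at once. With `nsp_weight=1.0` the two losses are simply summed; other weightings are a design choice rather than something the text above prescribes.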

Updated 2026-05-02

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences
