1Cademy - Limitation of Next Sentence Prediction: Reliance on Superficial Cues

Learn Before

Next Sentence Prediction (NSP)

Concept

Limitation of Next Sentence Prediction: Reliance on Superficial Cues

A key criticism of the Next Sentence Prediction (NSP) task is that it may not be sufficiently challenging. This relative simplicity can encourage the model to learn to make predictions based on superficial evidence rather than developing a deeper semantic understanding of the relationship between sentences.

Updated 2026-01-15

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course

Learn After

A language model is being trained on a task where it must determine if Sentence B is the actual sentence that follows Sentence A in a document. Which of the following training pairs is most likely to encourage the model to learn a simple, superficial shortcut for this task, rather than developing a deeper understanding of semantic coherence?
Simplicity of NSP Task as a Cause for Reliance on Superficial Cues
Diagnosing a Language Model's Flawed Coherence Judgment
Unintended Learning in Sentence Relationship Models

Learn Before

Related

Learn After