Unintended Learning in Sentence Relationship Models
A language model is trained on a task where it must determine whether Sentence B is the sentence that actually follows Sentence A in a document. For negative examples (where B is not the next sentence), the training data is constructed by always pairing Sentence A with a random sentence drawn from a completely different document. Explain a potential superficial shortcut the model might learn from this setup, and why that shortcut fails to capture a true understanding of sentence coherence.
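For concreteness, here is a minimal Python sketch of the pair-construction scheme described above, using an invented two-document toy corpus. The sentences, the `make_nsp_pairs` helper, and the overlap threshold are all assumptions made for illustration, not any reference implementation. Because every negative pair crosses a document boundary, the IsNext label is perfectly confounded with a topic switch, so a "classifier" that only measures word overlap, ignoring sentence order and discourse entirely, can separate the training classes:

```python
import re
import random

# Toy corpus: each "document" is a short list of sentences on one topic.
# All sentences here are invented purely for illustration.
docs = [
    ["The chef diced the onions for the soup.",
     "The onions turned golden in the soup pot.",
     "The soup simmered over low heat."],
    ["The rover landed on the Martian surface.",
     "The rover's cameras scanned the Martian rocks.",
     "Images of the rocks reached mission control."],
]

def make_nsp_pairs(documents, seed=0):
    """Build (sentence_a, sentence_b, is_next) examples with the flawed
    scheme: every negative pairs A with a sentence from a DIFFERENT
    document, so the label is confounded with a topic switch."""
    rng = random.Random(seed)
    pairs = []
    for i, doc in enumerate(documents):
        for a, b in zip(doc, doc[1:]):
            pairs.append((a, b, 1))  # positive: the true next sentence
            other = rng.choice([d for j, d in enumerate(documents) if j != i])
            pairs.append((a, rng.choice(other), 0))  # cross-document negative
    return pairs

def overlap(a, b):
    """Crude topical-similarity proxy: Jaccard overlap of lowercased words."""
    wa = set(re.findall(r"[a-z']+", a.lower()))
    wb = set(re.findall(r"[a-z']+", b.lower()))
    return len(wa & wb) / len(wa | wb)

def shortcut_predict(a, b, threshold=0.15):
    """A 'model' that never looks at order or coherence; it only asks
    whether the two sentences appear to share a topic."""
    return 1 if overlap(a, b) > threshold else 0

pairs = make_nsp_pairs(docs)
acc = sum(shortcut_predict(a, b) == y for a, b, y in pairs) / len(pairs)
print(f"shortcut accuracy on the biased training pairs: {acc:.0%}")

# The same shortcut fails on an on-topic but incoherent pair:
# here B actually PRECEDES A in the document, so "B follows A" is false.
a, b = docs[0][2], docs[0][0]
print("reversed same-topic pair predicted as IsNext:", shortcut_predict(a, b))
```

On this toy data the overlap heuristic scores 100% on the biased training pairs yet still accepts the reversed same-topic pair, which is exactly the failure mode the question probes: a model trained this way can succeed by learning topic detection rather than any genuine notion of sentence coherence.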
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Related
A language model is being trained on a task where it must determine if Sentence B is the actual sentence that follows Sentence A in a document. Which of the following training pairs is most likely to encourage the model to learn a simple, superficial shortcut for this task, rather than developing a deeper understanding of semantic coherence?
Simplicity of NSP Task as a Cause for Reliance on Superficial Cues
Diagnosing a Language Model's Flawed Coherence Judgment