Causation

Simplicity of NSP Task as a Cause for Reliance on Superficial Cues

The Next Sentence Prediction (NSP) task can push a model toward superficial evidence because the prediction is often too easy. When the sentence pairs it sees are obviously connected, for example through shared topic words, the model can score well by learning simple correlations such as lexical or topical overlap, without developing a deeper, more robust understanding of semantic coherence.
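The effect can be sketched with a toy example. The snippet below is only an illustration under stated assumptions: the sentence pairs are hand-written and hypothetical, and the word-overlap rule (`overlap_predicts_next`) stands in for whatever shortcut a real model might learn. It is not how any actual NSP model is implemented.

```python
# Toy illustration: a word-overlap heuristic as a stand-in for a "superficial cue".
# All data and function names here are hypothetical.
STOPWORDS = {"the", "a", "an", "is", "was", "it", "in", "on", "of", "to", "and"}

def content_words(sentence):
    """Lowercase the sentence, strip punctuation, and drop stopwords."""
    tokens = (tok.strip(".,!?").lower() for tok in sentence.split())
    return {tok for tok in tokens if tok and tok not in STOPWORDS}

def overlap_predicts_next(sent_a, sent_b):
    """Predict IsNext if and only if the sentences share a content word."""
    return bool(content_words(sent_a) & content_words(sent_b))

# Each entry is (sentence A, sentence B, is_next).
# Positives share a topic word; negatives come from a different "document".
PAIRS = [
    ("The chef seasoned the soup.", "The soup simmered for an hour.", True),
    ("The rocket left the launch pad.", "The rocket reached orbit in minutes.", True),
    ("The striker scored a late goal.", "The goal won the match.", True),
    ("The chef seasoned the soup.", "The rocket reached orbit in minutes.", False),
    ("The rocket left the launch pad.", "The goal won the match.", False),
    ("The striker scored a late goal.", "The soup simmered for an hour.", False),
]

def accuracy(pairs):
    """Fraction of pairs where the overlap rule matches the label."""
    correct = sum(overlap_predicts_next(a, b) == label for a, b, label in pairs)
    return correct / len(pairs)

# A harder negative: it shares a word with sentence A but does not continue it.
HARD_NEGATIVE = ("The chef seasoned the soup.", "The soup kitchen closed last year.", False)

if __name__ == "__main__":
    print("accuracy on easy pairs:", accuracy(PAIRS))
    print("prediction on hard negative:", overlap_predicts_next(*HARD_NEGATIVE[:2]))
```

On the easy pairs the overlap rule labels every example correctly without modelling coherence at all, while the harder negative fools it because the two sentences share a word. This is the sense in which an easy prediction task rewards shortcuts.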

Updated 2025-10-10

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences