1Cademy - Potential for Learning Superficial Cues in Simple Prediction Tasks

Learn Before

Next Sentence Prediction (NSP)

Concept

Potential for Learning Superficial Cues in Simple Prediction Tasks

Some pre-training objectives, such as predicting the relationship between two simple, consecutive sentences, may not be sufficiently challenging. This can inadvertently train the model to rely on superficial patterns or 'easy evidence' for its predictions, rather than fostering a deeper, more robust understanding of language.

Updated 2025-10-06

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course

Learn After

Analysis of a Language Model's Training Objective
A language model is being trained to determine if two sentences are consecutive. For 'positive' examples, it is given two sentences that appear one after the other in a book. For 'negative' examples, the first sentence is from a book about astrophysics, and the second is always from a children's fairy tale. What is the most significant risk associated with this training design?
Evaluating a Model's Training Task

Learn Before

Related

Learn After