1Cademy - Diagnosing a Language Models Flawed Coherence Judgment

Learn Before

Limitation of Next Sentence Prediction: Reliance on Superficial Cues

Case Study

Diagnosing a Language Model's Flawed Coherence Judgment

Based on the description of the training process and the observed failure in the case study below, what is the most likely reason for the model's poor performance on the downstream task? Explain how the training setup encouraged this specific type of error.

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences