Case Study

Diagnosing a Language Model's Training Deficiency

Based on the model's performance described in the case study, which common pre-training objective was likely omitted or given insufficient weight during its training? Explain why including this objective as an auxiliary task would have addressed the observed performance gap.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science