Learn Before
Multiple Choice

A research team is training a large multilingual language model on a dataset containing English, Spanish, and Swahili. They observe that after an extensive number of training steps, the model's performance on a Swahili translation task begins to degrade, even though its performance on English remains strong and the overall training loss continues to decrease. Which of the following concepts best explains this specific outcome?

0

1

Updated 2025-09-29

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences

Foundations of Large Language Models Course

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science