Early Stopping in Multilingual Pre-training

Multilingual models can suffer from interference: after an extended period of training, the model's overall performance begins to decline. To counteract this, practical systems often use early stopping, which halts pre-training before the degradation sets in.
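As a rough illustration, early stopping is commonly implemented with a patience rule on a held-out metric. The sketch below is one possible version, not a method prescribed by the text: it assumes the monitored quantity is the mean validation loss across languages, and the names `EarlyStopper`, `run_with_early_stopping`, `patience`, and `min_delta`, as well as the loss values, are hypothetical.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Dict, Optional


@dataclass
class EarlyStopper:
    """Patience-based early stopping on a language-averaged validation loss (illustrative)."""
    patience: int = 2        # evaluations without improvement tolerated (assumed value)
    min_delta: float = 0.0   # smallest decrease that counts as an improvement
    best_loss: float = float("inf")
    best_step: Optional[int] = None
    bad_evals: int = 0

    def update(self, step: int, per_language_loss: Dict[str, float]) -> bool:
        """Record one evaluation; return True when training should stop."""
        avg = mean(per_language_loss.values())
        if avg < self.best_loss - self.min_delta:
            # New best average: remember it and reset the patience counter.
            self.best_loss = avg
            self.best_step = step
            self.bad_evals = 0
        else:
            self.bad_evals += 1
        return self.bad_evals >= self.patience


def run_with_early_stopping(history, patience=2):
    """Replay (step, per-language losses) evaluations; return (stop_step, best_step)."""
    stopper = EarlyStopper(patience=patience)
    for step, losses in history:
        if stopper.update(step, losses):
            return step, stopper.best_step
    return None, stopper.best_step


# Made-up validation losses: the average improves, then degrades.
history = [
    (1000, {"en": 2.9, "de": 3.3, "sw": 4.0}),
    (2000, {"en": 2.6, "de": 3.0, "sw": 3.7}),
    (3000, {"en": 2.5, "de": 2.9, "sw": 3.6}),
    (4000, {"en": 2.5, "de": 3.0, "sw": 3.8}),
    (5000, {"en": 2.6, "de": 3.1, "sw": 3.9}),
]
stop_step, best_step = run_with_early_stopping(history)
print(stop_step, best_step)
```

In a real training loop, one would typically save a checkpoint at each evaluation and restore the one from `best_step` once the stopper fires. Averaging over languages is only one choice; monitoring the worst-performing language or a downstream benchmark would be equally plausible under the same scheme.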

Updated 2026-04-18

Tags

Foundations of Large Language Models

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences