1Cademy - Factors Influencing Multilingual Pre-training

Learn Before

Multilingual and Language-Specific PTMs

Concept

Factors Influencing Multilingual Pre-training

For a given architecture, the effectiveness of a multilingual pre-trained model depends on several key configuration choices: the size of the shared vocabulary, the proportion of training data allocated to each language, and the overall size of the model.
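
To make the data-proportion factor concrete, here is a minimal sketch of one common way to set per-language sampling rates: exponent-smoothed sampling, where each language's raw data share p_i is raised to a power alpha and renormalized. The card does not prescribe this method. The function name `language_sampling_probs`, the value of `alpha`, and the corpus sizes are assumptions chosen purely for illustration.

```python
def language_sampling_probs(token_counts, alpha=0.5):
    """Return sampling probabilities q_i proportional to p_i ** alpha.

    token_counts maps a language code to its corpus size (hypothetical data).
    alpha = 1 keeps the raw data shares; alpha < 1 shifts probability mass
    from high-resource toward low-resource languages.
    """
    total = sum(token_counts.values())
    # Raw data share p_i of each language.
    shares = {lang: n / total for lang, n in token_counts.items()}
    # Smoothed, unnormalized weights p_i ** alpha.
    weights = {lang: p ** alpha for lang, p in shares.items()}
    norm = sum(weights.values())
    # Renormalize so the probabilities sum to 1.
    return {lang: w / norm for lang, w in weights.items()}


# Made-up corpus sizes in tokens, for illustration only.
counts = {"en": 9_000, "de": 900, "sw": 100}
probs = language_sampling_probs(counts, alpha=0.5)
```

With `alpha` below 1, the low-resource language (`sw` here) is sampled more often than its raw share of the data, at the cost of seeing high-resource languages less often. How far to push this trade-off is the kind of configuration choice the paragraph above refers to.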

Updated 2026-04-18
