A team is building a single encoder-decoder model intended to translate between Japanese, Korean, and Mandarin. They pre-train the model on a large, combined corpus of all three languages. However, instead of creating a unified vocabulary that includes tokens from all three languages, they use three separate, language-specific vocabularies. What is the most direct and critical consequence of this design choice on the model's translation performance?
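A minimal sketch of the core failure mode, using toy, hypothetical vocabularies and token IDs (real vocabularies would be far larger):

```python
# Three vocabularies built independently, one per language.
# Token IDs below are invented for illustration only.
vocab_ja = {"<pad>": 0, "学": 1, "校": 2}   # Japanese
vocab_ko = {"<pad>": 0, "학": 1, "교": 2}   # Korean
vocab_zh = {"<pad>": 0, "习": 1, "学": 2}   # Mandarin

# Problem 1: the same integer ID names a different token in each
# language, so one row of a shared embedding matrix would receive
# conflicting gradients for three unrelated symbols.
for name, vocab in [("ja", vocab_ja), ("ko", vocab_ko), ("zh", vocab_zh)]:
    id_to_token = {i: t for t, i in vocab.items()}
    print(f"{name}: ID 1 -> {id_to_token[1]}")   # 学 vs. 학 vs. 习

# Problem 2: the identical character "学" appears in both the Japanese
# and Mandarin corpora but receives unrelated IDs, so its two embeddings
# are never tied together and the shared-script overlap that would aid
# cross-lingual transfer is lost.
print("ID for 学:", "ja =", vocab_ja["学"], "| zh =", vocab_zh["学"])
```

A unified vocabulary built over the pooled corpus avoids both problems: every surface form maps to exactly one ID, so "学" shares a single embedding whether it appears in Japanese or Mandarin text, and no two languages compete for the same embedding row.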
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Evaluating Pre-training Strategies for a Bilingual Model
Diagnosing Cross-Lingual Representation Issues