Short Answer

Diagnosing Cross-Lingual Representation Issues

A developer is pre-training a single sequence-to-sequence model for translating between Spanish and Portuguese. They observe that the model struggles to find common patterns between the two languages, treating words like 'sol' (Spanish) and 'sol' (Portuguese), which both mean 'sun', as completely unrelated concepts. Explain why creating a unified vocabulary containing tokens from both languages is a critical step to resolve this issue and enable the model to learn shared representations.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science