Input Embedding in Cross-Lingual Language Models
In the work of Lample and Conneau on cross-lingual language models (XLM), the input embedding for each token is computed as the sum of three components: its token embedding, its positional embedding, and a language embedding. Including a language embedding requires assigning a language label to every token, which enables the model to distinguish tokens from different languages.
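A minimal PyTorch sketch of this summation is shown below. It is an illustration of the idea rather than the authors' original implementation: the class name, the embedding dimensions, and the language-id convention (0 for English, 1 for French in the usage example) are assumptions made here for clarity.

```python
import torch
import torch.nn as nn

class XLMInputEmbedding(nn.Module):
    """Sketch of an XLM-style input layer: token + position + language embeddings."""

    def __init__(self, vocab_size=64000, max_len=512, num_languages=2, d_model=1024):
        super().__init__()
        self.token_emb = nn.Embedding(vocab_size, d_model)    # one vector per token id
        self.pos_emb = nn.Embedding(max_len, d_model)         # learned positional embedding
        self.lang_emb = nn.Embedding(num_languages, d_model)  # one vector per language id

    def forward(self, token_ids, lang_ids):
        # token_ids, lang_ids: (batch, seq_len); lang_ids labels each token's language
        positions = torch.arange(token_ids.size(1), device=token_ids.device)
        positions = positions.unsqueeze(0).expand_as(token_ids)
        # The input embedding is the elementwise sum of the three embeddings.
        return self.token_emb(token_ids) + self.pos_emb(positions) + self.lang_emb(lang_ids)

# Usage: a batch with one English sentence (language id 0) and one French sentence (id 1).
emb = XLMInputEmbedding()
token_ids = torch.tensor([[5, 42, 7], [9, 13, 2]])
lang_ids = torch.tensor([[0, 0, 0], [1, 1, 1]])
print(emb(token_ids, lang_ids).shape)  # torch.Size([2, 3, 1024])
```

Because the language embedding is simply added to the other two, every token carries an explicit signal of which language it belongs to, which is what lets the per-token language labels influence the model's representations.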
Tags
Foundations of Large Language Models
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Bilingual Sentence Packing for Pre-training
Pre-training Strategy for a Multilingual Model
A researcher is pre-training a multilingual model using a masked language modeling (MLM) objective. To align the pre-training process with the specific methodology of Cross-Lingual Language Models (XLMs), what is the most crucial characteristic of the input data?
Core Training Principle of XLM
Translation Language Modeling