1Cademy - Pre-training Strategy for Zero-Shot Cross-Lingual Transfer

Learn Before

Bilingual Pre-training for Multilingual Models

Case Study

Pre-training Strategy for Zero-Shot Cross-Lingual Transfer

Based on the following scenario, which pre-training dataset should the research team prioritize to best achieve their goal? Justify your choice by explaining the mechanism through which the selected data helps the model develop the required cross-lingual capabilities.

Updated 2025-10-06

Contributors are: