Learn Before
Language Diversity in LLM Training
Challenges of Multilingual LLMs for Low-Resource Languages
While training LLMs on multilingual data is a powerful approach, a model's performance in a specific language is highly contingent on the volume and quality of the data for that language in the training set. This dependency often results in poor performance for low-resource languages, for which extensive, high-quality data is typically unavailable.
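To make the data-volume dependency concrete, here is a minimal sketch that computes each language's share of a training corpus and flags the languages falling below a cutoff. The token counts, the language codes, the function names, and the 1% threshold are all hypothetical, chosen only for illustration; they do not come from any real corpus or from an established definition of "low-resource".

```python
from typing import Dict, List


def language_shares(token_counts: Dict[str, int]) -> Dict[str, float]:
    """Return each language's fraction of the total token count."""
    total = sum(token_counts.values())
    if total == 0:
        raise ValueError("corpus is empty")
    return {lang: count / total for lang, count in token_counts.items()}


def low_resource_languages(
    token_counts: Dict[str, int], threshold: float = 0.01
) -> List[str]:
    """List languages whose corpus share is below `threshold` (assumed cutoff)."""
    shares = language_shares(token_counts)
    return sorted(lang for lang, share in shares.items() if share < threshold)


# Hypothetical token counts per language (illustrative values only).
counts = {"en": 900_000, "de": 80_000, "sw": 15_000, "yo": 5_000}

shares = language_shares(counts)
flagged = low_resource_languages(counts)
```

A share-based cutoff captures only data volume. Data quality, which the paragraph above names as the other factor, would need a separate measure.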
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences