Learn Before
  • Language Diversity in LLM Training

Challenges of Multilingual LLMs for Low-Resource Languages

While training LLMs on multilingual data is a powerful approach, a model's performance in a specific language is highly contingent on the volume and quality of the data for that language in the training set. This dependency often results in poor performance for low-resource languages, for which extensive, high-quality data is typically unavailable.

0

1

18 days ago

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related
  • Challenges of Multilingual LLMs for Low-Resource Languages