Learn Before

Curse of Dimensionality in Traditional Language Models

Multiple Choice

A data scientist is building a language model to predict the next word in a sequence. The model estimates the probability of a word based on the four words that precede it, using counts from a massive text corpus. Despite the large training dataset, the model performs poorly on new sentences, frequently assigning a probability of zero to perfectly plausible word sequences. Which of the following statements best analyzes the fundamental reason for this failure?
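The setup in the question can be made concrete with a minimal sketch of a count-based model that conditions on the four preceding words. The toy corpus and the function names `train_ngram_counts` and `mle_probability` are assumptions chosen for illustration, not part of the original question. The sketch uses plain maximum-likelihood counts with no smoothing, and it compares the number of four-word contexts actually observed with the number that are possible for the vocabulary.

```python
from collections import Counter

def train_ngram_counts(tokens, n=5):
    """Count every n-gram and its (n-1)-word context in a token list."""
    ngram_counts = Counter()
    context_counts = Counter()
    for i in range(len(tokens) - n + 1):
        ngram = tuple(tokens[i:i + n])
        ngram_counts[ngram] += 1
        context_counts[ngram[:-1]] += 1
    return ngram_counts, context_counts

def mle_probability(ngram_counts, context_counts, context, word):
    """Unsmoothed estimate: count(context + word) / count(context)."""
    context = tuple(context)
    seen = context_counts[context]
    if seen == 0:
        return 0.0  # the context itself never occurred in training
    return ngram_counts[context + (word,)] / seen

# Toy corpus (hypothetical, for illustration only).
corpus = "the cat sat on the mat and the dog sat on the rug".split()
ngram_counts, context_counts = train_ngram_counts(corpus, n=5)

# A sequence that appears verbatim in the corpus gets a nonzero estimate.
p_seen = mle_probability(ngram_counts, context_counts,
                         ["the", "cat", "sat", "on"], "the")

# "cat sat on the rug" is plausible, but that exact 5-gram was never observed.
p_unseen = mle_probability(ngram_counts, context_counts,
                           ["cat", "sat", "on", "the"], "rug")

# Possible 4-word contexts grow as |V|**4; only a tiny share is ever observed.
vocab_size = len(set(corpus))
possible_contexts = vocab_size ** 4
observed_contexts = len(context_counts)

print(p_seen, p_unseen, possible_contexts, observed_contexts)
```

Even in this tiny example, most of the possible four-word contexts never occur in the training text, so any unseen combination receives an estimate of exactly zero unless some form of smoothing or generalization across similar contexts is added.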

Updated 2025-10-01
