1Cademy - Consider two language models that use an external datastore of (context -> next word) examples to help generate text. - **Model X** populates its datastore only with examples from the specific document it is currently generating. - **Model Y**s datastore is pre-filled with millions of examples from a vast and diverse library of texts before it begins generating any new document. When asked to complete a sentence about a niche historical fact not mentioned earlier in the current document, which model is more likely to perform better and why?

Learn Before

Extending k-NN Datastore Context with a Training Dataset

Multiple Choice

Consider two language models that use an external datastore of (context -> next word) examples to help generate text.

Model X populates its datastore only with examples from the specific document it is currently generating.
Model Y's datastore is pre-filled with millions of examples from a vast and diverse library of texts before it begins generating any new document.

When asked to complete a sentence about a niche historical fact not mentioned earlier in the current document, which model is more likely to perform better and why?

Updated 2025-09-26

Contributors are:

Who are from:

Learn Before

Related