Learn Before
Explaining the k-NN LM Rationale
A colleague is building a language model and suggests adding a mechanism to retrieve past, similar contexts from a large datastore to improve next-word prediction. Explain the fundamental assumption about the model's internal representations that must be true for this approach to be effective.
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Comprehension in Revised Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
The effectiveness of a certain retrieval-augmented language model relies on the principle that hidden states with high similarity are strong predictors of similar subsequent tokens. Which of the following scenarios presents the most significant challenge to the validity of this core principle?
Predicting Model Behavior from Hidden States
Explaining the k-NN LM Rationale