Learn Before
Predicting Model Behavior from Hidden States
Based on the high similarity between the two hidden states described in the case study, what can you infer about the next word the model is likely to predict in each case? Explain the core principle that justifies your inference.
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
The effectiveness of a certain retrieval-augmented language model relies on the principle that hidden states with high similarity are strong predictors of similar subsequent tokens. Which of the following scenarios presents the most significant challenge to the validity of this core principle?
Predicting Model Behavior from Hidden States
Explaining the k-NN LM Rationale
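
The principle named in the related question above, that similar hidden states tend to precede similar next tokens, can be illustrated with a small nearest-neighbor lookup. This is only a sketch under stated assumptions: the three-dimensional vectors, the tokens, the `datastore` layout, and the helper names `cosine` and `knn_next_token_probs` are all hypothetical, and cosine similarity with a softmax temperature is one possible choice of similarity measure, not necessarily the one used by any particular retrieval-augmented model.

```python
import math
from collections import defaultdict


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def knn_next_token_probs(query, datastore, k=2, temperature=0.1):
    """Estimate a next-token distribution from the k stored hidden states
    most similar to `query`.

    `datastore` is a list of (hidden_state, next_token) pairs (hypothetical
    layout). Each neighbor votes for its recorded next token with weight
    exp(similarity / temperature); votes are then normalized to sum to 1.
    """
    scored = sorted(
        ((cosine(query, state), token) for state, token in datastore),
        reverse=True,
    )[:k]
    weights = defaultdict(float)
    for similarity, token in scored:
        weights[token] += math.exp(similarity / temperature)
    total = sum(weights.values())
    return {token: weight / total for token, weight in weights.items()}


# Toy datastore (made-up values): two similar states were followed by
# "Paris", and one dissimilar state was followed by "blue".
datastore = [
    ([0.9, 0.1, 0.0], "Paris"),
    ([0.8, 0.2, 0.1], "Paris"),
    ([0.0, 0.1, 0.9], "blue"),
]

# A new hidden state that lies close to the first two entries.
query = [0.85, 0.15, 0.05]
probs = knn_next_token_probs(query, datastore, k=2)
print(probs)
```

Because both nearest neighbors of `query` were followed by the same token, the lookup places all of its probability mass on that token. The principle is weakest exactly where this toy setup would fail: when two contexts yield nearly identical hidden states but were followed by different tokens.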