Learn Before
A language model is designed to use both its recent conversational history (local memory) and relevant facts retrieved from a large knowledge base (retrieved memory). The chosen integration method is to simply concatenate the Key-Value pairs from both sources into a single, larger memory block before the attention mechanism processes them. What is the most significant architectural trade-off of this specific approach?
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is designed to use both its recent conversational history (local memory) and relevant facts retrieved from a large knowledge base (retrieved memory). The chosen integration method is to simply concatenate the Key-Value pairs from both sources into a single, larger memory block before the attention mechanism processes them. What is the most significant architectural trade-off of this specific approach?
Analyzing Computational Cost of a Memory Integration Strategy
Evaluating a Memory Integration Method