Learn Before
Hybrid Cache Update Process
The update mechanism for a hybrid cache involves a continuous flow of information from a local, high-fidelity memory to a compressed, long-term memory. As new key-value pairs are generated, they are added to the local memory, which maintains a fixed-size window of the most recent items. This addition forces the oldest key-value pairs to be evicted from the local memory. These evicted pairs are then passed through a compression function before being stored in the compressed memory, which maintains a summarized history of older context.

0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Compressive Transformer Memory Architecture
Hybrid Cache Update Process
A memory management strategy for processing long sequences involves two components: a 'local memory' that stores a fixed number of the most recent data points in their original form, and a 'compressed memory' that stores older data points in a summarized form after they are moved from the local memory. Which statement best analyzes the fundamental trade-off this two-level system is designed to address?
A new key-value pair is generated for a long sequence being processed by a model that uses a two-level hybrid memory system. Arrange the following events in the correct chronological order.
Hybrid Cache State Analysis
Consequences of Lossy Compression in a Hybrid Cache
Learn After
A memory system uses a two-level structure: a full, fixed-size local memory for recent items and a compressed memory for older items. When a new key-value pair is generated, several actions occur to update the system. Arrange the following actions in the correct chronological order.
Hybrid Cache State Transition
A memory system for handling long sequences uses a two-level cache: a fixed-size local memory for recent, high-fidelity items and a compressed memory for a summarized history of older items. When the local memory is full and a new key-value pair is added, the oldest pair is evicted. What is the immediate fate of this evicted key-value pair according to this design?