Consequences of Bounded Memory in Text Summarization
A large language model is tasked with summarizing a very long document. To bound memory use, it maintains a fixed-size Key-Value (KV) cache that stores information only from recently processed text. If the document is much longer than the cache's capacity, describe a specific flaw that is likely to appear in the generated summary, and explain why that flaw occurs.
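The failure mode the question points at can be made concrete with a toy sketch. The snippet below is an illustrative model of a sliding-window KV cache, not a real LLM implementation; the class name and methods are invented for this example. It shows that once the document exceeds the cache capacity, entries from the beginning are silently evicted, so the model can no longer attend to the opening of the document when generating the summary.

```python
from collections import deque

class FixedSizeKVCache:
    """Toy sliding-window KV cache (illustrative only, not a real LLM API).

    When the cache is full, the oldest key/value pair is evicted, so
    information from the start of a long document is silently lost.
    """

    def __init__(self, capacity):
        self.capacity = capacity
        # deque with maxlen drops the oldest entry automatically when full
        self.entries = deque(maxlen=capacity)

    def append(self, position, kv_pair):
        self.entries.append((position, kv_pair))

    def visible_positions(self):
        # Token positions the model can still attend to
        return [pos for pos, _ in self.entries]

# A 10,000-token "document" processed with a 512-entry cache:
cache = FixedSizeKVCache(capacity=512)
for position in range(10_000):
    cache.append(position, f"kv_{position}")

# Only the most recent 512 positions remain attendable; everything
# from the opening of the document (positions 0..9487) is gone.
print(cache.visible_positions()[0])   # 9488
print(len(cache.entries))             # 512
```

A summary generated from this cache state could only draw on the document's final pages, which is why the summary tends to omit or misstate content introduced early on (e.g. the thesis, key definitions, or characters set up at the start).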
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is designed to process extremely long sequences of text during inference. To manage computational resources, it is implemented with a key-value (KV) cache that has a fixed, limited size. What is the primary trade-off inherent in this specific implementation choice?
Optimizing a Conversational AI for Memory-Constrained Devices
Components of Fixed-Size KV Caches