Short Answer

Comparing KV Cache Memory Growth

An auto-regressive language model is processing an extremely long document. Compare the growth of its Key-Value (KV) cache memory usage over time under two different scenarios: (1) a standard caching mechanism that stores all previous tokens, and (2) a windowed caching mechanism that only stores the most recent 1024 tokens. Explain the fundamental difference in their space complexity as the sequence length increases.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science