Learn Before
Trade-off of Fixed-Size Global Memory
A primary drawback of using a fixed-size global memory, such as a set number of global tokens, is the potential for information loss. While this approach keeps computational costs bounded, the fixed capacity may be too small to represent the full context of very long sequences, creating a trade-off between efficiency and representational fidelity.
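As a rough illustration of this trade-off, the sketch below (plain NumPy; the function name global_token_mask and the sequence lengths are illustrative, not from this card) builds an attention mask in which a fixed set of global tokens connects the whole sequence. The number of attended pairs grows roughly linearly with sequence length rather than quadratically, but the shared memory stays at the same small number of slots no matter how long the input gets.

```python
import numpy as np

def global_token_mask(seq_len: int, num_global: int) -> np.ndarray:
    """Boolean mask, True where attention is allowed: the first num_global
    positions act as global tokens that read, and are read by, every token."""
    mask = np.zeros((seq_len, seq_len), dtype=bool)
    mask[:num_global, :] = True   # global tokens attend to the full sequence
    mask[:, :num_global] = True   # every token attends to the global tokens
    np.fill_diagonal(mask, True)  # each token always sees itself
    return mask

for n in (256, 1024, 4096):
    m = global_token_mask(n, num_global=16)
    # Cost scales ~linearly with n, but capacity is still only 16 slots.
    print(f"n={n:>4}: attended pairs = {m.sum():>7} (full attention: {n * n})")
```

The fixed 16-slot memory is exactly where the representational-fidelity loss described above occurs: every extra input token competes for the same constant-size summary.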
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Performance Stabilization via Global Tokens
Trade-off of Fixed-Size Global Memory
An engineer is optimizing a model for processing extremely long text sequences. To reduce the computational load, the model is designed so that each token primarily attends to a limited local neighborhood of other tokens. The engineer observes that the model struggles to connect information from the end of a document back to key concepts introduced in the very first paragraph. Which of the following modifications best addresses this issue by providing a form of global context without sacrificing overall computational efficiency?
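One common remedy, in the spirit of Longformer-style sparse attention, is to keep the local window but designate a handful of tokens as global. The sketch below (assumed NumPy implementation; function name, window size, and positions are illustrative) shows that two positions far outside each other's local windows become reachable in two attention hops through a global token, while the mask stays sparse.

```python
import numpy as np

def local_plus_global_mask(seq_len: int, window: int, num_global: int) -> np.ndarray:
    """True where attention is allowed: a sliding local window, plus a few
    designated global tokens that every position can read and be read by."""
    idx = np.arange(seq_len)
    mask = np.abs(idx[:, None] - idx[None, :]) <= window  # local neighborhood
    mask[:num_global, :] = True   # global tokens attend everywhere
    mask[:, :num_global] = True   # ...and are visible to every token
    return mask

mask = local_plus_global_mask(seq_len=1024, window=4, num_global=2)

# Position 10 (first paragraph) and position 1023 (end of the document) lie
# far outside each other's local windows, so there is no direct link...
print("one hop, 10 -> 1023:", bool(mask[1023, 10]))
# ...but a global token bridges them in two hops (10 -> global -> 1023),
# where purely local attention would need ~(1023 - 10) / 4 ≈ 250 hops.
two_hops = (mask.astype(int) @ mask.astype(int)) > 0
print("two hops, 10 -> 1023:", bool(two_hops[1023, 10]))
```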
Analyzing Attention Mechanisms for Long Sequences
Evaluating a Hybrid Attention Strategy
Learn After
Architectural Design for a Document Summarizer
An AI development team is building a model to analyze and summarize entire novels. They decide to use an architecture where the first 16 tokens of any input sequence serve as a shared memory accessible by all other tokens. When this model processes a particularly long and complex novel, what is the most significant challenge this fixed-size memory approach is likely to introduce?
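For intuition about why the fixed 16-token memory becomes the bottleneck here, a back-of-the-envelope check (the hidden size of 768 and the token counts are assumptions, not from this question) shows how many input tokens each global slot must summarize as the input grows from a paragraph to a full novel:

```python
# Back-of-the-envelope check on a fixed 16-token shared memory.
# hidden_size = 768 is an assumed model dimension, not from this card.
hidden_size = 768
num_global = 16
memory_floats = num_global * hidden_size  # fixed, regardless of input length

for seq_len in (1_000, 30_000, 150_000):  # paragraph, chapter, full novel
    print(f"seq_len={seq_len:>7,}: ~{seq_len // num_global:>6,} tokens per "
          f"global slot (memory fixed at {memory_floats:,} floats)")
```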
A key benefit of using a fixed-size set of global tokens for memory in a language model is that it guarantees complete and accurate representation of context for any input length, with the only trade-off being a moderate increase in computational cost.