Fixed-Size Memory for Constant Attention Cost
If the memory component used in the attention operation is defined as a fixed-size variable, the computational cost of the attention function is also fixed. Because the keys and values are represented by this fixed-size memory, the cost of attending to context stays constant regardless of the sequence length. This foundational idea opens up several alternative ways to design the memory.
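A minimal sketch of the idea in NumPy, assuming the fixed-size memory is held as a small set of key/value vectors (the names mem_keys, mem_values, and mem_slots are illustrative, and the scheme for writing compressed context into the memory is not shown):

```python
import numpy as np

def attention(query, keys, values):
    # Scaled dot-product attention over whatever keys/values are provided.
    scores = query @ keys.T / np.sqrt(query.shape[-1])
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ values

d_model = 64
mem_slots = 16  # fixed number of memory slots, independent of sequence length

# Fixed-size memory: mem_slots key/value vectors, e.g. summaries of the
# tokens processed so far (random here, since the write rule is not the point).
mem_keys = np.random.randn(mem_slots, d_model)
mem_values = np.random.randn(mem_slots, d_model)

# A new query token attends to only mem_slots entries, so the per-step cost
# of this call stays constant no matter how long the sequence has grown.
query = np.random.randn(1, d_model)
context = attention(query, mem_keys, mem_values)  # shape (1, d_model)
```

Contrast this with a standard KV cache, where keys and values accumulate one entry per token, so the same attention call grows linearly in cost with sequence length.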
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
General Form of Memory-Based Attention
Fixed-Size Memory for Constant Attention Cost
Multiple Memory Models in Attention
A language model is tasked with processing an extremely long document. How does an attention mechanism that uses a separate, fixed-size memory component to represent context differ from a standard attention mechanism in managing the information from the beginning of the document as it generates new text?
Managing Context in Long-Sequence Generation
Memory Models vs. Efficient Attention for Cache Optimization
Optimizing a Chatbot for Long Conversations
Notation for Key-Value Pairs
Architectural Strategies for Long-Context Processing
Learn After
Fixed-Size Window Memory as a Form of Local Attention
Summary Vectors for Memory Compression in Attention
General Recurrent Formula for Memory Update
Comparison of Memory Storage in Window-based and Moving Average Caches
Hybrid Cache for Attention Mechanisms
An attention mechanism is designed to use a memory component that has a constant, fixed size, regardless of how long the input sequence becomes. What is the primary computational consequence of this design choice as the input sequence length increases significantly?
Computational Cost Scaling in Attention Mechanisms
Optimizing a Real-Time Sequence Processing Model