Formula for Fixed-Size Window Memory

To construct the memory component, denoted $\mathrm{Mem}$, in a fixed-size window attention mechanism, a contiguous slice of the most recent key and value vectors is extracted. Specifically, this slice spans from index $i - n_c + 1$ up to the current processing step $i$, where $n_c$ is the capacity of the context window, so the slice holds $n_c$ positions. The relationship is expressed as: $\mathrm{Mem} = (\mathbf{K}_{[i - n_c + 1,\, i]}, \mathbf{V}_{[i - n_c + 1,\, i]})$.
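The slicing in the formula can be sketched in a few lines of plain Python. This is a minimal illustration, not a reference implementation: the function name `window_memory` and the toy key/value lists are hypothetical, positions are treated as 1-based and inclusive to match the formula, and clamping the window to start at position 1 when fewer than $n_c$ positions exist is an assumption the text does not state.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def window_memory(
    keys: Sequence[Vector],
    values: Sequence[Vector],
    i: int,
    n_c: int,
) -> Tuple[List[Vector], List[Vector]]:
    """Return (K[i-n_c+1 .. i], V[i-n_c+1 .. i]) with 1-based, inclusive positions.

    Assumption (not from the text): if fewer than n_c positions exist,
    the window is clamped so that it starts at position 1.
    """
    if n_c < 1:
        raise ValueError("n_c must be at least 1")
    if len(keys) != len(values) or not 1 <= i <= len(keys):
        raise ValueError("i must index an existing position in equal-length keys/values")
    start = max(1, i - n_c + 1)  # first 1-based position inside the window
    # Positions start..i (1-based, inclusive) map to the 0-based slice [start-1 : i].
    return list(keys[start - 1 : i]), list(values[start - 1 : i])


# Toy cache: the key at position t is [t], the value is [10 * t].
keys = [[float(t)] for t in range(1, 7)]
values = [[10.0 * t] for t in range(1, 7)]

mem_k, mem_v = window_memory(keys, values, i=5, n_c=3)
print(mem_k)  # [[3.0], [4.0], [5.0]]
print(mem_v)  # [[30.0], [40.0], [50.0]]
```

With $i = 5$ and $n_c = 3$, the window covers positions 3 through 5, so older keys and values (positions 1 and 2) are dropped from the memory.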

Updated 2026-05-02

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences
