Updating the KV Cache
The procedure for updating the Key-Value (KV) cache at a given position is an essential operation during autoregressive sequence generation. Specifically, at a new position i, the newly generated key vector (k_i) and value vector (v_i) are appended to their respective cache matrices, K and V. Using a function Append(M, m) that adds a row vector m to a matrix M, the update rule is defined as K ← Append(K, k_i) and V ← Append(V, v_i). This mechanism maintains a history of key-value pairs, enabling a Transformer decoder to attend to past context efficiently.
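The append-style update above can be sketched in a few lines of NumPy. This is a minimal illustration, not a production implementation: the helper name append_row, the head dimension d, and the random stand-in vectors are all assumptions made for the example.

```python
import numpy as np

def append_row(M, m):
    # Append a length-d row vector m to an (n x d) matrix M, giving (n+1) x d.
    # This plays the role of the Append(M, m) function in the text.
    return np.vstack([M, m[None, :]])

d = 4                  # head dimension (illustrative)
K = np.zeros((0, d))   # empty key cache before generation starts
V = np.zeros((0, d))   # empty value cache

# Simulate three decoding steps: each step produces one new key/value pair,
# which is appended to the cache rather than recomputing past positions.
for step in range(3):
    k_new = np.random.randn(d)
    v_new = np.random.randn(d)
    K = append_row(K, k_new)  # K <- Append(K, k_i)
    V = append_row(V, v_new)  # V <- Append(V, v_i)

print(K.shape, V.shape)  # (3, 4) (3, 4)
```

After N steps the caches hold N rows each, so attention at step N+1 can read all past keys and values directly from K and V.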
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Single-Step Generation with a KV Cache
Updating the KV Cache
In a self-attention layer processing an input sequence of two tokens, let the input vector for the first token be x_1 and for the second token be x_2. The layer generates a query vector q_1 (for the first token) and a key vector k_2 (for the second token). Which statement accurately describes the relationship between these inputs and generated vectors?
Correcting a Misconception in Vector Generation
Calculating a Query Vector in Self-Attention
In a standard self-attention mechanism, an input vector is transformed into three separate vectors (Query, Key, and Value) using three distinct, learned weight matrices. Imagine a modified self-attention layer where these three weight matrices are constrained to be identical. What would be the most direct consequence of this change?
Space Complexity of the KV Cache
Updating the KV Cache
Two-Phase Inference from a KV Cache Perspective
Single-Step Generation with a KV Cache
Memory Allocation for KV Caching in Standard Self-Attention
Multi-Dimensional Structure of the KV Cache
An autoregressive language model generates text one word at a time. To generate the 100th word, it must relate it to all 99 previous words. A common optimization involves storing in memory the intermediate representations for each of the first 99 words as they are generated.
Which statement best analyzes the primary computational advantage of this optimization compared to re-computing everything from scratch at step 100?
Chatbot Performance Degradation
Computational Steps in Cached Inference
Diagnosing and Redesigning KV-Cache Memory Behavior in a Multi-Tenant LLM Serving Stack
Choosing a KV-cache strategy for shared-prefix traffic under GPU memory pressure
Evaluating a serving design that combines prefix caching with paged KV memory under mixed prompt lengths
Stabilizing latency and GPU memory in a chat-completions service with shared system prompts
Post-incident analysis: KV-cache growth, fragmentation, and shared-prefix reuse in a streaming LLM service
Root-cause and mitigation plan for OOMs and latency spikes during shared-prefix, long-generation traffic
You run an internal LLM inference service for empl...
Your company’s internal LLM service handles many c...
You operate a GPU-backed LLM service that uses con...
You’re on-call for an internal LLM chat service. M...
Learn After
Single-Step Generation with a KV Cache
Formula for Updating the Key Matrix in the KV Cache
Formula for Updating the Value Matrix in the KV Cache
Example of a Single-Step KV Cache Update
During autoregressive text generation, a model has already processed N tokens and stored their corresponding key and value vectors in a cache. When the model processes the (N+1)-th token, how is this cache utilized and modified to compute the output for this new step?
An autoregressive model is generating a sequence and has just processed the token at position t. The Key-Value cache currently stores the key and value vectors for all tokens from position 1 to t. As the model processes the next token at position t+1, which statement correctly describes how the cache is updated and used for the attention calculation at this new step?
Notation for Current Query, Key, and Value Vectors (q', k', v')
Diagram of a Single-Step KV Cache Update and Attention
Debugging a Flawed KV Cache Implementation