Formula

Formula for Updating the Value Matrix in the KV Cache

During autoregressive inference, the Value matrix ($\mathbf{V}$) in the KV cache is expanded at each step. The new value vector, $\mathbf{v}_{i'}$, corresponding to the current token, is appended to the existing matrix of values. This update operation is expressed by the formula:

$$\mathbf{V} = \text{Append}(\mathbf{V}, \mathbf{v}_{i'})$$
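The append step can be illustrated with a minimal sketch in plain Python. It assumes the cache stores one row per processed token and that every value vector has the same width. The helper name `append_value` and the toy numbers are hypothetical, chosen only for illustration; real inference code would typically use tensors and may preallocate memory instead of growing the matrix.

```python
from typing import List

Vector = List[float]
Matrix = List[Vector]  # assumption: one row per cached token


def append_value(V: Matrix, v_new: Vector) -> Matrix:
    """Return V with v_new added as the last row, i.e. V = Append(V, v_new)."""
    if V and len(V[0]) != len(v_new):
        raise ValueError("value vector width does not match the cache")
    # Build a new list so the caller's original matrix is left untouched.
    return V + [list(v_new)]


# Hypothetical toy cache: two tokens already processed, vector width 3.
V = [[0.1, 0.2, 0.3],
     [0.4, 0.5, 0.6]]
v_new = [0.7, 0.8, 0.9]  # value vector for the current token

V = append_value(V, v_new)
print(len(V), len(V[0]))  # row count grows by one; width is unchanged
```

Each decoding step repeats this call with the newest token's value vector, so the number of rows in the cached matrix equals the number of tokens processed so far.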

Updated 2026-05-03

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences