Definition

Value Matrix for Causal Attention (V_≤i)

In a causal attention mechanism, the value matrix for a given position $i$, denoted $\mathbf{V}_{\le i}$, is formed by vertically stacking all value vectors from the beginning of the sequence up to and including position $i$. This matrix represents the set of all values that can contribute to the output for the query at position $i$. It is defined as:

$$\mathbf{V}_{\le i} = \begin{bmatrix} \mathbf{v}_0 \\ \vdots \\ \mathbf{v}_i \end{bmatrix}$$
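The stacking in this definition can be sketched in a few lines of NumPy. This is a minimal illustration, not a reference implementation: it assumes each value vector is a row of dimension `d`, so that the result has shape `(i + 1, d)`, and the names `causal_value_matrix`, `values`, and `weights` are hypothetical. The uniform weights stand in for real attention weights only to show how the rows of the matrix would be combined.

```python
import numpy as np


def causal_value_matrix(values: np.ndarray, i: int) -> np.ndarray:
    """Return the rows v_0 .. v_i of `values`, stacked vertically.

    `values` is assumed to have shape (sequence_length, d), one value
    vector per row; the result has shape (i + 1, d).
    """
    if not 0 <= i < values.shape[0]:
        raise IndexError(f"position {i} is outside the sequence")
    return values[: i + 1]


# Toy sequence of 4 value vectors with d = 3 (illustrative data only).
values = np.arange(12, dtype=float).reshape(4, 3)

# Value matrix visible to the query at position i = 2: rows 0, 1, 2.
V_le_2 = causal_value_matrix(values, 2)

# Placeholder weights (assumption): uniform over the i + 1 visible positions.
weights = np.full(V_le_2.shape[0], 1.0 / V_le_2.shape[0])
output_2 = weights @ V_le_2  # weighted sum of the visible value vectors
```

Because the slice stops at row `i`, value vectors at later positions cannot influence `output_2`, which is the property the definition is meant to capture.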


Updated 2026-05-02


Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.5 Inference - Foundations of Large Language Models