Value Matrix for Causal Attention (V_≤i)
In a causal attention mechanism, the value matrix for a given position i, denoted as V_≤i, is formed by vertically stacking all value vectors from the beginning of the sequence up to and including position i. This matrix represents the set of all values that can contribute to the output for the query at position i. It is defined as:

V_≤i = [v_0; v_1; …; v_i]

where each v_j is the value (row) vector for token j, and the semicolons denote vertical stacking, so V_≤i has i + 1 rows.
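As a concrete sketch of this stacking (a hypothetical example assuming NumPy, row value vectors, and made-up numbers; `value_matrix_upto` is an illustrative name, not an API from any library):

```python
import numpy as np

# Hypothetical value vectors for a 4-token sequence, each of dimension d_v = 3.
values = [
    np.array([1.0, 0.0, 2.0]),  # v_0
    np.array([0.5, 1.5, 0.0]),  # v_1
    np.array([2.0, 2.0, 1.0]),  # v_2
    np.array([0.0, 1.0, 3.0]),  # v_3
]

def value_matrix_upto(values, i):
    """Return V_<=i: rows v_0 .. v_i stacked vertically (causal masking by slicing)."""
    return np.stack(values[: i + 1], axis=0)

V_le_2 = value_matrix_upto(values, 2)
print(V_le_2.shape)  # i + 1 = 3 rows, each of dimension d_v = 3
```

Note that the query at position i never sees v_{i+1}, …: the slice `values[: i + 1]` is what enforces causality here.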
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.5 Inference - Foundations of Large Language Models
Related
Key Matrix from a Sliding Window
Consider the following three row vectors: r_1 = [5, 0, 3], r_2 = [1, 2, 8], and r_3 = [4, 7, 6]. If a matrix M is constructed by vertically stacking these vectors in the order r_1, r_2, then r_3 (with r_1 as the top row), what is the resulting matrix M?
A matrix M is formed by vertically stacking its row vectors, m_0, m_1, and m_2. Given the matrix M shown below, identify the row vector m_1.
A matrix A is constructed by vertically stacking four row vectors, where each row vector contains five elements. The resulting matrix A will have 5 rows and 4 columns.
Single-Query Attention Computation with Multiplicative Scaling
Scaled Dot-Product Attention
General Attention Formula
Value Matrix for Causal Attention (V_≤i)
Value Matrix from a Sliding Window
An attention mechanism processes an input sequence of 20 tokens, where each token is represented by a 256-dimensional vector. A Value matrix (V) is generated as part of this process. Which of the following statements most accurately describes the properties and role of this V matrix?
Determining Value Matrix Dimensions
Debugging an Attention Mechanism
Learn After
Causal Attention Input Structure
An autoregressive model processes an input sequence of 5 tokens, indexed 0 through 4. When calculating the output for the token at index 3, the attention mechanism needs to access a specific set of 'value' vectors from the sequence. Which of the following correctly describes the collection of value vectors available to the query at index 3?
Causal Attention Value Matrix Dimensions
An autoregressive model processes an input sequence one token at a time. At each position i, it constructs a matrix containing all value vectors from the beginning of the sequence up to and including position i. Arrange the matrices below in the order they would be constructed as the model processes the first three positions (indexed 0, 1, and 2).