Learn Before
General Attention Formula
The general attention mechanism maps a set of queries, keys, and values to an output. This output is calculated as a weighted sum of the value vectors, where the weights are determined by a compatibility function between the queries and keys. The matrix form of this operation is: Attention(Q, K, V) = α V, where α = Softmax(QKᵀ / √d). In this formula, Q, K, and V are the query, key, and value matrices, respectively. The term α represents the attention weight matrix, which has dimensions of m × m, where m is the sequence length.
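The formula above can be sketched in a few lines of NumPy. This is a minimal illustration, assuming the scaled dot-product compatibility function Softmax(QKᵀ/√d); the function name and the example dimensions (a sequence of 20 tokens with 256-dimensional vectors) are illustrative, not prescribed by the card.

```python
import numpy as np

def attention(Q, K, V):
    """General attention: output = Softmax(QK^T / sqrt(d)) V (illustrative sketch)."""
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)                 # (m, m) compatibility scores
    scores -= scores.max(axis=-1, keepdims=True)  # stabilize the softmax
    alpha = np.exp(scores)
    alpha /= alpha.sum(axis=-1, keepdims=True)    # attention weight matrix, (m, m)
    return alpha @ V, alpha                       # output: weighted sum of rows of V

m, d = 20, 256  # sequence length and vector dimension
rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((m, d)) for _ in range(3))
out, alpha = attention(Q, K, V)
print(alpha.shape, out.shape)  # (20, 20) (20, 256)
```

Note that α is m × m regardless of the vector dimension d: each of its m rows is a probability distribution over the m value vectors.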

Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.5 Inference - Foundations of Large Language Models
Related
Single-Query Attention Computation with Multiplicative Scaling
Scaled Dot-Product Attention
General Attention Formula
Value Matrix for Causal Attention (V_≤i)
Value Matrix from a Sliding Window
An attention mechanism processes an input sequence of 20 tokens, where each token is represented by a 256-dimensional vector. A Value matrix (V) is generated as part of this process. Which of the following statements most accurately describes the properties and role of this V matrix?
Determining Value Matrix Dimensions
Debugging an Attention Mechanism
Learn After
Attention Weight Matrix (α)
Sparse Attention
Self-attention layers' first approach
In a general attention mechanism, the output is calculated as a weighted sum of the Value vectors, where the weights are determined by the interaction between Query and Key vectors. The standard formula is: output = Softmax(QKᵀ / √d) V. Consider a scenario where this formula is mistakenly altered. What is the most significant consequence of this modification?
Dimensional Analysis of the Attention Formula
Applying the Attention Mechanism Roles
Self-Attention Output Formula for a Single Query