Learn Before
Generation of Query, Key, and Value Vectors in Self-Attention
In a self-attention layer, the Query (Q), Key (K), and Value (V) vectors are not the direct inputs themselves but are generated through linear transformations of the same input sequence. This input is typically the output from the preceding layer. Each vector in the input sequence is multiplied by three distinct weight matrices (W_Q, W_K, and W_V) to produce its corresponding Q, K, and V vectors.
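As a concrete illustration, here is a minimal NumPy sketch of these three projections. The matrix sizes and random weights are illustrative assumptions, not values from the course; in a trained model, W_Q, W_K, and W_V are learned parameters.

```python
import numpy as np

rng = np.random.default_rng(0)
seq_len, d_model, d_k = 4, 8, 8          # illustrative sizes (assumed, not from the course)

X = rng.normal(size=(seq_len, d_model))  # input sequence: one row vector per token

# Three distinct weight matrices (random stand-ins for learned parameters)
W_Q = rng.normal(size=(d_model, d_k))
W_K = rng.normal(size=(d_model, d_k))
W_V = rng.normal(size=(d_model, d_k))

# Each input vector is linearly transformed into its Query, Key, and Value vectors
Q = X @ W_Q
K = X @ W_K
V = X @ W_V

print(Q.shape, K.shape, V.shape)         # (4, 8) (4, 8) (4, 8): one q, k, v per token
```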
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Introduce weight matrices in the transformer
Generation of Query, Key, and Value Vectors in Self-Attention
In a self-attention mechanism, instead of directly comparing the raw input vectors of a sequence, each input vector is first multiplied by three separate, learned parameter matrices. This process creates three distinct representations of the original vector before they are used to calculate attention scores and output values. What is the primary analytical advantage of this approach over simply comparing the original input vectors to each other?
Learn After
Single-Step Generation with a KV Cache
Updating the KV Cache
In a self-attention layer processing an input sequence of two tokens, let the input vector for the first token be x_1 and for the second token be x_2. The layer generates a query vector q_1 (for the first token) and a key vector k_2 (for the second token). Which statement accurately describes the relationship between these inputs and generated vectors?
Correcting a Misconception in Vector Generation
Calculating a Query Vector in Self-Attention
In a standard self-attention mechanism, an input vector is transformed into three separate vectors (Query, Key, and Value) using three distinct, learned weight matrices. Imagine a modified self-attention layer where these three weight matrices are constrained to be identical. What would be the most direct consequence of this change?
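For readers who want to probe this question empirically, here is a minimal NumPy sketch of the constrained variant, with a single shared matrix standing in for all three projections (sizes and random weights are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(4, 8))            # illustrative 4-token input (assumed sizes)
W = rng.normal(size=(8, 8))            # one shared weight matrix

# With W_Q = W_K = W_V = W, all three projections collapse into one representation
Q = K = V = X @ W

scores = Q @ K.T                       # raw attention scores before softmax
print(np.allclose(Q, K))               # True: queries and keys are identical
print(np.allclose(scores, scores.T))   # True: the raw score matrix is symmetric
```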