Learn Before
Dimensional Analysis of the Attention Formula
An attention mechanism operates on a Query matrix $\textbf{Q}$ with dimensions $10 \times 64$, a Key matrix $\textbf{K}$ with dimensions $20 \times 64$, and a Value matrix $\textbf{V}$ with dimensions $20 \times 128$. Given the general attention formula $Att(\textbf{Q}, \textbf{K}, \textbf{V}) = \alpha(\textbf{Q}, \textbf{K})\textbf{V}$, what will be the dimensions of the final output matrix? Explain the steps to arrive at your answer.
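The dimensional analysis can be checked numerically. The sketch below uses random matrices with the stated shapes; the scaled dot-product scoring and row-wise softmax are one common choice for $\alpha(\textbf{Q}, \textbf{K})$, assumed here for illustration.

```python
import numpy as np

# Matrices with the dimensions stated in the question (random values for illustration)
Q = np.random.rand(10, 64)   # Query: 10 queries, key dimension 64
K = np.random.rand(20, 64)   # Key: 20 keys, key dimension 64
V = np.random.rand(20, 128)  # Value: 20 values, value dimension 128

# alpha(Q, K): score each query against each key, then normalize each row.
# Scaled dot-product with softmax is assumed; any row-normalized scoring
# function gives the same shapes.
scores = Q @ K.T / np.sqrt(K.shape[1])                    # shape (10, 20)
alpha = np.exp(scores) / np.exp(scores).sum(axis=1, keepdims=True)

# Att(Q, K, V) = alpha(Q, K) V
output = alpha @ V
print(output.shape)  # (10, 128): one 128-dimensional output per query
```

The shapes trace the two matrix products: $(10 \times 64)(64 \times 20) \to 10 \times 20$ for the weights, then $(10 \times 20)(20 \times 128) \to 10 \times 128$ for the output.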
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.5 Inference - Foundations of Large Language Models
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Attention Weight Matrix (α)
Sparse Attention
Self-attention layers' first approach
In a general attention mechanism, the output is calculated as a weighted sum of the Value vectors, where the weights are determined by the interaction between the Query and Key vectors. The standard formula is: $Att(\textbf{Q}, \textbf{K}, \textbf{V}) = \alpha(\textbf{Q}, \textbf{K})\textbf{V}$. Consider a scenario where this formula is mistakenly altered to be: . What is the most significant consequence of this modification?
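The "weighted sum of Value vectors" view can be made concrete for a single query. The sketch below uses small illustrative sizes (not from the card) and a softmax over dot-product scores as the assumed weighting function.

```python
import numpy as np

# For one query vector q, the attention output is a weighted sum of the
# value vectors v_j, with weights alpha_j from the query-key interaction.
# Sizes are illustrative assumptions only.
d_k, d_v, n = 4, 6, 3
q = np.random.rand(d_k)        # single query
keys = np.random.rand(n, d_k)  # n key vectors
values = np.random.rand(n, d_v)  # n value vectors

scores = keys @ q                                # one score per key
alpha = np.exp(scores) / np.exp(scores).sum()    # softmax weights, sum to 1
output = sum(a * v for a, v in zip(alpha, values))  # weighted sum of values

# Equivalent to the matrix form alpha @ values
assert np.allclose(output, alpha @ values)
```

Because the weights sum to 1, the output lies in the convex hull of the Value vectors; this is the property that the "weighted sum" formulation guarantees.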
Dimensional Analysis of the Attention Formula
Applying the Attention Mechanism Roles
Self-Attention Output Formula for a Single Query