Learn Before
Calculating a Gated Attention Output
A language model uses a learned gating mechanism to combine outputs from a local context attention (Att_local) and a k-NN retrieved context attention (Att_knn). The combination is performed using a learned gating vector g and the formula: Final_Output = g ⊙ Att_local + (1 - g) ⊙ Att_knn, where ⊙ denotes element-wise multiplication. Given the following vectors, calculate the Final_Output vector.
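As a minimal sketch of the calculation, the snippet below applies the gated combination to hypothetical example vectors (g, att_local, and att_knn are illustrative values, not the vectors from the original exercise):

```python
import numpy as np

# Hypothetical example values (assumptions for illustration only).
g = np.array([0.8, 0.3, 0.6])          # learned gating vector
att_local = np.array([1.0, 2.0, 3.0])  # local context attention output
att_knn = np.array([4.0, 5.0, 6.0])    # k-NN retrieved context attention output

# Element-wise gated combination: Final_Output = g ⊙ Att_local + (1 - g) ⊙ Att_knn
final_output = g * att_local + (1 - g) * att_knn
print(final_output)  # [1.6 4.1 4.2]
```

Each element of g controls how much of the corresponding position comes from the local attention output versus the retrieved one; a gate value of 0.8, for instance, takes 80% of that element from Att_local and 20% from Att_knn.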
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model architecture combines information from two sources: an 'immediate context' output and a 'retrieved knowledge' output. It uses a learned gating vector, g, to dynamically weigh these sources. The final output is calculated using the formula: Output = g ⊙ [immediate_context_output] + (1 - g) ⊙ [retrieved_knowledge_output], where ⊙ is element-wise multiplication. If, during a specific task, the values in the gating vector g are consistently close to 0.0, what does this imply about the model's behavior for that task?
Advantage of a Learned Gating Mechanism
Calculating a Gated Attention Output