Linear Combination of Local and External Attention
When incorporating both a local memory and a retrieved long-term memory into a language model, one architectural approach is to process them in separate attention steps. As exemplified by the model developed by Wu et al. (2021), the outputs from the local attention mechanism and the external k-NN attention mechanism can then be linearly combined to produce the final representation.
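The idea can be sketched as follows. This is a minimal NumPy illustration, not the authors' implementation: the function names are hypothetical, the gate is shown as a single scalar logit (in Wu et al.'s model the gate is a learned per-head parameter), and the k-NN retrieval that produces the external keys and values is assumed to have already happened.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v):
    # Standard scaled dot-product attention for a single head.
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)
    return softmax(scores) @ v

def combined_attention(q, local_k, local_v, mem_k, mem_v, gate_logit):
    # Attention over the local context window.
    local_out = attention(q, local_k, local_v)
    # Attention over (key, value) pairs retrieved from long-term memory
    # (assumed to be the top-k neighbours returned by a k-NN search).
    mem_out = attention(q, mem_k, mem_v)
    # A learned gate g in (0, 1) linearly interpolates the two outputs.
    g = 1.0 / (1.0 + np.exp(-gate_logit))  # sigmoid
    return g * mem_out + (1.0 - g) * local_out

# Toy shapes: 4 query positions, head dimension 16,
# a local context of 8 tokens, and 5 retrieved neighbours.
rng = np.random.default_rng(0)
d = 16
q = rng.normal(size=(4, d))
local_k, local_v = rng.normal(size=(8, d)), rng.normal(size=(8, d))
mem_k, mem_v = rng.normal(size=(5, d)), rng.normal(size=(5, d))

out = combined_attention(q, local_k, local_v, mem_k, mem_v, gate_logit=0.0)
print(out.shape)  # one combined representation per query position
```

Because the combination is a simple interpolation, the model can learn, per head, how much to rely on retrieved memories versus the immediate context: a gate near 0 reduces to ordinary local attention, while a gate near 1 attends almost entirely to the external memory.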