Concept

Neural Network as a Memory Component

The memory component, Mem\mathrm{Mem}, in an attention mechanism can be implemented as a neural network. This network functions recurrently, where at each step, it updates its state by taking its own previous output (the prior memory state) and the current states of the main model as inputs to generate the new memory output.

Image 0

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences