Concept

Cumulative Average of Keys and Values for Memory Component

The moving average approach for creating memory summary vectors can be extended to a cumulative average of the keys and values. Instead of averaging over a fixed-size window of recent key-value pairs, this method extends the moving average to include all positions from the beginning of the sequence up to the current position ii. This results in a summary that incorporates the entire history of the sequence at each step.

Image 0

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related