Concept

Memory Efficiency of Recursive Cumulative Average

A key advantage of computing the memory component's cumulative average with a recursive formula is memory efficiency. During inference, the method stores only a single key-value pair, the running average from the previous step, instead of retaining the entire history of key-value pairs. Memory use therefore stays constant as the sequence grows, rather than growing linearly with its length.
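As a rough illustration of the constant-memory update, the sketch below keeps one averaged key and one averaged value and folds each new pair into them. The class name `RunningKVAverage`, its `update` method, and the use of plain Python lists for vectors are assumptions made for this example; the incremental form `mean + (x - mean) / n` is one standard way to write a recursive cumulative average and is not necessarily the exact formula the course uses.

```python
from typing import List, Tuple

Vector = List[float]


class RunningKVAverage:
    """Hypothetical helper: holds only the running mean of keys and values."""

    def __init__(self, dim: int) -> None:
        self.count = 0                        # number of pairs folded in so far
        self.mean_key: Vector = [0.0] * dim   # running average of all keys
        self.mean_value: Vector = [0.0] * dim # running average of all values

    def update(self, key: Vector, value: Vector) -> Tuple[Vector, Vector]:
        """Fold one new (key, value) pair into the stored averages."""
        self.count += 1
        n = self.count
        # Recursive update: new_mean = old_mean + (x - old_mean) / n.
        # Only the previous mean and the step count are needed, not the history.
        self.mean_key = [m + (k - m) / n for m, k in zip(self.mean_key, key)]
        self.mean_value = [m + (v - m) / n for m, v in zip(self.mean_value, value)]
        return self.mean_key, self.mean_value


if __name__ == "__main__":
    memory = RunningKVAverage(dim=2)
    pairs = [
        ([1.0, 2.0], [0.0, 0.0]),
        ([3.0, 4.0], [3.0, 3.0]),
        ([5.0, 6.0], [6.0, 6.0]),
    ]
    for key, value in pairs:
        memory.update(key, value)
    print(memory.mean_key, memory.mean_value)
```

However many pairs are processed, the object holds two vectors and a counter, whereas averaging from scratch at each step would require keeping every earlier pair.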


Updated 2026-04-22


Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences