Concept

Compression of Key-Value Pairs for Compressive Memory

During the update process in the Compressive Transformer, the $n_s$ key-value pairs that are popped from the primary memory ($\mathrm{Mem}$) are not discarded. Instead, a compression network reduces these $n_s$ key-value pairs to a smaller set of $\frac{n_s}{c}$ key-value pairs, where $c$ is the compression rate, before they are added to the compressive memory ($\mathrm{CMem}$).
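The update step above can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions: the compression network is replaced here by mean pooling over groups of $c$ consecutive pairs, which is one simple choice of compression function and not necessarily the one used in a given model. The function names `compress_mean_pool` and `pop_and_compress` are hypothetical, and the sketch assumes the oldest pairs sit at the front of each array and that $n_s$ is divisible by $c$.

```python
import numpy as np


def compress_mean_pool(keys, values, c):
    """Compress n_s key-value pairs into n_s / c pairs.

    Assumption: mean pooling over groups of c consecutive pairs stands in
    for the learned compression network.
    """
    n_s = keys.shape[0]
    if values.shape[0] != n_s:
        raise ValueError("keys and values must have the same number of rows")
    if n_s % c != 0:
        raise ValueError("n_s must be divisible by the compression rate c")
    # Group rows into blocks of c and average each block.
    comp_keys = keys.reshape(n_s // c, c, -1).mean(axis=1)
    comp_values = values.reshape(n_s // c, c, -1).mean(axis=1)
    return comp_keys, comp_values


def pop_and_compress(mem_k, mem_v, cmem_k, cmem_v, n_s, c):
    """Pop the n_s oldest pairs from Mem, compress them, append to CMem."""
    # Assumption: the oldest entries are stored at the front of Mem.
    old_k, old_v = mem_k[:n_s], mem_v[:n_s]
    mem_k, mem_v = mem_k[n_s:], mem_v[n_s:]
    comp_k, comp_v = compress_mean_pool(old_k, old_v, c)
    cmem_k = np.concatenate([cmem_k, comp_k], axis=0)
    cmem_v = np.concatenate([cmem_v, comp_v], axis=0)
    return mem_k, mem_v, cmem_k, cmem_v


if __name__ == "__main__":
    d = 2
    mem_k = np.arange(12, dtype=float).reshape(6, d)
    mem_v = np.ones((6, d))
    cmem_k = np.zeros((0, d))
    cmem_v = np.zeros((0, d))
    mem_k, mem_v, cmem_k, cmem_v = pop_and_compress(
        mem_k, mem_v, cmem_k, cmem_v, n_s=4, c=2
    )
    print(mem_k.shape, cmem_k.shape)
```

With $n_s = 4$ and $c = 2$, four pairs leave $\mathrm{Mem}$ and two compressed pairs enter $\mathrm{CMem}$. A real implementation would typically also cap the size of $\mathrm{CMem}$; that step is left out here to keep the sketch focused on the compression itself.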

Updated 2026-04-23

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences