Concept

Segment-based Operation in Compressive Transformer

The Compressive Transformer, like other segment-level recurrence models, processes sequences by dividing them into segments. Each segment consists of a fixed number of consecutive tokens, denoted as nsn_s. The model operates on the key-value pairs corresponding to the tokens of the kk-th segment, which are represented as SkvkS_{\mathrm{kv}}^{k}.

Image 0

0

1

Updated 2026-04-23

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related