Learn Before
Classification

Classification of Sparse Attention Models by Definition of GG

Sparse attention models can be fundamentally distinguished by the method they use to define the set of attended-to indices, GG. The primary classification is based on whether GG is determined by token positions (Positional-based) or by token content (Content-based).

0

1

Updated 2026-04-22

Tags

Data Science

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related