Learn Before
Classification of Sparse Attention Models by Definition of the Attended-to Indices
Sparse attention models can be fundamentally distinguished by the method they use to define the set of attended-to indices. The primary classification is based on whether this set is determined by token positions (Positional-based) or by token content (Content-based).
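The distinction can be made concrete with two toy mask builders; this is a minimal sketch (the function names, the sliding-window pattern, and the top-k selection rule are illustrative choices, not a specific published method):

```python
import numpy as np

def positional_mask(seq_len, window=2):
    """Positional-based: the attended-to set is fixed by position alone.
    Here, a causal sliding window: each token attends to itself and the
    `window` tokens immediately before it, regardless of content."""
    mask = np.zeros((seq_len, seq_len), dtype=bool)
    for i in range(seq_len):
        mask[i, max(0, i - window):i + 1] = True
    return mask

def content_mask(queries, keys, k=2):
    """Content-based: the attended-to set depends on the input itself.
    Here, each query keeps only its top-k highest-scoring keys among
    the causally visible positions."""
    scores = queries @ keys.T
    seq_len = scores.shape[0]
    mask = np.zeros((seq_len, seq_len), dtype=bool)
    for i in range(seq_len):
        visible = scores[i, :i + 1]          # causal prefix
        topk = np.argsort(visible)[-k:]      # indices of k largest scores
        mask[i, topk] = True
    return mask
```

Note that `positional_mask` never looks at the tokens, while `content_mask` changes whenever the query/key vectors change; that input-dependence is exactly what separates the two classes.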
Tags
Data Science
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
KV Cache Requirement as a Limitation of Sparse Attention
Global Tokens in Attention
Pruning and Compression as a Consequence of Sparse Attention
Comparison of Dense and Sparse Attention Matrices
A causal transformer model processes a sequence of 1024 tokens. In a standard attention mechanism, each token attends to all previous tokens and itself. Consider a 'sparse' variant where a token at position i (for i > 3) only attends to the following positions: the first token (position 1), its own token (position i), and the two immediately preceding tokens (positions i-1 and i-2). For a token at position 500, how many key-value pairs does it attend to in this sparse model?
Computational Bottlenecks in Long-Sequence Processing
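The attended-to set in this variant can be enumerated directly; a minimal sketch (the helper name is illustrative):

```python
def attended_positions(i):
    """Attended-to positions for token i (i > 3) in the sparse variant:
    the first token, the two immediately preceding tokens, and itself."""
    return {1, i - 2, i - 1, i}

# A token at position 500 attends to {1, 498, 499, 500}: 4 key-value
# pairs, versus 500 under full causal attention.
print(len(attended_positions(500)))  # 4
```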
Global Tokens for Attention
Evaluating Architectural Choices for Long-Sequence Models
Selecting an Attention Design for Long-Context, Low-Latency Inference
Diagnosing and Redesigning Attention for a Long-Context, Cost-Constrained LLM Service
Choosing an Attention Stack for a Regulated, Long-Document Review Assistant
Attention Redesign for a Long-Context Customer-Support Copilot Under GPU Memory Pressure
Attention Architecture Choice for On-Device Meeting Summarization with 60k Context
Attention Redesign for a Multi-Tenant LLM with Long Context and Strict KV-Cache Budgets
Sparse Attention Weights Assumption
Classification of Sparse Attention Models by Definition of the Attended-to Indices
Learn After
Content-based Sparse Attention
Positional-based Sparse Attention
Classifying a Novel Sparse Attention Mechanism
An engineer develops a sparse attention mechanism where, for any given token, the set of other tokens it can attend to is defined by a pre-determined, structured pattern based on their relative distance in the sequence. For example, a token might only attend to the 8 tokens immediately preceding it. This attention pattern does not change, regardless of the specific words or meaning of the input text. Based on how the set of attended-to indices is defined, how should this mechanism be classified?
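The defining property of the described mechanism can be checked mechanically: the attended-to set is a pure function of position. A minimal sketch, assuming the example pattern from the question (the function name and 8-token window are taken from that hypothetical):

```python
def window_attended(i, width=8):
    """Pre-determined pattern: token i attends to the `width` tokens
    immediately preceding it. The returned set depends only on the
    position i, never on token content, so a mechanism defined this
    way falls on the positional-based side of the classification."""
    return list(range(max(0, i - width), i))
```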
A key characteristic of content-based sparse attention models is that the set of attended-to indices for a given token is dynamically determined by finding other tokens with the most similar content.