Learn Before
Linear-Time Models for Transformers
Linear-time models are a family of Transformer modifications and alternative architectures designed to overcome the performance bottleneck caused by the quadratic time complexity of standard self-attention with respect to sequence length. By employing methods whose cost scales linearly with sequence length, they are significantly more efficient than the standard Transformer architecture for processing long sequences.
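As a minimal sketch of why the complexity differs, the snippet below contrasts standard softmax attention, which materializes an n×n score matrix (quadratic in sequence length n), with a kernelized linear-attention variant in the style of Katharopoulos et al. (2020), which regroups the matrix products so that only d×d statistics are formed (linear in n). The feature map `phi` here is an illustrative choice, not the only option used in practice.

```python
import numpy as np

def softmax_attention(Q, K, V):
    # Standard attention: the (n, n) score matrix makes this O(n^2) in sequence length.
    scores = Q @ K.T / np.sqrt(Q.shape[-1])
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V  # shape (n, d)

def linear_attention(Q, K, V, phi=lambda x: np.maximum(x, 0.0) + 1e-6):
    # Kernelized linear attention: replacing softmax with a feature map phi
    # lets us compute phi(K)^T V, a (d, d) summary of all keys and values,
    # once -- cost grows linearly with n, and no (n, n) matrix is ever formed.
    Qp, Kp = phi(Q), phi(K)
    KV = Kp.T @ V                  # (d, d) summary of keys/values
    Z = Qp @ Kp.sum(axis=0)        # (n,) per-query normalizer
    return (Qp @ KV) / Z[:, None]  # shape (n, d)

n, d = 16, 8
rng = np.random.default_rng(0)
Q, K, V = rng.normal(size=(3, n, d))
out = linear_attention(Q, K, V)
print(out.shape)  # (16, 8)
```

Both functions return one output vector per query; the linear variant trades exact softmax weights for a factorization whose cost scales with n rather than n².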
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Sparse Attention Mechanisms
Linear-Time Models for Transformers
A development team is building a text summarization system for lengthy legal documents, often exceeding 10,000 tokens. They observe that their current model, which uses a standard attention mechanism, is prohibitively slow and memory-intensive for these inputs. Which of the following statements best analyzes the underlying computational problem and the reason why adopting an 'efficient attention' variant would be a suitable solution?
Optimizing a Chatbot for Long Conversations
Evaluating Attention Mechanisms for Long-Sequence Processing
Categorization of KV Cache Optimizations
Learn After
A machine learning team is choosing between two text-processing architectures for two different tasks: summarizing short news alerts (avg. 200 words) and analyzing full-length legal contracts (avg. 30,000 words). Architecture X's computation time grows quadratically with the input sequence length. Architecture Y's computation time grows linearly with the input sequence length. Based on these computational scaling properties, which deployment strategy is the most practical and efficient?
Analyzing Model Performance Scaling
A team is building a model for a task involving very short text sequences (under 100 tokens). A model architecture with linear-time complexity with respect to sequence length will always offer a significant computational speed advantage over an architecture with quadratic-time complexity for this specific task.
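The scaling trade-off raised in the items above can be sketched with simple arithmetic. The constant factor `c` below is a hypothetical assumption chosen for illustration: linear-time methods often carry higher per-token overhead, so a quadratic model can still be cheaper on very short sequences, while the linear model wins decisively at long lengths.

```python
def quadratic_cost(n):
    # Cost model for an architecture that scales as n^2 (arbitrary units).
    return n ** 2

def linear_cost(n, c=500):
    # Cost model for an architecture that scales as c * n; the constant
    # c = 500 is an illustrative assumption, not a measured value.
    return c * n

for n in (200, 30_000):
    print(f"n={n}: quadratic={quadratic_cost(n):,} linear={linear_cost(n):,}")
```

With these illustrative constants, the quadratic model is cheaper at n = 200 (40,000 vs 100,000 units) but far more expensive at n = 30,000 (900,000,000 vs 15,000,000 units), which is why linear-time complexity does not *always* imply a speed advantage on short inputs.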