Learn Before
Concept

Taxonomy of Efficient Transformers

Primary goal of an efficient transformer model is to improve the memory complexity of the self attention mechanism.The different methods or patterns that significantly improves the efficiency can be classified as shown below

  • Fixed Patterns (FP)
    • Blockwise Patterns
      • Strided Patterns
      • Compressed Patterns
  • Combination of Patterns (CP)
  • Learnable Patterns (LP)
  • Neural Memory
  • Low-Rank Methods
  • Kernel
  • Recurrence
  • Downsampling
  • Sparse Models and Conditional Computation

0

1

Updated 2022-10-30

Tags

Data Science