Transformer models using Fixed Patterns
These models sparsify the attention matrix by limiting each token's field of view to fixed, predefined patterns, such as local windows and block patterns of fixed strides. The main variants are:
- Blockwise Patterns
- Strided Patterns
- Compressed Patterns
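
The three patterns above can be sketched with NumPy as a rough illustration. This is a minimal sketch, not taken from any particular model: the function names (`local_window_mask`, `blockwise_mask`, `strided_mask`, `compress`, `masked_attention`) and their parameters are hypothetical, and mean pooling is only one assumed way to compress keys and values.

```python
import numpy as np

def local_window_mask(n, window):
    """True where query i may attend to key j with |i - j| <= window."""
    idx = np.arange(n)
    return np.abs(idx[:, None] - idx[None, :]) <= window

def blockwise_mask(n, block_size):
    """True where query i and key j fall in the same contiguous block."""
    blocks = np.arange(n) // block_size
    return blocks[:, None] == blocks[None, :]

def strided_mask(n, stride):
    """True where key j sits a multiple of `stride` positions from query i."""
    idx = np.arange(n)
    return (idx[:, None] - idx[None, :]) % stride == 0

def compress(x, factor):
    """Mean-pool every `factor` consecutive rows (assumes len(x) % factor == 0)."""
    n, d = x.shape
    return x.reshape(n // factor, factor, d).mean(axis=1)

def masked_attention(q, k, v, mask):
    """Scaled dot-product attention; positions where mask is False get zero weight."""
    scores = q @ k.T / np.sqrt(q.shape[-1])
    scores = np.where(mask, scores, -np.inf)
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v

# Usage: 6 tokens with 4-dimensional embeddings (random, for illustration only).
rng = np.random.default_rng(0)
x = rng.normal(size=(6, 4))
out_local = masked_attention(x, x, x, local_window_mask(6, window=1))
out_block = masked_attention(x, x, x, blockwise_mask(6, block_size=2))
out_stride = masked_attention(x, x, x, strided_mask(6, stride=3))

# Compressed pattern: queries attend to a pooled, shorter key/value sequence.
kv = compress(x, factor=2)
out_comp = masked_attention(x, kv, kv, np.ones((6, 3), dtype=bool))
```

The sketch still builds the dense n × n score matrix, so it shows only which entries each pattern keeps. Real implementations get their savings by never computing the masked-out entries.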
Updated 2022-10-30
Tags
Data Science
Related
Transformer models using Fixed Patterns
Transformer models using Combination of Patterns (CP)
Transformer models using Learnable Patterns
Transformer models using Neural Memory
Transformer models using Low-Rank Methods
Transformer models using Kernels
Transformer models using Recurrence
Transformer models using Downsampling
Transformer models using Sparse Models and Conditional Computation