Learn Before
Relation
Transformer models using Strided Patterns
These types of models deal with input sequences at fixed intervals.
Examples of Models using this technique are
Sparse Transformer (Child et al., 2019) and/or Longformer (Beltagy et al., 2020) employ strided or “dilated” windows.
0
1
Updated 2022-10-30
Tags
Data Science