Relation

Transformer models using Strided Patterns

These types of models deal with input sequences at fixed intervals.

Examples of Models using this technique are

Sparse Transformer (Child et al., 2019) and/or Longformer (Beltagy et al., 2020) employ strided or “dilated” windows.

0

1

Updated 2022-10-30

Tags

Data Science