Concept
Swin Transformer
Swin Transformers were developed as a general-purpose backbone network for computer vision to address the quadratic computational complexity of standard self-attention with respect to image size. By reinstating convolution-like priors, Swin Transformers extend the applicability of the Transformer architecture beyond basic image classification, achieving state-of-the-art results across a wide range of computer vision tasks.
0
1
Updated 2026-05-15
Tags
D2L
Dive into Deep Learning @ D2L