Learn Before
ResNet Initial Layers
In the original ResNet architecture designed for ImageNet-scale images, the initial layers consist of a convolutional layer with output channels and a stride of , followed by batch normalization, a ReLU activation, and a max-pooling layer with a stride of . These large receptive fields and aggressive downsampling are appropriate for the pixel inputs common in ImageNet. However, when working with significantly smaller images (e.g., or pixels from datasets like Fashion-MNIST or CIFAR), these initial layers would reduce spatial dimensions too aggressively, leaving insufficient resolution for the subsequent residual blocks to extract meaningful features.
0
1
Tags
D2L
Dive into Deep Learning @ D2L
Related
Recent Variants of ResNets
Advantages of ResNets
Plain vs. ResNets Convolutional Neural Network Architectures
Evaluate ResNet at different depths for ImageNet Classification
Evaluate ResNet models with other state-of-the-art models for ImageNet Classification
Shortcut’s technique for identity mapping
Deep Residual Learning for Image Recognition
Residual Mapping
ResNet Initial Layers
Highway Networks vs. Residual Networks
Influence of Residual Connections on Subsequent Architectures
Adding Layers During Training in Residual Networks
Accelerated Forward Propagation in Residual Networks