Learn Before
Concept
Training VGG-11 on Fashion-MNIST
Because the standard VGG-11 architecture is computationally demanding, it is common to construct a modified version with a smaller number of channels when training on simpler datasets like Fashion-MNIST. Rather than starting with channels, the network can be instantiated with a reduced architecture configuration, such as starting with channels and doubling them progressively (e.g., , , , , ). This reduced-capacity network remains more than sufficient for the Fashion-MNIST classification task while significantly accelerating the training process and exhibiting only a small amount of overfitting.
0
1
Updated 2026-05-13
Tags
D2L
Dive into Deep Learning @ D2L