
Training VGG-11 on Fashion-MNIST

Because the standard VGG-11 architecture is computationally demanding, it is common to build a narrower version with fewer channels when training on simpler datasets such as Fashion-MNIST. Rather than starting with 64 channels, the network can be instantiated with a reduced configuration that starts with 16 channels and doubles the width from block to block up to 128 (i.e., 16, 32, 64, 128, 128 across the five blocks). This reduced-capacity network is still more than sufficient for Fashion-MNIST classification, trains significantly faster, and exhibits only a small amount of overfitting.
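To give a rough sense of how much smaller the reduced network is, the following sketch counts the parameters in the convolutional layers of both configurations. It is plain arithmetic, not training code. Several details are assumptions that the text above does not state: the standard widths after the first block (128, 256, 512, 512), the number of convolutions per block (1, 1, 2, 2, 2), the 3×3 kernels, and the single-channel grayscale input. The names `STANDARD_ARCH`, `REDUCED_ARCH`, and `conv_param_count` are illustrative only.

```python
# Each tuple is (number of 3x3 conv layers, output channels) for one VGG block.
# Assumption: convs per block (1, 1, 2, 2, 2) follow the usual VGG-11 layout.
STANDARD_ARCH = ((1, 64), (1, 128), (2, 256), (2, 512), (2, 512))
REDUCED_ARCH = ((1, 16), (1, 32), (2, 64), (2, 128), (2, 128))


def conv_param_count(arch, in_channels=1, kernel_size=3):
    """Count weights and biases of the convolutional layers only.

    in_channels=1 assumes grayscale input such as Fashion-MNIST images.
    Fully connected layers are deliberately left out of this count.
    """
    total = 0
    for num_convs, out_channels in arch:
        for _ in range(num_convs):
            weights = in_channels * out_channels * kernel_size * kernel_size
            total += weights + out_channels  # one bias per output channel
            in_channels = out_channels  # next conv consumes this block's width
    return total


if __name__ == "__main__":
    standard = conv_param_count(STANDARD_ARCH)
    reduced = conv_param_count(REDUCED_ARCH)
    print(f"standard conv parameters: {standard:,}")
    print(f"reduced conv parameters:  {reduced:,}")
    print(f"ratio: {standard / reduced:.1f}x")
```

Under these assumptions, dividing every block's width by four shrinks the convolutional parameter count by roughly a factor of sixteen, since each layer's weights scale with the product of its input and output channels. The fully connected layers, which this sketch ignores, would shrink as well because the final feature map has fewer channels.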


Updated 2026-05-13


Tags

D2L

Dive into Deep Learning @ D2L