Concept
VGG-11 Layer-by-Layer Shape Trace
We can trace how the shape of an input image (e.g., one with a spatial size of 224 × 224) changes as it passes through the VGG-11 network. Each of the five VGG blocks ends with a max-pooling operation that halves the spatial height and width, so the resolution drops from 224 to 112, 56, 28, 14, and finally 7. Meanwhile, the number of channels progressively expands up to 512. The resulting 512 × 7 × 7 feature map is then flattened into a 25,088-dimensional representation before being fed into the fully connected layers.
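The trace can be reproduced with plain shape arithmetic, without building the network. The following is a minimal sketch, not the reference implementation: it assumes a 224 × 224 input, the commonly cited VGG-11 block layout, 3 × 3 convolutions with padding 1, and 2 × 2 max-pooling with stride 2. The names `VGG11_ARCH`, `conv_out`, and `trace_vgg_shapes` are hypothetical helpers introduced only for illustration.

```python
# Assumed VGG-11 block layout as (num_convs, out_channels) pairs.
VGG11_ARCH = ((1, 64), (1, 128), (2, 256), (2, 512), (2, 512))


def conv_out(size, kernel=3, padding=1, stride=1):
    """Output length of a convolution or pooling window along one spatial axis."""
    return (size + 2 * padding - kernel) // stride + 1


def trace_vgg_shapes(height=224, width=224, arch=VGG11_ARCH):
    """Return the (channels, height, width) shape after each VGG block."""
    shapes = []
    for num_convs, out_channels in arch:
        for _ in range(num_convs):
            # A 3x3 convolution with padding 1 preserves height and width.
            height, width = conv_out(height), conv_out(width)
        # A 2x2 max-pooling layer with stride 2 halves height and width.
        height = conv_out(height, kernel=2, padding=0, stride=2)
        width = conv_out(width, kernel=2, padding=0, stride=2)
        shapes.append((out_channels, height, width))
    return shapes


shapes = trace_vgg_shapes()
for i, (c, h, w) in enumerate(shapes, start=1):
    print(f"block {i}: {c} x {h} x {w}")

# Size of the flattened vector that would feed the first fully connected layer.
channels, height, width = shapes[-1]
flattened = channels * height * width
print("flattened features:", flattened)
```

Under these assumptions, the last block yields a 512 × 7 × 7 feature map, which flattens to 25,088 features. Passing a different `height` and `width` to `trace_vgg_shapes` shows how the flattened size, and therefore the first fully connected layer, depends on the input resolution.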
Updated 2026-05-13
Tags
D2L
Dive into Deep Learning @ D2L