Learn Before
VGG-11 Architecture
The original VGG network is commonly referred to as VGG-11 because it contains a total of eleven layers with learnable weights: eight convolutional layers and three fully connected layers. The convolutional feature extractor is constructed using five VGG blocks in sequence. The first two blocks contain one convolutional layer each, while the final three blocks contain two convolutional layers each. The network employs a strategy where the spatial dimensions are halved after each block while the number of feature channels doubles. Starting with output channels in the first block, the channels progressively double (, , ) until capping at in the final block, before the resulting feature map is flattened and fed into the fully connected dense layers.
0
1
Tags
D2L
Dive into Deep Learning @ D2L