Learn Before
Concept

VGG-11 Architecture

The original VGG network is commonly referred to as VGG-11 because it contains a total of eleven layers with learnable weights: eight convolutional layers and three fully connected layers. The convolutional feature extractor is constructed using five VGG blocks in sequence. The first two blocks contain one convolutional layer each, while the final three blocks contain two convolutional layers each. The network employs a strategy where the spatial dimensions are halved after each block while the number of feature channels doubles. Starting with 6464 output channels in the first block, the channels progressively double (128128, 256256, 512512) until capping at 512512 in the final block, before the resulting feature map is flattened and fed into the fully connected dense layers.

0

1

Updated 2026-05-13

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L