Learn Before
Concept
GoogLeNet Model Architecture
The GoogLeNet model is constructed from five sequential modules (labeled through ) followed by a fully connected output layer. The overall architecture diagram is shown in Fig. 8.4.2.
- Module (Stem): A convolutional layer with output channels, stride , and padding , followed by ReLU activation and a max-pooling layer (stride , padding ). This module resembles the stems of AlexNet and LeNet.
- Module : A convolution with channels, then a convolution that triples the channels to , each followed by ReLU, concluding with max-pooling (stride , padding ).
- Module : Two Inception blocks producing and output channels respectively, followed by max-pooling.
- Module : Five Inception blocks producing , , , , and output channels respectively, followed by max-pooling.
- Module : Two Inception blocks producing and output channels respectively, followed by global average pooling (reducing each channel to ) and a flatten operation.
Finally, a fully connected layer maps the -dimensional representation to the number of output classes.
0
1
Updated 2026-05-13
Tags
D2L
Dive into Deep Learning @ D2L
Related
1 x 1 Convolution Layer in Neural Networks
(Network ~ Network)Bottleneck Layer in Inception Network
Auxiliary Classifiers in Inception Network
Going deeper with convolutions paper
Effect of Inception
Features of GoogLeNet
Inception Block Structure
GoogLeNet Model Architecture
GoogLeNet Computational Efficiency Trade-off