Learn Before
Concept

GoogLeNet Channel Ratios in Inception Blocks

The output channels of each Inception block in GoogLeNet are partitioned across the four parallel branches, and the intermediate dimensionality-reduction ratios vary from block to block. In Module b3b_3, the first Inception block outputs 256 channels (64 + 128 + 32 + 32) in a 2:4:1:1 ratio. The input (192 channels) is reduced by a factor of 12\frac{1}{2} for the second branch (yielding 96 intermediate channels) and by 112\frac{1}{12} for the third branch (yielding 16 intermediate channels). The second Inception block increases to 480 output channels (128 + 192 + 96 + 64) in a 4:6:3:2 ratio, with reduction factors of 12\frac{1}{2} and 18\frac{1}{8} yielding 128 and 32 intermediate channels. Across Modules b3b_3, b4b_4, and b5b_5, the second branch (with the 3×33 \times 3 convolution) consistently produces the largest share of output channels, followed by the first branch (1×11 \times 1), the third branch (5×55 \times 5), and the fourth branch (3×33 \times 3 max-pooling). These ratios are slightly different in each Inception block.

0

1

Updated 2026-06-14

Tags

D2L

Dive into Deep Learning @ D2L