Learn Before
Concept

GoogLeNet Channel Ratios in Inception Blocks

The output channels of each Inception block in GoogLeNet are carefully partitioned across the four parallel branches, and the intermediate dimensionality-reduction ratios vary from block to block.

In Module b3b_3, the first Inception block outputs 64+128+32+32=25664+128+32+32=256 channels in a 2:4:1:12:4:1:1 ratio. The input (192192 channels) is reduced by a factor of rac{1}{2} for the second branch (yielding 96=192/296 = 192/2 intermediate channels) and by rac{1}{12} for the third branch (yielding 16=192/1216 = 192/12 intermediate channels). The second Inception block increases to 128+192+96+64=480128+192+96+64=480 output channels in a 4:6:3:24:6:3:2 ratio, with reduction factors of rac{1}{2} and rac{1}{8} yielding 128128 and 3232 intermediate channels.

Across Modules b3b_3, b4b_4, and b5b_5, the second branch (with the 3imes33 imes 3 convolution) consistently produces the largest share of output channels, followed by the first branch (1imes11 imes 1), the third branch (5imes55 imes 5), and the fourth branch (3imes33 imes 3 max-pooling). These ratios are slightly different in each Inception block.

0

1

Updated 2026-05-13

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L