GoogLeNet Channel Ratios in Inception Blocks
The output channels of each Inception block in GoogLeNet are partitioned across the four parallel branches, and the intermediate dimensionality-reduction ratios vary from block to block. In Module , the first Inception block outputs 256 channels (64 + 128 + 32 + 32) in a 2:4:1:1 ratio. The input (192 channels) is reduced by a factor of for the second branch (yielding 96 intermediate channels) and by for the third branch (yielding 16 intermediate channels). The second Inception block increases to 480 output channels (128 + 192 + 96 + 64) in a 4:6:3:2 ratio, with reduction factors of and yielding 128 and 32 intermediate channels. Across Modules , , and , the second branch (with the convolution) consistently produces the largest share of output channels, followed by the first branch (), the third branch (), and the fourth branch ( max-pooling). These ratios are slightly different in each Inception block.
0
1
Tags
D2L
Dive into Deep Learning @ D2L