Learn Before
Receptive Field and Parameter Efficiency of Stacked Convolutions
Stacking multiple small convolutions provides the same receptive field as a single larger convolution while improving parameter efficiency and increasing depth. For example, applying two successive convolutions touches the same input pixels as a single convolution. However, while a single convolution requires parameters (where is the number of channels), three successive convolutions use a comparable number of parameters () while providing an even larger receptive field. This analysis demonstrated that deep and narrow networks significantly outperform their shallow and wide counterparts, establishing stacked convolutions as a gold standard in deep learning architecture design.
0
1
Tags
D2L
Dive into Deep Learning @ D2L