Learn Before
Concept

ResNeXt Block Structure

The ResNeXt block is a residual block design that incorporates grouped convolutions to process information efficiently across independent branches. It consists of a 3×33 \times 3 grouped convolution sandwiched between two 1×11 \times 1 convolutions. The first 1×11 \times 1 convolution reduces the channel dimensionality, while the second 1×11 \times 1 convolution restores it, effectively exchanging information across the gg independent groups. This design minimizes computational overhead: the network only pays an O(cb)\mathcal{O}(c \cdot b) cost for the 1×11 \times 1 kernels and an O(b2/g)\mathcal{O}(b^2 / g) cost for the 3×33 \times 3 grouped kernels, where cc is the number of channels and bb is the number of intermediate bottleneck channels. Similar to standard residual blocks, a 1×11 \times 1 convolution can be applied to the residual connection to match dimensions if needed.

Image 0

0

1

Updated 2026-05-13

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L