Concept

Spatial Resolution Limit in CNNs

The standard building block of early Convolutional Neural Networks consists of a convolutional layer with padding to maintain resolution, a nonlinearity (such as a ReLU), and a pooling layer (such as max-pooling) that reduces the spatial resolution. Because max-pooling typically halves the resolution in each block, this approach imposes a strict limit on network depth. Specifically, a network with an input dimension dd can accommodate at most log2d\log_2 d such convolutional layers before the spatial dimensions are entirely exhausted. For example, on ImageNet data, this architectural design restricts the network to a maximum of 88 convolutional layers.

0

1

Updated 2026-05-13

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L

Learn After