1Cademy - LeNet-5 Convolutional Block

Learn Before

Concept

LeNet-5 Convolutional Block

The convolutional encoder in LeNet-5 consists of two repeated units, each containing three operations: a convolutional layer, a sigmoid activation function, and an average pooling layer. Each convolutional layer uses a $5 imes 5$ kernel. The first convolutional layer produces $6$ output channels, while the second produces $16$ . After each convolution and activation, a $2 imes 2$ average pooling operation with stride $2$ halves both the height and width of the representation, reducing the spatial dimensionality by a factor of $4$ per pooling step. The convolutional block maps spatially arranged inputs to a progressively increasing number of two-dimensional feature maps while decreasing spatial resolution. Its output has shape (batch size, number of channels, height, width).