Learn Before
LeNet-5 Convolutional Block
The convolutional encoder in LeNet-5 consists of two repeated units, each containing three operations: a convolutional layer, a sigmoid activation function, and an average pooling layer. Each convolutional layer uses a kernel. The first convolutional layer produces output channels, while the second produces . After each convolution and activation, a average pooling operation with stride halves both the height and width of the representation, reducing the spatial dimensionality by a factor of per pooling step. The convolutional block maps spatially arranged inputs to a progressively increasing number of two-dimensional feature maps while decreasing spatial resolution. Its output has shape (batch size, number of channels, height, width).
Note that while ReLU and max-pooling perform better in modern networks, they had not yet been discovered at the time LeNet was designed.
0
1
Tags
D2L
Dive into Deep Learning @ D2L
Related
LeNet-5 Convolutional Block
LeNet-5 Dense Block
LeNet-5 Layer-by-Layer Shape Trace
Softmax Replacement of Gaussian Activation in LeNet-5
Computational Scale Enabling Architectural Complexity
LeNet-5 Convolutional Neural Network Applications
LeNet-5 Convolutional Block
LeNet-5 Dense Block
LeNet-5 Layer-by-Layer Shape Trace
Modernized LeNet Architecture