Learn Before
CNN Computational Cost versus Parameter Count
Although convolutional neural networks typically contain fewer learnable parameters than comparably deep multi-layer perceptrons, they can still be more computationally expensive to train. The reason is parameter sharing: each convolutional kernel weight participates in many more multiplications because it is applied across every spatial position in the input feature map. As a result, the total number of floating-point operations can exceed that of a similarly structured MLP, even though the CNN stores far fewer unique weights. This makes GPU acceleration especially valuable when training CNNs, since the parallelism of GPUs is well-suited to the large number of repeated multiply–accumulate operations that convolutions require.
0
1
Tags
D2L
Dive into Deep Learning @ D2L
Related
Pros and Cons of CNN Architecture
Fully Connected Layer - Classification
3D Visualization of a Convolution Neural Network
Three classic networks
Convolution Filter (Kernel)
Pooling Layer in Convolutional Deep Learning
Example of a Convolutional Neural Network Architecture
ResNets Convolutional Neural Network
Classic Convolutional Neural Network Architectures for Object Detection in Images
Inception Network (GoogLeNet)
Architecture Design
Convolution and Pooling as an Infinitely Strong Prior
Convolutional Layer
CNN Computational Cost versus Parameter Count
Hierarchical Feature Learning in Vision Networks
Spatial Resolution Limit in CNNs
ParNet Architecture
Network in Network (NiN) Architecture
ShiftNet Architecture
Network Design Spaces