Comparison

CNN Computational Cost versus Parameter Count

Although convolutional neural networks typically contain fewer learnable parameters than comparably deep multi-layer perceptrons, they can still be more computationally expensive to train. The reason is parameter sharing: each convolutional kernel weight participates in many more multiplications because it is applied across every spatial position in the input feature map. As a result, the total number of floating-point operations can exceed that of a similarly structured MLP, even though the CNN stores far fewer unique weights. This makes GPU acceleration especially valuable when training CNNs, since the parallelism of GPUs is well-suited to the large number of repeated multiply–accumulate operations that convolutions require.

0

1

Updated 2026-05-12

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L