Learn Before
Concept
Constraining MLPs for Images
When applying a multi-layer perceptron (MLP) to two-dimensional images, both the inputs and the hidden representations can be treated as matrices with spatial structure. To allow every hidden unit to receive input from every pixel, the network's parameters are represented as a fourth-order weight tensor and a bias matrix . The fully connected layer is formally expressed as:
where . A single fully connected layer mapping a pixel image to a hidden representation of the same size using this parametrization requires parameters, which is computationally intractable.
0
1
Updated 2026-05-09
Tags
D2L
Dive into Deep Learning @ D2L