Concept
Translation Invariance
The principle of translation invariance in computer vision asserts that a network should recognize objects regardless of their location in an image. When applied to constrain a multi-layer perceptron (MLP), this principle dictates that a shift in the input must lead to an identical shift in the hidden representation . Consequently, the weight tensor and bias cannot depend on the absolute spatial coordinates . Using a constant bias and a shared set of weights , the hidden representation simplifies to:
This weight sharing dramatically reduces the parameter count (e.g., from to for a 1-megapixel image) and effectively forms a convolution.
0
1
Updated 2026-05-09
Tags
D2L
Dive into Deep Learning @ D2L