Concept
Learning a Convolution Kernel from Data
Instead of manually designing kernel weights, a convolutional kernel can be learned directly from input-output data pairs. This is done by initializing the kernel as a random tensor within a convolutional layer, computing the layer's output for a given input, and using a loss function—such as squared error—to compare the output with the target. The gradient of this loss is then calculated and used to iteratively update the kernel weights via gradient descent, allowing the model to automatically discover the optimal filter for a given task.
0
1
Updated 2026-05-12
Tags
D2L
Dive into Deep Learning @ D2L