Concept

Learning a Convolution Kernel from Data

Instead of manually designing kernel weights, a convolutional kernel can be learned directly from input-output data pairs. This is done by initializing the kernel as a random tensor within a convolutional layer, computing the layer's output for a given input, and using a loss function—such as squared error—to compare the output with the target. The gradient of this loss is then calculated and used to iteratively update the kernel weights via gradient descent, allowing the model to automatically discover the optimal filter for a given task.

0

1

Updated 2026-05-12

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L