Concept

Basic Transposed Convolution Operation

In contrast to a regular convolution that reduces input elements via a kernel, the transposed convolution broadcasts input elements via the kernel, producing an output that is generally larger than the input. For a basic transposed convolution with a stride of 11 and no padding, an nhimesnwn_h imes n_w input tensor and a khimeskwk_h imes k_w kernel are used to produce an (nh+kh1)imes(nw+kw1)(n_h + k_h - 1) imes (n_w + k_w - 1) output tensor. For each element in the input tensor, it is multiplied by the entire kernel to produce a khimeskwk_h imes k_w tensor. This resulting tensor replaces a corresponding portion of an intermediate tensor initialized with zeros, based on the element's original position. The final output is formed by summing all these intermediate results. For example, if both the input tensor and the kernel are 2imes22 imes 2 matrices with values [[0,1],[2,3]][[0, 1], [2, 3]], the basic transposed convolution yields a 3imes33 imes 3 output tensor with values [[0,0,1],[0,4,6],[4,12,9]][[0, 0, 1], [0, 4, 6], [4, 12, 9]].

Image 0

0

1

Updated 2026-05-21

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L