Concept

Mechanics of Dropout on a Hidden Layer

When dropout is applied to a hidden layer of a neural network, each hidden unit is zeroed out with a specified probability pp. This effectively creates a sub-network containing only a subset of the original neurons. Consequently, the forward calculation of the outputs and the backward calculation of gradients during backpropagation no longer depend on the dropped nodes. This mechanism ensures that the output layer cannot become overly dependent on any single hidden unit.

Image 0

0

1

Updated 2026-05-07

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L