Concept

Applying Dropout After Activation

When implementing dropout in a neural network, the dropout operation is typically applied to the output of each hidden layer immediately following its non-linear activation function. This ensures that the neurons randomly zeroed out are the activated representations of the layer's output.

0

1

Updated 2026-05-07

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L