Learn Before
Concept
Applying Dropout After Activation
When implementing dropout in a neural network, the dropout operation is typically applied to the output of each hidden layer immediately following its non-linear activation function. This ensures that the neurons randomly zeroed out are the activated representations of the layer's output.
0
1
Updated 2026-05-07
Tags
D2L
Dive into Deep Learning @ D2L