Learn Before
Concept
Dropout Behavior at Extreme Probabilities
When evaluating a dropout layer implementation, its behavior varies predictably at boundary probabilities. Applying a dropout operation with a probability of leaves the input completely unchanged, as no elements are dropped. Using a dropout probability of drops approximately half of the elements and rescales the surviving ones by a factor of . Conversely, applying a dropout probability of drops all elements entirely, resulting in a tensor of zeros.
0
1
Updated 2026-05-07
Tags
D2L
Dive into Deep Learning @ D2L