Learn Before
Concept
Benefits of Batch Normalization
Batch normalization conveys three primary benefits during the training of deep networks: preprocessing, numerical stability, and regularization. First, similar to feature standardization, it puts parameters on a similar scale which is favorable for optimizers. Second, it provides numerical stability by preventing intermediate activations from taking widely varying magnitudes across layers and over time. Finally, the use of noisy estimates for the mean and variance injects noise into the optimization process, which acts as a serendipitous form of regularization that reduces overfitting.
0
2
Updated 2026-05-13
Contributors are:
Who are from:
Tags
Data Science
D2L
Dive into Deep Learning @ D2L