Learn Before
Concept

Benefits of Batch Normalization

Batch normalization conveys three primary benefits during the training of deep networks: preprocessing, numerical stability, and regularization. First, similar to feature standardization, it puts parameters on a similar scale which is favorable for optimizers. Second, it provides numerical stability by preventing intermediate activations from taking widely varying magnitudes across layers and over time. Finally, the use of noisy estimates for the mean and variance injects noise into the optimization process, which acts as a serendipitous form of regularization that reduces overfitting.

0

2

Updated 2026-05-13

Tags

Data Science

D2L

Dive into Deep Learning @ D2L