Learn Before
Concept
Batch Normalization in Prediction Mode
Batch normalization layers function differently during the prediction (or test) mode compared to the training mode. During training, the normalization is based on the statistics of the current minibatch, which change with every model update. Once the model is fully trained, however, the exact means and variances of each layer's variables are calculated based on the entire dataset. In prediction mode, the model uses these fixed dataset-wide statistics to ensure that the same input is classified consistently, regardless of the batch it resides in.
0
1
Updated 2026-05-13
Contributors are:
Who are from:
Tags
Data Science
D2L
Dive into Deep Learning @ D2L