Concept

Batch Normalization in Prediction Mode

Batch normalization layers function differently during the prediction (or test) mode compared to the training mode. During training, the normalization is based on the statistics of the current minibatch, which change with every model update. Once the model is fully trained, however, the exact means and variances of each layer's variables are calculated based on the entire dataset. In prediction mode, the model uses these fixed dataset-wide statistics to ensure that the same input is classified consistently, regardless of the batch it resides in.

0

1

Updated 2026-05-13

Tags

Data Science

D2L

Dive into Deep Learning @ D2L