Learn Before
Formula
Layer Normalization
Layer normalization is a technique that standardizes the activations of a deep network by applying the normalization to one observation at a time, rather than across a minibatch. For an -dimensional input vector , the layer normalization operation is defined as: where the scalar mean and the scalar variance are computed across the features of the single observation: A small constant is added to prevent division by zero. Because it operates on a single observation, both the offset and the scaling factor in layer normalization are scalars.
0
1
Updated 2026-05-13
Tags
D2L
Dive into Deep Learning @ D2L