Learn Before
Order Preservation of the Softmax Function
The softmax function preserves the relative ordering of its input arguments because the exponential function is strictly increasing. Consequently, the most likely class predicted by the softmax probabilities corresponds exactly to the largest raw output (logit) in \(\mathbf{o}\). This means we can determine the predicted class without actually computing the softmax normalization:

\(\operatorname*{argmax}_j \hat{y}_j = \operatorname*{argmax}_j o_j\)
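The argmax identity above can be checked with a short NumPy sketch. The `softmax` helper is a minimal re-implementation for illustration, not code from the source:

```python
import numpy as np

def softmax(o):
    """Naive softmax over a 1-D vector of raw scores."""
    e = np.exp(o)
    return e / e.sum()

logits = np.array([2.0, 1.0, 0.1])
probs = softmax(logits)

# exp is strictly increasing, so the ranking of the probabilities
# matches the ranking of the raw scores: argmax can skip softmax.
assert np.argmax(probs) == np.argmax(logits)
```

In practice this is why inference code frequently calls `argmax` directly on the logits and computes the softmax only when calibrated probabilities are actually needed.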
Tags
D2L
Dive into Deep Learning @ D2L
Related
Softmax Function Definition
A vector of raw, unnormalized scores
[1000, 1002, 999] is passed as input to a computational function that converts these scores into a probability distribution. A common technique to prevent numerical errors is to first subtract the maximum value of the vector from every element before applying the main transformation (exponentiation). Why is this subtraction step crucial for handling large input values?
Calculating Output Probabilities from Model Scores
A model outputs the following raw, unnormalized scores for three classes:
[2.0, 1.0, 0.1]. If a constant value of 5.0 is added to each of these scores, resulting in a new score vector of [7.0, 6.0, 5.1], how will the resulting probability distribution calculated by the function that converts these scores to probabilities change?
Order Preservation of the Softmax Function
Energy-Based View of Softmax
Output Layer of Softmax Regression
Partition Function in Softmax
Vectorized Minibatch Softmax Regression
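The two related questions about large inputs and constant shifts hinge on the same property: subtracting a constant from every score leaves the softmax output unchanged, so subtracting the maximum makes the computation safe without altering the result. A minimal NumPy sketch (the `stable_softmax` helper is an assumption, not code from the source):

```python
import numpy as np

def stable_softmax(o):
    # Subtracting max(o) avoids overflow: exp(1002) overflows float64,
    # but every exp(o_j - max(o)) lies in (0, 1].
    e = np.exp(o - np.max(o))
    return e / e.sum()

# Large scores like [1000, 1002, 999] stay finite with the shift trick.
p = stable_softmax(np.array([1000.0, 1002.0, 999.0]))

# Adding a constant (here 5.0) to every score does not change the output.
a = stable_softmax(np.array([2.0, 1.0, 0.1]))
b = stable_softmax(np.array([7.0, 6.0, 5.1]))
assert np.allclose(a, b)
```

Note that the shift invariance is exact only for a uniform shift of all scores; shifting a single score changes the distribution.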