Theory
Probit Model
An alternative to the softmax function for modeling categorical probabilities is the probit model, which assumes that the raw outputs are corrupted versions of the true labels . This corruption is modeled by adding noise drawn from a normal distribution. Mathematically, it is expressed as , where . While conceptually appealing for explaining variance, the probit model is generally less effective and leads to a more difficult optimization problem compared to softmax regression.
0
1
Updated 2026-05-03
Tags
D2L
Dive into Deep Learning @ D2L