Learn Before
Concept
Limitations of Direct Linear Regression for Classification
Treating multi-class classification directly as a vector-valued linear regression problem (minimizing the difference between raw outputs and one-hot labels ) is unsatisfactory for probability estimation. First, there is no guarantee that the outputs will sum to , which is required for a valid probability distribution. Second, there is no guarantee that the outputs will be non-negative or bounded by . These limitations render the solution difficult to interpret probabilistically and brittle to outliers.
0
1
Updated 2026-05-03
Tags
D2L
Dive into Deep Learning @ D2L