Concept

Limitations of Direct Linear Regression for Classification

Treating multi-class classification directly as a vector-valued linear regression problem (minimizing the difference between raw outputs o\mathbf{o} and one-hot labels y\mathbf{y}) is unsatisfactory for probability estimation. First, there is no guarantee that the outputs will sum to 11, which is required for a valid probability distribution. Second, there is no guarantee that the outputs will be non-negative or bounded by 11. These limitations render the solution difficult to interpret probabilistically and brittle to outliers.

0

1

Updated 2026-05-03

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L

Learn After