Concept

Regularization Constant

The regularization constant, λ\lambda, is a nonnegative hyperparameter that characterizes the trade-off between standard prediction loss and a regularization penalty. It is typically fit using validation data. When λ=0\lambda = 0, the original loss function is recovered, while larger values of λ\lambda constrain the model's weights more considerably. For example, in weight decay (2\ell_2 regularization), it modifies the objective to L(w,b)+λ2w2L(\mathbf{w}, b) + \frac{\lambda}{2} \|\mathbf{w}\|^2, where the penalty term is conventionally divided by 2 so that the constant cancels out gracefully when taking the derivative.

0

1

Updated 2026-06-14

Tags

Data Science

D2L

Dive into Deep Learning @ D2L