Regularization Constant
The regularization constant, , is a nonnegative hyperparameter that characterizes the trade-off between standard prediction loss and a regularization penalty. It is typically fit using validation data. When , the original loss function is recovered, while larger values of constrain the model's weights more considerably. For example, in weight decay ( regularization), it modifies the objective to , where the penalty term is conventionally divided by 2 so that the constant cancels out gracefully when taking the derivative.
0
1
Contributors are:
Who are from:
Tags
Data Science
D2L
Dive into Deep Learning @ D2L
Related
Depth and Width for Neural Networks
Dropout
Neural Network Learning Rate
Epochs in Machine Learning
Activation Functions in Neural Networks
Deep Learning Optimizer Algorithms
Deep Learning Weight Initialization
Hyperparameters Tuning Methods in Deep Learning
Difference between Model Parameter and Model Hyperparameter
Regularization Constant
Batch Normalization
Frobenius and L2
Ridge Regression
What is weight decay?
Gaussian (Normal) Distribution
Regularization Constant
Insufficiency of Weight Decay for Preventing Interpolation
Lasso Regression
Laplace Distribution
Regularization Constant