Learn Before
Concept
Overfitting with a Constant Learning Rate
When an optimization algorithm is run using a constant, un-decayed learning rate, the model often becomes prone to overfitting as training progresses. For example, if a modernized LeNet architecture is trained on Fashion-MNIST with a default constant learning rate of for iterations, the training accuracy will continue to rise while the test accuracy stalls after a certain point. The resulting gap between the training and test accuracy curves is a clear visual indicator of overfitting.
0
1
Updated 2026-05-18
Tags
D2L
Dive into Deep Learning @ D2L