Learn Before
Concept

Overfitting with a Constant Learning Rate

When an optimization algorithm is run using a constant, un-decayed learning rate, the model often becomes prone to overfitting as training progresses. For example, if a modernized LeNet architecture is trained on Fashion-MNIST with a default constant learning rate of 0.30.3 for 3030 iterations, the training accuracy will continue to rise while the test accuracy stalls after a certain point. The resulting gap between the training and test accuracy curves is a clear visual indicator of overfitting.

Image 0

0

1

Updated 2026-05-18

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L