Learn Before
Concept

Effect of Learning Rate Scheduling on Overfitting

Applying a learning rate scheduler to gently decrease the learning rate over the course of training can lead to improved model accuracy and less overfitting compared to using a constant learning rate. While the exact cause is debated, one theoretical explanation suggests that taking smaller step sizes forces the model parameters to remain closer to zero, resulting in a simpler model, although this does not completely explain the phenomenon.

0

1

Updated 2026-05-18

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L